Feature #6373
closedpublic #self
Description
This was recently suggested to me as an extension:
class Object
# An identity method that provides access to an object's 'self'.
#
# Example:
# [1,2,3,4,5,1,2,2,3].group_by(&:identity)
# #=> {1=>[1, 1], 2=>[2, 2, 2], 3=>[3, 3], 4=>[4], 5=>[5]}
#
def identity
self
end
end
First, is such a method commonly useful enough to warrant existence?
Second, it makes me wonder if #self should be a public method in general.
Files
Updated by mame (Yusuke Endoh) over 12 years ago
- Status changed from Open to Assigned
- Assignee set to matz (Yukihiro Matsumoto)
Updated by duerst (Martin Dürst) over 12 years ago
#identity
(or whatever it's called) is quite important in functional languages. It's handy to pass to another function that e.g. uses it as an argument to map,... So I think it's a good idea to add it.
The name should be identity or id (Haskell) or some such.
Making self public sounds impressive, but first, it doesn't expose anything new (it's the object itself, which is already accessible) an second, I feel it's the wrong name, because #idenity
essentially only makes sense in locations in a program where the actual object isn't directly around.
Updated by headius (Charles Nutter) over 12 years ago
At first I found this laughable, but then I realized there's no clear method you can call against an object that simply returns the object. It is a small thing, but turns out to be very useful.
For example, if you want a chain of method calls that successively transformation some data. Those transformations, in the form of folds or filters or what have you, may accept an object and return some new object by calling a method. In that case, the simplest transformation is to simply return the object unmodified. There is no such method on Object or BasicObject right now, so rather than having the uniformity of a method call you would need a special filter form that calls nothing and returns its one argument. Identity is an easy, simple functional form that should be available.
I would suggest it should be called self or similar, since it is a key core method nobody should ever override. But I do think there is utility in having it always available.
Updated by headius (Charles Nutter) over 12 years ago
Actually, I just realized there is already a method you can call to just return the method itself, albeit in a really gross way: #tap
.
obj.tap{} #returns obj; block is required and invoked
Updated by prijutme4ty (Ilya Vorontsov) over 12 years ago
enum.map(&:identity)
can be replaced with enum.to_a
But I think Object#identity
is useful. Object#tap
requires a block. It's not an option to use it in many cases. I can't imagine how to replace enum.group_by(&:identity)
with tap
instead of identity
.
I think that tap method can be improved to be used without block yielding self. But for readability it'd be however aliased to identity
.
__self__
looks like a method every programmer'd avoid.
Updated by matz (Yukihiro Matsumoto) over 12 years ago
__id__
returns object_id
number, identity here is supposed to return itself.
I agree with introducing method to return self, but not fully satisfied with the name 'identity
'.
Any opinion?
Matz.
Updated by trans (Thomas Sawyer) over 12 years ago
Public #self seems like the most obvious choice. Is there some reason not to use it?
Updated by regularfry (Alex Young) over 12 years ago
On 28/04/2012 16:10, matz (Yukihiro Matsumoto) wrote:
Issue #6373 has been updated by matz (Yukihiro Matsumoto).
__id__
returnsobject_id
number, identity here is supposed to return itself.
I agree with introducing method to return self, but not fully satisfied with the name 'identity
'.
Any opinion?
"itself
"?
--
Alex
Updated by pabloh (Pablo Herrero) over 12 years ago
What about if we borrow #yourself message name from Smalltalk?
Updated by alexeymuranov (Alexey Muranov) over 12 years ago
Thomas, i think an argument against public #self is that 'self' is a reserved word, which moreover is used more as an object name than as a method name. So 'self' would need to stop being a keyword and become a public 'predefined' method of BasicObject (it cannot be defined without the 'self' keyword, i guess).
I like #itself or #yourself, but i do not know which one is a proper way to talk to my objects.
Updated by trans (Thomas Sawyer) over 12 years ago
Like many of Ruby's keywords, it can still be used to define a public method:
class X
def self; "x"; end
end
x = X.new
x.self #=> "x"
Updated by shevegen (Robert A. Heiler) over 12 years ago
Perhaps Smalltalk has the best suggestion. :)
Updated by alexeymuranov (Alexey Muranov) over 12 years ago
Another option: #the_self
. The same number of symbols as in #yourself
, but harder to type :(.
Updated by Eregon (Benoit Daloze) over 12 years ago
On 28 April 2012 17:54, Alex Young alex@blackkettle.org wrote:
"
itself
"?
I agree, #itself
is the best to me.
Updated by marcandre (Marc-Andre Lafortune) over 12 years ago
I second the addition of Object#self
.
For the objection that self
is a keyword, so is class
. And there wouldn't ever be a need to call self.self
:-)
Do we need a slide-show for this?
Updated by trans (Thomas Sawyer) over 12 years ago
Does it really need a slide? Does someone at developers meeting want to bring it up? It's such a simple thing. It's probably a one line addition to code to make self
available as public method. Only question is name, which seems to me, why have different public name than private name? Go with #self. But if necessary for public/private names to differ, everyone seems okay with #itself.
Updated by alexeymuranov (Alexey Muranov) over 12 years ago
I've heard that the underscore _
is commonly used for ignored block variables. Maybe this "public self" can be considered as an "ignored method", and called Object#_
?
Updated by trans (Thomas Sawyer) over 12 years ago
_
is used by irb. Also, I don't really see why. Code would look much more "perlish" using _
.
Please, what's wrong with public #self?
Updated by alexeymuranov (Alexey Muranov) over 12 years ago
trans (Thomas Sawyer) wrote:
Please, what's wrong with public #self?
Nothing, just was wondering how to use _
:). I didn't know IRB uses it.
Updated by citizen428 (Michael Kohl) over 12 years ago
FWIW, I'm the one who suggested this method as an addition to Facets, mainly for the reason headius mentions above, it's the simplest filter available. I'm still torn on the name, but for some reason #self didn't seem right. For my own extension library I finally went with #it.
Updated by citizen428 (Michael Kohl) over 12 years ago
i also found a previous issue where this behavior would come in handy: http://bugs.ruby-lang.org/issues/2172
Updated by alexeymuranov (Alexey Muranov) over 12 years ago
How about merging this with feature request #6721 for #yield_self?
Object#self can optionally accept a block, yield self to the block if block given, and return the result of the block. What do you think?
Updated by trans (Thomas Sawyer) over 12 years ago
Strikes me as a very good idea! I forgot about that. In Facets it is called #ergo
. Essentially,
def ergo
return yield(self) if block_given?
self
end
Call it #self
instead and we get two features for the price of none!
Not sure if should take additional *args
or not, but it could.
Updated by Anonymous over 12 years ago
matz (Yukihiro Matsumoto) wrote:
__id__
returnsobject_id
number, identity here is supposed to return itself.
I agree with introducing method to return self, but not fully satisfied with the name 'identity
'.
Any opinion?Matz.
I did some thinking, and there is hardly anything better than 'identity
'.
'identity_function
' would be hypercorrect, and 'id' is commonly understood
as identifier. In my personal library, I use 'ɪ' (small cap Unicode I) for
identity mapping, though...
Updated by alexeymuranov (Alexey Muranov) over 12 years ago
boris_stitnicky (Boris Stitnicky) wrote:
matz (Yukihiro Matsumoto) wrote:
__id__
returnsobject_id
number, identity here is supposed to return itself.
I agree with introducing method to return self, but not fully satisfied with the name 'identity
'.
Any opinion?Matz.
I did some thinking, and there is hardly anything better than '
identity
'.
'identity_function
' would be hypercorrect, and 'id' is commonly understood
as identifier. In my personal library, I use 'ɪ' (small cap Unicode I) for
identity mapping, though...
ID as an identifier or a piece of identification and Id as the identity function are two different meanings, as far as i understand. Also it does not look to me like "method" and "function" have exactly the same semantics.
Updated by Anonymous over 12 years ago
On Fri, Aug 10, 2012 at 4:25 PM, matz (Yukihiro Matsumoto)
m...@ruby-lang.org wrote:
__id__
returnsobject_id
number, identity here is supposed to return itself.
I agree with introducing method to return self, but not fully satisfied with the name 'identity
'.
Any opinion?
How about allowing Object#tap
to take no block, simply returning self?
This syntactic sugar would allow
this.tap
instead of
this.tap {}
If a new method must be added, please for the love of sanity don't
call it id or identity or identifier. This space already chronically
overloaded by frameworks, the need for entities to have publically
visible identity notwithstanding.
Perhaps __object__
, which would feel natural to anyone already
familiar with __id__
? Failing that, Object#object
, although I'd expect
this to break a fair amount of existing code. :-(
Ciao,
Sheldon.
Updated by trans (Thomas Sawyer) over 12 years ago
Why no answer for: "Why not just public #self"? Why add YAMS?
(YAM = Yet Another Method)
Updated by matz (Yukihiro Matsumoto) about 12 years ago
- Status changed from Assigned to Feedback
- Target version changed from 2.0.0 to 2.6
The point is when we see the code like:
[1,2,3,4,5,1,2,2,3].group_by(&:self)
sometimes it would be less intuitive that self refers elements in the array, not self in the scope.
I think this cause YAM syndrome here. We haven't met name consensus yet (as usual), so I postpone to next minor.
Matz.
Updated by laise (Alexey Chernenkov) over 11 years ago
Quote: "I think that tap method can be improved to be used without block yielding self."
+1
It is a VERY usefull feature! Can't understand why #tap still need to be used with block only.
Updated by phluid61 (Matthew Kerwin) over 11 years ago
laise (Alexey Chernenkov) wrote:
Quote: "I think that tap method can be improved to be used without block yielding self."
+1
It is a VERY usefull feature! Can't understand why #tap still need to be used with block only.
Because it's called "tap." Tap doesn't "return self", it taps into an execution flow, extracting an intermediate value for inspection without interrupting the original flow. The analogy is literally tapping a hole in a pipe, to extract liquid samples at various phases in a process. (Also think of tapping a phone line.) Changing its semantics to a straight-up "returns self" method would just make it idiosyncratic, instead of metaphoric.
Matz wrote:
The point is when we see the code like:
[1,2,3,4,5,1,2,2,3].group_by(&:self)
sometimes it would be less intuitive that self refers elements in the array, not self in the scope.
I have trouble imagining such a scenario. I actually think the some_array.group_by(&:self)
example is a strong plus for this feature.
I'm also not entirely convinced it's really YAM, since 'self' is already a word in the language, all we're doing is pushing it from keyword to keyword+method, i.e. making it more easily accessible from outside the instance. Even more so since this use-case is really the only place it will ever show up; if there was another way to to_proc the 'self' keyword to make it easy to pass to a method like group_by I'd be for that as well/instead.
Updated by Anonymous over 11 years ago
trans (SYSTEM ERROR) wrote:
Why no answer for: "Why not just public
#self
"? Why add YAMS?(YAM = Yet Another Method)
Because explicit identity element is a VIP of the functional space.
Summary of the discussion thus far + my opinions follow:
Proposal: | My opinion: | Remark: |
---|---|---|
#self |
+1 | proposed and defended by OP, objected by Matz |
#identity |
+1 | strong objection by Anonymous |
#__id__ |
0 | |
#__self__ |
-1 | |
#yourself |
-1 | |
#itself |
+3 | |
#_ |
-1 | |
#the_self |
-1 | |
#it |
-1 | |
#__object__ |
-1 | |
#id |
-1 | not proposed, Anonymous strongly objects this option |
#tap |
-2 | me and phluid61 strongly object |
I used to favor #identity
, but today, I favor #itself
. I already advertised this
method, and #ergo
method (#6721) on http://stackoverflow.com/questions/16932711 ,
where these features are in demand. To make the naming decision more difficult,
let me append Freudian #ego
to the list, which would go nicely with #ergo
.
Updated by headius (Charles Nutter) over 11 years ago
phluid61 (Matthew Kerwin) wrote:
laise (Alexey Chernenkov) wrote:
It is a VERY usefull feature! Can't understand why #tap still need to be used with block only.
Because it's called "tap." Tap doesn't "return self", it taps into an execution flow, extracting an intermediate value for inspection without interrupting the original flow. The analogy is literally tapping a hole in a pipe, to extract liquid samples at various phases in a process. (Also think of tapping a phone line.) Changing its semantics to a straight-up "returns self" method would just make it idiosyncratic, instead of metaphoric.
I still like #tap
.
- It would only require removal of the block-yielding requirement.
- It's functionally equivalent to tapping into execution flow but doing nothing, as in tap {}. I think the objection that it no longer means we're tapping execution flow is a bit pedantic; we are tapping execution flow, but the only operation we seek is the reference to the object.
My favorite remains #self, but I can appreciate objections, especially confusion over "self" and "obj.self" and especially "self.self" if #self can be overridden. It does have the advantage that there's probably fewer libraries that define their own "self" method than any other suggestion here.
Other systems I know would use #identity. It also maps to functional programming language for a function that just returns its sole argument (in this case, the 0th argument, the object itself). I would support #identity
as well.
Matz wrote:
The point is when we see the code like:
[1,2,3,4,5,1,2,2,3].group_by(&:self)
sometimes it would be less intuitive that self refers elements in the array, not self in the scope.
I have trouble imagining such a scenario. I actually think the
some_array.group_by(&:self)
example is a strong plus for this feature.I'm also not entirely convinced it's really YAM, since 'self' is already a word in the language, all we're doing is pushing it from keyword to keyword+method, i.e. making it more easily accessible from outside the instance. Even more so since this use-case is really the only place it will ever show up; if there was another way to
to_proc
the 'self' keyword to make it easy to pass to a method like group_by I'd be for that as well/instead.
Another option based on matz's objection: #reference. We want a method that returns the reference to the object we're calling against. #reference seems logical.
[1,2,3,4,5,1,2,2,3].group_by(&:reference)
Variations on this might be #self_reference
, #self_ref
, #selfref
. Also #self_object
, #self_obj
, #selfobj
.
#self
and #identity
are probably the most likely to be guessed by a new user.
Updated by headius (Charles Nutter) over 11 years ago
Another argument why "tap
" is fine...
If tap
were defined in a functional style, it would be simply
def tap(obj, &block)
block.call(obj)
obj
end
Anywhere you can pass a function you should be able to pass a no-op function, so tap
could be defined as
def tap(obj, &block)
block = proc{} unless block
block.call(obj)
obj
end
So defining tap
such that it defaults to a no-op function (i.e. does not yield if block not given) seems perfectly valid to me.
Updated by phluid61 (Matthew Kerwin) over 11 years ago
headius (Charles Nutter) wrote:
Another option based on matz's objection:
#reference
. We want a method that returns the reference to the object we're calling against.#reference
seems logical.[1,2,3,4,5,1,2,2,3].group_by(&:reference)
+1. It's sensible (i.e. anyone who knows OOP knows what 'reference' means), there's no overloading of names, and the intention is clear.
So defining tap such that it defaults to a no-op function (i.e. does not yield if block not given) seems perfectly valid to me.
I know I'm throwing a lot of paint at this shed, but while I agree that a default noop #tap
is valid, I still strongly believe it makes ary.group_by(&:tap)
seem like voodoo. I like #reference
a lot.
Updated by Anonymous over 11 years ago
I think out of all the options proposed, 'identity
' is the most readable/quickly understandable.
For example, I think the use of 'identity
' reads very nicely in [1,2,3,4].group_by(&:identity)
Updated by phluid61 (Matthew Kerwin) over 11 years ago
charliesome (Charlie Somerville) wrote:
I think out of all the options proposed, 'identity' is the most readable/quickly understandable.
For example, I think the use of '
identity
' reads very nicely in[1,2,3,4].group_by(&:identity)
Except that #identity
seems to imply the same thing as #__id__
, and "a".__id__
is not necessarily == "a".__id__
, as Matz said earlier.
The advantage of using #reference
is that there's no existing method or concept we're overloading; there are no such things as "reference" objects (or "pointers") in Ruby -- it is understood that all references are automagically dereferenced when operated on -- so returning the references and then comparing them should be understood by most rubyists as effectively the same as comparing the objects directly (whatever "directly" means).
tl;dr:
Defining #identity
that conflicts with #__id__
is confusing, however slightly it might be.
Updated by headius (Charles Nutter) over 11 years ago
phluid61 (Matthew Kerwin) wrote:
charliesome (Charlie Somerville) wrote:
I think out of all the options proposed, '
identity
' is the most readable/quickly understandable.For example, I think the use of '
identity
' reads very nicely in[1,2,3,4].group_by(&:identity)
Except that
#identity
seems to imply the same thing as#__id__
, and"a".__id__
is not necessarily =="a".__id__
, as Matz said earlier.
I have this concern as well. Having "a".__id__
/object_id
and "a".identity
return drastically different things feels like it will just be confusing.
Updated by vjoel (Joel VanderWerf) over 11 years ago
Another argument against #identity
: it is used by several libraries for something completely different. For example, in narray:
>> NMatrix.float(2,2).identity
=> NMatrixfloat2,2:
[ [ 1.0, 0.0 ],
[ 0.0, 1.0 ] ]
It's also used in celluloid-zmq:
s1 = PubSocket.new
s1.identity = "publisher-A"
I vote for "itself
" or "self
", which are unlikely to be defined anywhere with some fundamentally different meaning.
Updated by marcandre (Marc-Andre Lafortune) over 11 years ago
Slide attached.
I hope to win the prize for simplest slide too.
Updated by Anonymous over 11 years ago
marcandre: I think you made a mistake in your slide. It says "Returns the class of obj", but it should say "Returns obj"
Updated by matz (Yukihiro Matsumoto) over 11 years ago
I can accept #itself
. I want to see it isn't conflict with existing methods.
Matz.
Updated by avit (Andrew Vit) almost 11 years ago
Rails ActiveSupport includes a similar method called presence
. There is also a request to add block support to it, for a similar purpose: https://github.com/rails/rails/pull/13416#issuecomment-32636227
Updated by marcandre (Marc-Andre Lafortune) almost 11 years ago
Andrew Vit wrote:
Rails ActiveSupport includes a similar method called
presence
.
Mmm, no, that's quite different. "".presence # => nil
for example.
Updated by fuadksd (Fuad Saud) almost 11 years ago
Wouldn’t such method accepting a block remove the need to have Object#tap
at all? As I understand this method is just a tap that doesn’t need a block.
--
Fuad Saud
Sent with Airmail
Updated by sawa (Tsuyoshi Sawada) almost 11 years ago
I would like to propose receiver
as the method name.
Updated by phluid61 (Matthew Kerwin) almost 11 years ago
On Jan 21, 2014 12:29 AM, "Fuad Saud" fuadksd@gmail.com wrote:
Wouldn’t such method accepting a block remove the need to have
Object#tap
at all? As I understand this method is just a tap that doesn’t need a
block.
That depends on the contract. I was under the impression that #itself
(or
whatever name) in block form would return the value of the block. e.g:
def tap
yield self if block_given?
self
end
def itself
if block_given?
yield self
else
self
end
end
Updated by fuadksd (Fuad Saud) almost 11 years ago
That is interesting behaviour for chaining; not sure if consistent though.
—
Sent from Mailbox for iPhone
Updated by avit (Andrew Vit) almost 11 years ago
Mmm, no, that's quite different.
"".presence # => nil
for example.
Yes, I was aware of the differences from ActiveSupport presence, I just wanted to point out the similar need for chaining "itself
" with a block.
The proposed "itself
" method also looks similar to "Enumerable#map
" but for a single object, effectively:
["ruby"].map {|n| n.upcase }.first
# same as:
"ruby".itself {|n| n.upcase }
But I suppose "map
" as a method name would be out of the question.
Has anyone considered "yield
" as the method name? It also seems to fit well, and would likely not conflict with anything:
class Object
def yield
if block_given?
yield self
else
self
end
end
end
# yield as a noun means "the result": an object's result (its yield) is itself
"ruby".yield #=> "ruby"
# yield as a verb means "give way to" or "produce": the object gives way to the block
"ruby".yield {|s| s.upcase } #=> "RUBY"
Although there might be confusion between yield as a keyword and self.yield as a method, I do like this symmetry:
yield self # calls block with self, returns result
self.yield(&block) # calls block with self, returns result
self.yield # implicit identity, just like: {|obj| obj }
Updated by nobu (Nobuyoshi Nakada) over 10 years ago
- Description updated (diff)
Yukihiro Matsumoto wrote:
I can accept
#itself
. I want to see it isn't conflict with existing methods.
It's hard to imagine other functionality from that name for me.
The best way to tell if it will break something should be implementing it before previews, IMHO.
Updated by duerst (Martin Dürst) over 10 years ago
- Status changed from Feedback to Open
Updated by rafaelfranca (Rafael França) over 10 years ago
We just implemented it on Active Support https://github.com/rails/rails/commit/702ad710b57bef45b081ebf42e6fa70820fdd810
I believe matz already accepted it as #itself
so we are renaming it to #itself
.
Updated by rafaelfranca (Rafael França) over 10 years ago
- File itself.diff itself.diff added
Here the implementation. This is my first patch to Ruby so I'm not sure it is correct. Let me know if I need to change something.
Updated by nobu (Nobuyoshi Nakada) over 10 years ago
- Status changed from Open to Closed
- % Done changed from 0 to 100
Applied in changeset r47028.
object.c: Object#itsef
- object.c (rb_obj_itself): new method Object#itsef. based on the
patch by Rafael França in [ruby-core:64156].
[EXPERIMENTAL] this method may be renamed due to compatibilities.
[ruby-core:44704] [Feature #6373]
Updated by laise (Alexey Chernenkov) over 9 years ago
-1
For me it would be better to accept #47 - it improves good old #tap and introduces a new #itself with block.
Copied here:
def tap
yield self if block_given?
self
end
def itself
if block_given?
yield self
else
self
end
end