Re: ===, =:=, ~~, eq and == revisited (blame ajs!)

Darren Duncan Thu, 13 Jul 2006 00:56:03 -0700

At 7:25 PM +0300 7/12/06, Yuval Kogman wrote:

Over at #perl6 we had a short discussion on =:=, ===, and ~~, mostly raised by
ajs's discussion on Str items and ===.

<snip>

Coincidentally, I raised almost the same questions there a weekearlier, and had a brief discussion with audreyt about it, though theanswers that came out of it seemed rather different than what was inthis thread so far, so I will share them. See the following url:


http://colabti.de/irclogger/irclogger_log/perl6?date=2006-07-06,Thu&sel=376#l599

I will also quote the text as it was short, snipping out unrelated parts:

[ 11:29pm ] dduncan : slight change of topic, but I was wondering how.id works with non-trivial types

[ 11:29pm ] dduncan : eg, what does the .id of a Pair look like?

[ 11:29pm ] dduncan : I know that to users it shouldn't matter, butto people implementing composite types, it does

[ 11:30pm ] audreyt : dduncan: one possibility - could be just itself.

[ 11:33pm ] dduncan : one key thing I'm wondering about .id forimmutable types is ... are they supposed to generate some neutralvalue like an integer, two of which can then be comparedindependently of the type definition, or will they contain referencesto the actual object all the time and that the object's class stillneeds to declare a === method which is invoked as needed?[ 11:33pm ] dduncan : if it is the latter, I imagine thatimplementation will be simpler, at a possible cost of performance ifthe same comparison is done a lot

[ 11:34pm ] audreyt : dduncan: the latter
[ 11:34pm ] dduncan : okay, that answers my question

So, in the general case, it would seem best if the binary operator=== was just an ordinary method that each class provides, rather thanrequiring classes to defined a .id. Or in addition to this to helpwith performance, a .id can exist anyway that optionally returns anappropriate hash of an object.

A default === would be defined in Object, which returns the sameresult as =:= returns; two objects are equivalent iff they are thesame container. A default .id defined in Object would simply returnthe same object it was invoked on.

Built-in immutable types, like Str and Int and Pair and Seq, wouldoverride that === such that they return true iff the two operands arecontainers of the same class and the two containers both holdappearances of the same (universally distinct) value. This isdetermined by doing a deep comparison of the values themselves, as isappropriate. (Internally to the type's implementation, adomain-appropriate hash of the value could optionally be generated atan appropriate time and be used to speed up === operations, withappropriate action taken if it isn't guaranteed that multipledistinct values won't become identical hash values.) The .id couldbe overridden to return a simple number or string or binary forsimpler types, and return the object itself otherwise.

Built-in mutable types, like Array or Hash, would not override theObject-defined ===, which is equivalent to =:=, nor the built-in .id,which returns the object itself. This is reasonable in practicebecause the contents of those containers could be changed at anytime, especially if the containers are aliased to multiple variablesthat are outside of the testing code's control. The only thing thatcan be guaranteed to be constant over time is that whether or not anobject is itself, as determined by =:=. By contrast, if === were todo a deep copy with mutable types, the results could not be trustedto be repeatable because the moment after === returns, thecontainer's value may have changed again, so actions done based onthe === return value would be invalid if they assumed the value tostill be the same at that time, such as if the mutable type was usedas a hash key and was to be retrievable by its value.

User defined types can choose on their own whether to override ===and/or .id or not, and they would use their own knowledge of theirinternal structures to do an appropriate deep comparison. There isno need to try to generate some kind of unique numerical .id forarbitrarily complex objects.

One thing that can't be overridden is that === can only return trueiff both operands are of the same class. This includes undef, aseach class has its own undef that is distinct from those of otherclasses.

So if this is the way that things worked, then it would be very easyto implement it for any kind of type. And it would be very reliableto use any type as a hash key.

Note that, while the fact may be determinable by some other means, itmay be useful to have an explicit meta-method for all types that sayswhether the type is immutable or mutable. A user defined type sayingthat it is immutable is making a promise to the compiler that itsobjects won't change after they are created.

As for being able to tersely do deep comparisons of mutable types, Idon't think that === is appropriate and that something else should beused instead, something that isn't invoked when working with hashkeys.


I may have forgotten to raise something else, but there's that for now.

-- Darren Duncan

Re: ===, =:=, ~~, eq and == revisited (blame ajs!)

Reply via email to