Re: [PHP-DEV] [Pre-RFC Discussion] User Defined Operator Overloads (again)

Rowan Tommins [IMSoP] Tue, 17 Sep 2024 12:26:54 -0700

On 17/09/2024 18:15, Jordan LeDoux wrote:

    1. Are we over-riding *operators* or *operations*? That is, is the
    user saying "this is what happens when you put a + symbol between two
    Foo objects", or "this is what happens when you add two Foo objects
    together"?

If we allow developers to define arbitrary code which is executed as a result of an operator, we will always end up allowing the first one.

I don't think that's really true. Take the behaviour of comparisons in your previous RFC: if that RFC had been accepted, the user would have had no way to make $a < $b and $a > $b have different behaviour, because the same overload would be called, with the same parameters, in both cases.
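To make that concrete, here is a minimal sketch in Python rather than PHP, and not the RFC's actual API. The `compare` method and the `less_than` / `greater_than` helpers are hypothetical stand-ins for what the engine would do: both operators are derived from one overload, called with the same arguments, so user code cannot make them disagree.

```python
class Version:
    def __init__(self, major, minor):
        self.major = major
        self.minor = minor

    def compare(self, other):
        # The single user-defined hook (hypothetical name):
        # negative, zero, or positive, like a spaceship operator.
        mine = (self.major, self.minor)
        theirs = (other.major, other.minor)
        return (mine > theirs) - (mine < theirs)


def less_than(a, b):
    # Stand-in for the engine evaluating "a < b".
    return a.compare(b) < 0


def greater_than(a, b):
    # Stand-in for the engine evaluating "a > b": same call, same arguments.
    return a.compare(b) > 0
```

Because the user only ever writes `compare`, there is no place to hang code that runs for "<" but not for ">".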

Slightly less strict is requiring groups of operators: the Haskell "Num" typeclass (roughly similar to an interface) requires definitions for all of "+", "*", "abs", "signum", "fromInteger", and either unary or binary "-". It also defines the type signatures for each. If this was the only way to overload the "+" operator, users would have to really go out of their way to use it to mean something unrelated to addition.
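A rough Python analogue of that "group of operators" idea, as a sketch only: the `Num` base class below is hypothetical and merely borrows Haskell's names (with `absolute` standing in for "abs"). The "+" operator is wired to `add`, and `add` can only be supplied together with the rest of the group.

```python
from abc import ABC, abstractmethod


class Num(ABC):
    """Hypothetical interface: '+' only comes as part of the whole group."""

    @abstractmethod
    def add(self, other): ...

    @abstractmethod
    def mul(self, other): ...

    @abstractmethod
    def negate(self): ...

    @abstractmethod
    def absolute(self): ...

    @abstractmethod
    def signum(self): ...

    @classmethod
    @abstractmethod
    def from_integer(cls, n): ...

    # The operators are wired to the group's methods and nothing else.
    def __add__(self, other):
        return self.add(other)

    def __mul__(self, other):
        return self.mul(other)

    def __neg__(self):
        return self.negate()


class Whole(Num):
    """Implements the full group, so it may use the operators."""

    def __init__(self, n):
        self.n = n

    def add(self, other):
        return Whole(self.n + other.n)

    def mul(self, other):
        return Whole(self.n * other.n)

    def negate(self):
        return Whole(-self.n)

    def absolute(self):
        return Whole(abs(self.n))

    def signum(self):
        return Whole((self.n > 0) - (self.n < 0))

    @classmethod
    def from_integer(cls, n):
        return cls(n)


class OnlyPlus(Num):
    """Tries to borrow '+' for something else, without the rest of the group."""

    def add(self, other):
        return "concatenated"
```

`Whole(3) + Whole(4)` works, while instantiating `OnlyPlus` raises a `TypeError` because the rest of the group is missing. Nothing stops a determined user from writing nonsense implementations of all six, but it takes deliberate effort.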

As it happens, Haskell *does* allow arbitrary operator overloads, and in fact goes to the other extreme and allows entirely new operators to be invented. The same is true in PostgreSQL - you can implement the <<//-^+^-//>> operator if you want to.

I think it's absolutely possible - and desirable - to choose a philosophical position on that spectrum, and use it to drive design decisions. The choice of "__add" vs "operator+" is one such decision.

The approach I plan to use for this question has a name: Polymorphic Handler Resolution. The overload that is executed will be decided by the following series of decisions:

1. Are both of the operands objects? If not, use the overload on the one that is. (NOTE: if neither are objects, the new code will be bypassed entirely, so I do not need to handle this case)
2. If they are both objects, are they both instances of the same class? If they are, use the overload of the one on the left.
3. If they are not objects of the same class, is one of them a direct descendant of the other? If so, use the overload of the descendant.
4. If neither of them are direct descendants of the other, use the overload of the object on the left. Does it produce a type error because it does not accept objects of the type in the other position? Return the error and abort instead of re-trying by using the overload on the right.
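The four steps above can be sketched as a small function. This is an illustration in Python, not the proposed engine code: `Overloadable` is a hypothetical marker standing in for "a PHP object", `resolve` is a made-up name, and "direct descendant" is read here as "instance of a subclass", which is an assumption about the intended meaning.

```python
class Overloadable:
    """Hypothetical marker: stands in for 'this operand is an object'."""


def resolve(left, right):
    """Return the operand whose overload would run, or None for built-in behaviour."""
    left_is_obj = isinstance(left, Overloadable)
    right_is_obj = isinstance(right, Overloadable)

    # Step 1: if only one operand is an object, use that one.
    if not (left_is_obj and right_is_obj):
        if left_is_obj:
            return left
        if right_is_obj:
            return right
        return None  # neither is an object: the new code is bypassed

    # Step 2: same class, so the left operand wins.
    if type(left) is type(right):
        return left

    # Step 3: one class descends from the other, so the descendant wins.
    if isinstance(right, type(left)):
        return right
    if isinstance(left, type(right)):
        return left

    # Step 4: unrelated classes, so the left operand wins. If its overload
    # then raises a type error, that error propagates; there is no retry.
    return left


class Money(Overloadable):
    pass


class Euro(Money):
    pass


class Vector(Overloadable):
    pass
```

With these classes, `resolve(Money(), Euro())` and `resolve(Euro(), Money())` both pick the `Euro` operand, while `resolve(Money(), Vector())` and `resolve(Vector(), Money())` pick different operands, which is the order dependence introduced by step 4.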

This is option (g) in my list, with the additional "prefer sub-classes" rule (step 3), which I agree would be a good addition.

As noted, it doesn't provide symmetry, because step 4 depends on the order in the source code. Option (c) is the same algorithm without step 4, so guarantees that $a + $b and $b + $a will always call the same method.

Options (d), (e), and (f) each add an extra step: one operand can signal "I don't know" and the other operand gets a chance to answer. They're essentially ways to "partially implement" an operator.
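Python's own binary operator protocol is one existing example of this kind of "I don't know" signal, so it makes a convenient illustration; the sketch does not claim to match option (d), (e), or (f) exactly. The `Metres` and `Feet` classes are invented for the example. `Metres.__add__` returns `NotImplemented` for types it does not recognise, and Python then gives the right-hand operand a chance via `__radd__`.

```python
class Metres:
    def __init__(self, value):
        self.value = value

    def __add__(self, other):
        if isinstance(other, Metres):
            return Metres(self.value + other.value)
        # "I don't know": let the other operand try.
        return NotImplemented


class Feet:
    def __init__(self, value):
        self.value = value

    def __radd__(self, other):
        # Called only after the left operand has declined.
        if isinstance(other, Metres):
            return Metres(other.value + self.value * 0.3048)
        return NotImplemented


total = Metres(1) + Feet(10)  # handled by Feet.__radd__
```

If both sides decline, as in `Metres(1) + 5`, Python raises a `TypeError`, so each class only has to implement the combinations it actually understands.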

Options (a) and (b) perform the same kind of polymorphic resolution on *both* operands, which is how many languages work for functions and/or methods already.

Reading the C# spec, if there is more than one candidate overload which is equally specific, an error is raised. I guess you could do the same even with one implementation per class, by replacing step 4 in your algorithm:

> 4. If neither of them are direct descendants of the other, and only one implements the operator, use it.
> 5. If neither of them are direct descendants of the other, and both implement the operator, throw an error.
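As a sketch of just those two replacement steps, again in Python with invented names (`op_add` as the overload hook, `AmbiguousOverload` as the error, `resolve_unrelated` as the function), and assuming the earlier steps have already established that the two classes are unrelated:

```python
class AmbiguousOverload(TypeError):
    """Hypothetical error for two equally valid candidate overloads."""


def implements_add(value):
    # Hypothetical check for "this object implements the operator".
    return callable(getattr(value, "op_add", None))


def resolve_unrelated(left, right):
    """Replacement steps 4 and 5, for operands of unrelated classes."""
    candidates = [x for x in (left, right) if implements_add(x)]
    if len(candidates) == 1:
        return candidates[0]  # step 4: only one implements it, so use it
    if len(candidates) == 2:
        raise AmbiguousOverload("both operands implement the operator")  # step 5
    return None  # neither implements it


class Matrix:
    def op_add(self, other):
        return self


class Polynomial:
    def op_add(self, other):
        return self


class Plain:
    pass
```

In this sketch the outcome no longer depends on which operand is written first: `Matrix() + Plain()` and `Plain() + Matrix()` would both use the `Matrix` overload, and mixing `Matrix` with `Polynomial` is an error in either order.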


Let's call that option (h) :)

By the way, searching online for the phrase "Polymorphic Handler Resolution" finds no results other than you saying it is the name for this algorithm.

This is similar to what I originally designed, and I actually moved to an enum based on feedback. The argument was that something like `$isReversed` or `$left` or so on is somewhat ambiguous, while the enum makes it extremely explicit.

Ah, fair enough. Explicitness vs conciseness is always a trade-off. My thinking was that the "reversed" form would be far more rarely called than the "normal" form; but that depends a lot on which resolution algorithm is used.
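For comparison, here is what the enum form might look like, sketched in Python with hypothetical names (`OperandPosition`, `op_sub`); neither is taken from the RFC. Subtraction is used because the operand order changes the result.

```python
from enum import Enum


class OperandPosition(Enum):
    # Which side of the operator the object receiving the call is on.
    LEFT = "left"
    RIGHT = "right"


class Money:
    def __init__(self, cents):
        self.cents = cents

    def op_sub(self, other, position):
        # Hypothetical overload hook for "-", with `other` a plain integer.
        if position is OperandPosition.LEFT:
            return Money(self.cents - other)  # money - 200
        return Money(other - self.cents)      # 200 - money
```

A call site reads as `money.op_sub(200, OperandPosition.RIGHT)`, where a boolean flag would leave the reader to remember what `True` means.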



Regards,

--
Rowan Tommins
[IMSoP]
