[records] equals / hashCode (was: Records -- current status)

Brian Goetz Fri, 13 Apr 2018 10:17:55 -0700

Along the lines of the previous mail, people have and will ask "whycan't I redefine equals/hashCode". And the answer has two layers:

- The constraints on equals/hashCode are stronger for records, andusers might inadvertently violate them. (They can be specified in theoverrides of equals/hashCode in AbstractRecord, so there at least can bea place where this specification lives, even if no one reads it.) - In conjunction with ancillary fields, the constraints are sure to beviolated, whether inadvertently and deliberately.

Let's take a look at what sorts of modifications to equals/hashCodewould be OK, should we decide to relax this restriction. Equalityshould still derive from the record's state, but there might beacceptable variations.

Would it be OK to _widen_ the definition of equality, by ignoring acomponent of the record?

This is an example of what Gunnar asked for, which is to restrictequality to the primary key fields:


    record PersonEntity(int primaryKey, String name, int age) {
        // equality based only on primaryKey
    }

Is this OK?  Well, let's look at our model:
 - Does ctor(dtor(c)) == c?  Yes.
 - if S1==S2, does ctor(S1) == ctor(S2)?  Yes.

- For equal instances, does mutating them in the same way yield equalinstances? Yes. - For equal instances, does calling the same method on both with thesame parameters yield equivalent results? No.

So, if p1 == p2, we cannot rely on p1.age() == p2.age(), so this failsthe requirements of our pseudo-formal model. (Assuming our model is theright one.)

So, how would we feel about that? Two records that are equals() to eachother, but not substitable?

A more subtle version of this would be to consider all components, butuse a more inclusive notion of equality for that field, such ascomparing array components by contents.


    record Numbers(int[] numbers) {
        // equality based on Arrays.equals()
    }

 - Does ctor(dtor(c)) == c?  Yes.
 - Do equal state vectors produce equal records?  Yes.
 - Do identical mutations on equal records produce equal records? Yes.

- Does identical operations on equal records produce equal results? Almost...


The Almost qualification can be seen here:
    int[] a1;
    int[] a2 = copyOf(a1);
    Numbers r1 = new Numbers(a1), r2 = new Numbers(a2);
    boolean same = a1.numbers().equals(a2.numbers())

The accessor will yield up the array references, which will not beequals() to each other. This is essentially the same problem as above.

You get a similar result if your record represents something like arational number and you don't normalize to lowest terms in theconstructor; then you can have q1 equal q2, but q1.numerator() !=q1.numerator().

Are any of these variations compelling enough to suggest we've got thewrong model?






On 3/16/2018 2:55 PM, Brian Goetz wrote:

There are a number of potentially open details on the design forrecords. My inclination is to start with the simplest thing thatpreserves the flexibility and expectations we want, and consideropening up later as necessary.
One of the biggest issues, which Kevin raised as a must-address issue,is having sufficient support for precondition validation. Withoutforeclosing on the ability to do more later with declarative guards, Ithink the recent construction proposal meets the requirement forlightweight enforcement with minimal or no duplication. I'm hopefulthat this bit is "there".
Our goal all along has been to define records as being “just macros”for a finer-grained set of features. Some of these are motivated byboilerplate; some are motivated by semantics (coupling semantics ofAPI elements to state.) In general, records will get there first, andthen ordinary classes will get the more general feature, but thedefault answer for "can you relax records, so I can use it in thiscase that almost but doesn't quite fit" should be "no, but there willprobably be a feature coming that makes that class simpler, wait forthat."
Some other open issues (please see my writeup athttp://cr.openjdk.java.net/~briangoetz/amber/datum.html forreference), and my current thoughts on these, are outlined below.Comments welcome!
- Extension. The proposal outlines a notion of abstract record,which provides a "width subtyped" hierarchy. Some have questionedwhether this carries its weight, especially given how Scala doesn'tsupport case-to-case extension (some see this as a bug, others as anexistence proof.) Records can implement interfaces.
- Concrete records are final. Relaxing this adds complexity to theequality story; I'm not seeing good reasons to do so.
- Additional constructors. I don't see any reason why additionalconstructors are problematic, especially if they are constrained todelegate to the default constructor (which in turn is made far simplerif there can be statements ahead of the this() call.) Users may findthe lack of additional constructors to be an arbitrary limitation (andthey'd probably be right.)
 - Static fields.  Static fields seem harmless.
- Additional instance fields. These are a much bigger concern. Whilethe primary arguments against them are of the "slippery slope"variety, I still have deep misgivings about supporting unrestrictednon-principal instance fields, and I also haven't found a reasonableset of restrictions that makes this less risky. I'd like to keeplooking for a better story here, before just caving on this, as Iworry doing so will end up biting us in the back.
- Mutability and accessibility. I'd like to propose an odd choicehere, which is: fields are final and package (protected for abstractrecords) by default, but finality can be explicitly opted out of(non-final) and accessibility can be explicitly widened (public).
- Accessors. Perhaps the most controversial aspect is that recordsare inherently transparent to read; if something wants to trulyencapsulate state, it's not a record. Records will eventually havepattern deconstructors, which will expose their state, so we should goout of the gate with the equivalent. The obvious choice is to exposeread accessors automatically. (These will not be named getXxx; we arenot burning the ill-advised Javabean naming conventions into thelanguage, no matter how much people think it already is.) The obviousnaming choice for these accessors is fieldName(). No provision forwrite accessors; that's bring-your-own.
- Core methods. Records will get equals, hashCode, and toString. There's a good argument for making equals/hashCode final (so theycan't be explicitly redeclared); this gives us stronger preservationof the data invariants that allow us to safely and mechanicallysnapshot / serialize / marshal (we'd definitely want this if we everallowed additional instance fields.) No reason to suppress overrideof toString, though. Records could be safely made cloneable() withautomatic support too (like arrays), but not clear if this is worth it(its darn useful for arrays, though.) I think the auto-generatedgetters should be final too; this leaves arrays as second-classcomponents, but I am not sure that bothers me.

[records] equals / hashCode (was: Records -- current status)

Reply via email to