Hi Stuart,

I managed to find some time to tackle this bug.

What I found from inspecting the JDK code and experimenting is that it doesn't matter how the private static final long serialVersionUID field is initialized. Whether it is a compile-time constant or its value is computed in <clinit>, the serialization/deserialization infrastructure acts the same. It is using reflection to obtain the field's value and so it can't distinguish between compile-time constant and computed value. So ypur plan to eliminate serialVersionUID value from the stream of "patched" EnumSet.class object falls apart.

I filed a bug [1] and prepared a patch [2] which just adds serialVersionUID to EnumSet and initializes it to the same value that is automatically computed in JDK 8 and before.

I had to revert the existing BogusEnumSet test to basically a version that was actual at the time of JDK 8 release, undoing the edits of byte[] value representing serialized stream that have been performed by the same patches that broke the serialization of EnumSet.class objects. I also added a test that verifies the de-serialization of a stream containing EnumSet.class object, produced on JDK 8.

Although the patched EnumSet now contains explicit serialVersionUID, serializing EnumSet.class with patched JDK 14 does not procude the same stream, because the stream also contains descriptors of EnumSet fields which changed from version in JDK 8. When deserializing EnumSet.class object, only serialVersionUID from stream is checked against local version though, so this doesn't matter.

Considering that this patch breaks serialization between already released 9,10,11,12 versions of JDK and yet to be released versions that would contain the patch, I think the patch should be backported to:
- 13
- 11u (11.0.4?)
(and possibly to other 9+ update versions yet to be released).

This would provide consistent serialization among latest releases of actual versions including JDK 8 and before.

Do you think it is necessary to provide a fall-back system property or just advise people to upgrade to latest update release in case they experience trouble? Upgrading to latest update releases is pretty safe these days and should not be a problem for anyone.

Regards,

Peter

[1] https://bugs.openjdk.java.net/browse/JDK-8227368
[2] http://cr.openjdk.java.net/~plevart/jdk-dev/8227368_EnumSet.class_serialization/


On 6/29/19 2:00 AM, Stuart Marks wrote:
Daniel Fuchs pointed me to a weird thing they had to do with the serialVersionUID in one of the JMX classes:

https://hg.openjdk.java.net/jdk/jdk/file/c59f36ed7b52/src/java.management/share/classes/javax/management/MBeanAttributeInfo.java#l46

Essentially the svuid for this class initialized in a static block, and its value is conditional based on the value of some system property. I don't think using a property is necessary for the EnumSet case. However, it does point out something interesting, which is that if the svuid is not initialized with a compile-time constant, and instead via a static block, the value doesn't appear in serialized-form.html.

Thus, we can backport a change to JDK 11 that changes EnumSet's svuid, without changing the Java SE 11 specification! (This is analogous to changing a method's implementation to behave the way we want to, without changing the method's specification to specify that it behaves that way. On the other hand, this is a really sleazy hack.)

Here's an outline of what we can do:

1) Add EnumSet.serialVersionUID to JDK 13 in the usual way, specifying a constant that's the same as the JDK 8 value. We are after RDP1 but I think this change is well-justified. This change should automatically be propagated to JDK 14.

2) "Backport" the fix to JDK 11, but assign the value in a static initializer block instead of as a constant in a field initializer. Something like this:


diff -r 27d4b8acbe07 src/java.base/share/classes/java/util/EnumSet.java
--- a/src/java.base/share/classes/java/util/EnumSet.java    Thu Jun 13 17:46:57 2019 -0700 +++ b/src/java.base/share/classes/java/util/EnumSet.java    Fri Jun 28 16:42:03 2019 -0700
@@ -76,11 +76,18 @@
  * @since 1.5
  * @see EnumMap
  */
-@SuppressWarnings("serial") // No serialVersionUID due to usage of
-                            // serial proxy pattern
+@SuppressWarnings("serial") // serialVersionUID is not a compile-time constant
 public abstract class EnumSet<E extends Enum<E>> extends AbstractSet<E>
     implements Cloneable, java.io.Serializable
 {
+    // Initialize the serialVersionUID to be compatible with JDK 8.
+    // Do this from a static block to avoid having the value appear
+    // in serialized-form.html. See JDK-xxxxxxx.
+    private static final long serialVersionUID;
+    static {
+        serialVersionUID = 1009687484059888093L;
+    }
+
     /**
      * The class of all the elements of this set.
      */

3) Backport the fix to JDK 12. I'm not sure this is absolutely necessary, since JDK 12 likely has a short lifetime, but if we're doing 11 and 13 it makes sense to do 12 as well (though mainly for completeness).

4) It's unclear whether a similar patch as above needs to be added to JDK 8. Since it already has the right svuid, we could get away without doing anything. However, with backports continuing in the 8u release family, it might be prudent to apply this patch in order to prevent future backports to 8u from inadvertently changing the svuid -- which as you point out did happen in 9 and 10.

5) I don't think we need to patch JDK 1.6, 7, 9, or 10 but the maintainers of those releases can certainly decide to do so.

=====

In any case, I think doing the above will result in consistent EnumSet.serialVersionUID values for JDK 8 LTS, JDK 11 LTS, and current JDK releases, without having to worry about spec changes for any of the past releases.

Let me know how you'd like to proceed. I'm happy to help out with reviewing, filing bugs, CSRs, etc.

s'marks






On 6/27/19 2:57 PM, Stuart Marks wrote:
Arrrrrggggh. Yet another serialization bug.

But yes, this is a bug. Thanks for finding and diagnosing it.

Like Martin, I've often forgotten that classes themselves can be included in a serial stream, as well as instances of those classes. In fact I seem to recall arguing that because EnumSet uses the serialization proxy pattern, instances of it should never appear in a legitimate serial stream. I think that's true. However, I sent on to say that because of this, there is no issue with serialization compatibility, and thus EnumSet didn't need a serialVersionUID. That's incorrect.

I'm uncomfortable with relaxing the serialization spec and mechanism to allow a class in the serial stream to have a different svuid from the one loaded in the running JVM. Offhand I don't know what problems it could cause, but it seems like a fundamental change that would lead to problems at some point.

Also, this is a problem with one class (so far...) and I don't think we should change the whole serialization mechanism to support it.

I'm thus leaning toward your first suggestion of adding a serialVersionUID declaration to EnumSet that matches the value from JDK 8. This would go into the current repo (JDK 14) and likely be backported to JDK 13.

It seems like a no-brainer to backport this to JDK 11 as well; this would provide broad compatibility across JDK 8 LTS, JDK 11 LTS, and current JDK releases. However, changing the svuid is a specification change. More investigation is necessary to figure out what would be involved in doing this.

Meanwhile, it would seem sensible to file a bug and start on a fix for the current release(s). Would you be able to do that?

Again, thanks for finding this.

s'marks

On 6/18/19 7:32 AM, Peter Levart wrote:


On 6/18/19 4:00 PM, Martin Buchholz wrote:
Java Historian says:
I was a reviewer for Effective Java 3rd Edition and EnumSet is the canonical example of the Serialization Proxy pattern, so I tried to make sure the pattern was implemented as perfectly as possible.
8192935: Fix EnumSet's SerializationProxy javadoc
All of us who try to make java serialization work right have a mental model of the many things that might go wrong. Serialization of Class objects has never been part of my own mental model - I've only ever considered instances.

Perhaps the necessity for Class objects representing Serializable classes to agree in sertialVersionUID is a bug in Java serialization implementation? There's no such requirement for Class objects representing non-Serializable classes and I don't see why this requirement is there for Serializable classes. Could this requirement simply be relaxed with no ill consequences?

Regards, Peter



On Tue, Jun 18, 2019 at 5:32 AM Peter Levart <peter.lev...@gmail.com <mailto:peter.lev...@gmail.com>> wrote:

    Hi,

    I recently stumbled on an exception thrown when deserializing stream
    produced on JDK 8 and read with JDK 11. I narrowed the problem
    down to
    serialization/deserialization of a public EnumSet.class object. There     were several changes made to EnumSet in the Mercurial history of jdk
    repo, but I think the following two broke the serialization:

    http://hg.openjdk.java.net/jdk/jdk/rev/d0e8542ef650
    http://hg.openjdk.java.net/jdk/jdk/rev/a7e13065a7a0

    It is interesting to note that before those two changes were made,
    there
    was a chance to fix the problem reported by newly added serial lint
    warnings. Unfortunately they were just silenced:

    http://hg.openjdk.java.net/jdk/jdk/rev/501d8479f798

    +@SuppressWarnings("serial") // No serialVersionUID due to usage of
    +                            // serial proxy pattern

    It is true that serialization of instances of Serializable classes is
    not broken by changes to them when they implement serial proxy
    pattern
    (i.e. writeReplace() method) even if they don't itself declare a
    private
    static final long serialVersionUID field, but this is not true of
    Class
    objects representing those Serializable classes. It is even more
    controversial that serialization of Class objects representing
    non-Serializable classes is never broken (which is understandable as
    they don't have a habit of declaring serialVersionUID fields).

    Both of the above braking changes were made post JDK 8 release, so
    deserialization of JDK 8 (and older) streams is affected in all
    JDK 9 +
    releases or vice versa.

    So, what shall be done. I suggest adding serialVersionUID field to
    EnumSet vith a value that corresponds to JDK 8 serialization
    format and
    later backport this change to JDK 11.

    What do you think?


    Regards, Peter


    PS: ImmutableCollections nested classes also implement serial proxy     pattern and don't declare serialVersionUID fields, but they are not
    public, so it is less chance that Class objects representing them
    could
    be used in serial streams, although it is not impossible. For example:

    objectOutputStream.writeObject(Set.of().getClass());



Reply via email to