Re: Serialzation PREVIOUSLY: RFR: 8229773: Resolve permissions for code source URLs lazily

Peter Firmstone Mon, 19 Aug 2019 21:52:51 -0700

Thanks Sean,

No I hadn't seen it, I've just read it, will probably need to read itagain to appreciate it fully...

It certainly identifies all the issues I'm aware of, as well as beingrespectful of the original implementors (many of whom participated inApache River when Jini was donated to Apache), I came to the sameconclusion with circular object graphs; the benefits don't outweigh thecost.

We also use annotations instead of interfaces,to annotate the class andconstructor, so that overriding classes don't automagically inherit thefunctionality.

At this time, we haven't reimplemented deconstruction, we are usingObjectOutputStream with serializers, which are basically serializationproxy's for existing classes, we have fully reimplementeddeserialization using constructors.

Agree with serial from being independant of the wire protocol, so anyserialization scheme can be used, this is an excellent idea of course.

The constructors / deconstructors have identified that serial form isreally just a parameter list. Developers will want to make defensivecopies of mutable state, just like public api methods.

We did consider constructors with multiple parameters, but decidedagainst it for the following reasons:


  1. We didn't care about parameter order (tuples), or the order in
     which they were serialized / deserialized, we only cared about
     parameter names and types.
  2. For encapsulation we didn't want subclasses having to manage the
     serial form of superclasses, we wanted them to remain as
     independant as possible, so they don't inadvertantly break.
         * For example, a library superclass adds a serial form
           parameter, or changes a type, in its serial form.   The
           child class would have to be aware of the changes in order
           to pass the correct parameters to the correct superclass
           constructor.
         * Different serial version constructors would result in the
           loss of later version superclass state when child classes
           call an earlier version.
  3. We settled on a caller sensitive parameter that is passed to the
     deserialization constructor.
         * Encapsulation: Each class in an inheritance heirarch only
           has access to it's own serial form.
         * The serial form of each class is independant and may evolve
           independantly.
         * Each class in the inheritance heirarchy is responsible for
           checking it's own invariants, including the ability to
           create superclass instances, even if a superclass is
           abstract for checking inter class invariants.
  4. It was less work for the framework to populate a standard
     parameter object, with serial form, the framework didn't need to
     worry about inspecting the constructor signature and determining
     the parameter order.
  5. One constructor could be used for different versions.
  6. We currently use |serialPersistentFields to declare serial form,
     but there is probably a better way of doing this, perhaps a way
     that also documents different serial form versions.|

Regards,

Peter.

On 20/08/2019 7:55 AM, Sean Mullan wrote:

Brian Goetz (copied) has done a lot of thinking in the serializationarea, so I have copied him. Not sure if you have seen it but herecently posted a document about some of his ideas and possible futuredirections for serialization:http://cr.openjdk.java.net/~briangoetz/amber/serialization.html
--Sean

On 8/17/19 10:22 PM, Peter Firmstone wrote:
Thanks Sean,
You've gone to some trouble to answer my question, which demonstratesyou have considered it.
I donate some time to help maintain Apache River, derived from Sun'sJini. Once Jini depended on RMI, today, not so much, it still hassome dependencies on some RMI interfaces, but doesn't utilise JRMPalthough it provides some backward compatibilty enable it.
But my point is, we heavily utilise java Serialization, and have anindependant implementation of a subset of Java Serialization(originating from Apache Harmony). We do this for security as we usean annotated serialization constructor. Serial form is unchanged,we have Serializers for commonly used java library objects, forexample, we have a "PermissionSerializer", but we don't have a"PermissionCollectionSerializer" or "PermissionsSerializer" (forjava.security.Permissions). Incidentally, we have found we do notneed the ability to serialize circular object graphs. Throwable isan object that has a circular object graph, but that circular objectgraph can be linked up after deserialization.
Permission implementing Serializable is probably not too much of athreat, as these objects are effectively immutable after lazyinitialization.
ProtectionDomain calls java.security.Permissions::setReadOnly duringit's construction.
ProtectionDomain::getPermissions returns internaljava.security.Permissions. If this is serialized, then the readOnlyinternal state can be written to as the internal object referencesare accessible from within the stream.
Admitedly, the attacker would already need to have some privilege, tohave access to a ProtectionDomain, so it's a path of privilegeescallation. I'm not talking about gadget attacks anddeserialization of untrusted data, I'm talking about breakingencapsulation.
Even though we are heavily dependant on Java Serialization, we arevery careful when we implement it, and avoid implementing it whenpossible. Hindsight is 20:20, but given we are now seeing some JavaSE backward compatibility breakages, perhaps it might be worthconsidering breaking serialization. I don't mean we need tonecessarily break object serial form, but making the Javaserialization API explicit with subset of existing api features, thatmakes long term maintenace and security less of a burden and removingsupport for Serialization of some objects, where it is seldom used,perhaps using a JEP that requests developers to consider whichlibrary objects actually need to be serializable.
Something we do in our Java Serialization API is require that mutabledeserialized objects are defensively copied during objectconstruction (serial fields are deserialized before an object isconstructed, the deserialized fields are accessible via a parameterpassed in during construction. We have tools that assist developersto check deserialized Java Collections contain the expected objecttypes for example, so during object construction the developer has toreplace the Collection with a new instance and copy the contents tothe new Collection after checking the type of each object containedtherein. Also we don't actually serialize Java Collections, we havestandard serial forms for List, Set and Map, so these serial formsare equal, similar to the List, Set and Map contracts. By doingthis, Collections don't actually need to implement Serializable atall, as a Serializer becomes responsible for their serialization.This also means that all Collections must be accessed by interfaces,rather than implementation classes, so the deserializationconstructor, must defensively copy them into their preferredCollection instance. It's a bit like dependency injection.
I know it would take time, and there would be some pain, but longterm it would save a lot of maintenance developer time.
Regards,

Peter.

On 17/08/2019 12:50 AM, Sean Mullan wrote:
On 8/15/19 8:18 PM, Peter Firmstone wrote:
Hi Roger,

+1 for writeReplace
Personally I'd like to see some security classes break backwardcompatibility and remove support for serialization as it allowssomeone to get references to internal objects, especially sincethese classes are cached by the JVM. Which makesPermissionCollection.setReadOnly() very easy to bypass, by addingpermissions to internal collections once you have a reference to them.
Does anyone have any use cases for serializing these objects?
These objects are easy to re-create by sending or recieving andparsing strings, because they are built from text based policyfiles, and when you do that, you are validating input, so I neverdid fully understand why they were made serializable.
This is briefly explained on page 61 in the "Inside Java 2 PlatformSecurity" book [1]:
"The Permission class implements two interfaces: java.security.Guardand java.io.Serializable. For the latter, the intention is thatPermission objects may be transported to remote machines, such asvia Remote Method Invocation (RMI), and thus a Serializablerepresentation is useful."
The Permission class was introduced in Java SE 1.2 so there weredifferent motivations back then :)
--Sean

[1] https://www.oracle.com/technetwork/java/javaee/index-141918.html

Re: Serialzation PREVIOUSLY: RFR: 8229773: Resolve permissions for code source URLs lazily

Reply via email to