Re: RFR 9: JEP 290: Filter Incoming Serialization Data

Peter Levart Wed, 20 Jul 2016 01:21:12 -0700

Hi Roger,

On first reading, I have the following thoughts:

- The name "ObjectInputFilter" makes me think that it is a function that"filters" the input stream (like a Predicate inStream::filter(Predicate)), but it is in fact a validator thatterminates deserialization on 1st rejection. So perhaps a different nameis in order - ObjectInputValidator ?

- I haven't found in the public javadocs, an explanation of what happenswhen the filter returns ALLOWED, REJECTED or UNDECIDED. Docs just saythat the deserialization is terminated (on UNDECIDED too?) but not withwhat exception (there is some explanation on OIS::filterCheck, but thisis a private method).

- The crux of behavioral docs is on the OIS::setObjectInputFiltermethod. I would expect it to be on the ObjectInputFilter class, but Iunderstand that OIS subclasses might have a different behavior. How dothey behave indeed? For example IIOPInputStream does not use the filter,right?

- I had some trouble to precisely understand the behavior from the docsalone. The following in OIS::setObjectInputFilter:


1174      * @implSpec

1175 * The filter, when {@code non-null}, is invoked during{@linkplain #readObject()}1176 * for each object (regular or class) in the stream includingthe following:

1177      * <ul>

1178 * <li>each object reference previously deserialized fromthe stream,

1179      *     <li>each regular class,

1180 * <li>each interface of a dynamic proxy and the dynamicproxy class itself,1181 * <li>each array is filtered using the array size and thetype of the array,1182 * <li>each object replaced by its class' {@codereadResolve} method1183 * is filtered using the replacement object's class andif it is an array, the length,1184 * <li>and each object replaced by {@linkplain#resolveObject resolveObject}1185 * is filtered using the replacement object's class andif it is an array, the length.

1186      * </ul>
1187      *

1188 * When the {@link ObjectInputFilter#checkInput checkInput}method is invoked

1189      * it is passed the current class, (null if no class),

...does not specify when the passed-in class might be "null". Readingthe implementation, I see it is null when a back reference to previouslydeserialized object is read from stream, but javadocs are not clearabout that.

- I wonder if invoking the filter for each interface of a dynamic proxyis necessary (other properties passed to the filter don't change duringiteration through the interfaces and each interface call-back is not anindicator that an object is about to be read-in next). This is notuniform with other objects where the filter is invoked only once. Why isa dynamic proxy so special? If one wants to check the proxy interfacesin the filter, she can obtain them manually:


if (Proxy.isProxyClass(clazz)) {
    for (Class<?> intf : class.getInterfaces()) {
        ...
    }
}

- The docs might be more clear about when precisely the filter isinvoked (i.e. after the type of the object and possible length of arrayor the back reference has already been read from the stream, but theobject state has not been read yet). This is important to correctlyinterpret the streamBytes parameter. The docs might also be more clearabout when the nRefs is incremented (it says: "for each call". Is itbefore or after the call?).

- What is the purpose of the UNDECIDED return? I suspect it is meant tobe used in some filter implementation that delegates the validation tosome "parent" filter and respects its decision unless it is UNDECIDED inwhich case it decides (or does not) on its own. Should such strategy bementioned in the docs to encourage inter-operable filter implementations?

- The call-back is invoked after the type of the object and possiblearray length is read from stream but before the object's state is read.Suppose that the object that is about to be read is eitherExternalizable object or an object with a readObject() method(s) thatconsume block data from the stream. This block data can be large. Shouldthere be a call-back to "announce" the block data too? (for example,when the 'clazz' is null and the 'size' is 0, the call-back reports aback-reference to a previously read object, but when the 'clazz' is nulland the 'size' > 0, it announces the 'size' bytes of block data. Doesthis make sense?)


That's it for the start. If I notice something else, I'll post again.

Regards, Peter


On 07/19/2016 04:02 PM, Roger Riggs wrote:

Please review the design, implementation, and tests of JEP 290: FilterIncoming Serialization Data[1]
It allows incoming streams of object-serialization data to be filteredin order to improve both security and robustness.
The JEP[1] has more detail on the background and scope.
The core mechanism is a filter interface implemented by serializationclients and set on an |ObjectInputStream|. The filter is called duringthe deserialization process to validate the classes beingdeserialized, the sizes of arrays being created, and metricsdescribing stream length, stream depth, and number of references asthe stream is being decoded.
A process-wide filter can be configured that is applied to everyObjectInputStream.The API of ObjectInputStream can be used to set a custom filter tosupersede or augment the process-wide filter.
Webrev:
http://cr.openjdk.java.net/~rriggs/webrev-serial-filter-jdk9-8155760/

SpecDiff:
http://cr.openjdk.java.net/~rriggs/filter-diffs/overview-summary.html

Javadoc (subset)
http://cr.openjdk.java.net/~rriggs/filter-javadoc/java/io/ObjectInputStream.htmlhttp://cr.openjdk.java.net/~rriggs/filter-javadoc/java/io/ObjectInputFilter.html
Comments appreciated, Roger

[1] JEP 290:   https://bugs.openjdk.java.net/browse/JDK-8154961

Re: RFR 9: JEP 290: Filter Incoming Serialization Data

Reply via email to