Re: RFR JDK-8164278: java.util.Base64.EncOutputStream/DecInputStream is slower than corresponding version in javax.mail package

Roger Riggs Tue, 06 Feb 2018 08:35:50 -0800

Hi Sherman,

On 2/5/2018 9:00 PM, Xueming Shen wrote:

Hi,


Please help review the change for JDK-8164278.

issue: https://bugs.openjdk.java.net/browse/JDK-8164278
webrev: http://cr.openjdk.java.net/~sherman/8164278/webrev

Are the reentrant locks necessary? Concurrent reads from streams are not usually
synchronized, so it's the caller that needs to synchronize.
If locks are necessary, why no lock for the EncOutputStream buffer?

809: Can the buffer byte array be sized based on linemax? The field declaration should
be with the other fields at the top of the file.

848: checkNewline compares == with linemax; that works when each byte is counted separately.
It seems like it would be safer if it was ">=".
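
To illustrate the concern about line 848, here is a minimal sketch; it is not the webrev code. The class LineBreakCheck, the method breaks, and the step parameter are hypothetical names made up for this example. It only shows that an "==" test can be stepped over when the position counter advances by more than one per check, while ">=" still triggers.

```java
// Hypothetical sketch, not taken from the webrev.
final class LineBreakCheck {
    // Returns how many line breaks would be emitted while writing `total`
    // characters when the position counter advances `step` at a time.
    static int breaks(int total, int step, int linemax, boolean useGreaterEqual) {
        int linepos = 0;
        int breaks = 0;
        for (int written = 0; written < total; written += step) {
            linepos += step;
            boolean atLimit = useGreaterEqual ? linepos >= linemax
                                              : linepos == linemax;
            if (atLimit) {
                breaks++;
                linepos = 0;   // start a new line
            }
        }
        return breaks;
    }
}
```

With step 4 and linemax 8 both comparisons behave the same. With step 4 and a linemax of 6 (an assumed value, chosen only because it is not a multiple of the step) the "==" variant never breaks the line.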

957: can sbBuf be shared with lbuf?

More on the input buffering question below.

jmh.src: http://cr.openjdk.java.net/~sherman/8164278/Base64BM.java
jmh.result: http://cr.openjdk.java.net/~sherman/8164278/base64.bm

Base64.Decoder.decode0:
Adopted the "similar" optimization approach we took in Base64.Encoder.encode0() to add a "fast path" that decodes a block of 4-byte units together (the current implementation decodes one single byte per while-loop iteration). The jmh benchmark result indicates a big speed boost (those decodeArray/Mime/Url results, from 30% to 2 times faster, depending on input size).

:)
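
For readers without the webrev at hand, the fast-path idea can be sketched roughly as below. This is an assumption-laden illustration, not the proposed decode0(): the class FastPathDecodeSketch and its table are hypothetical, only the basic alphabet is handled, and the tail (padding or a short final unit) is simply handed to the existing java.util.Base64 decoder to keep the sketch short.

```java
import java.util.Arrays;
import java.util.Base64;

// Hypothetical sketch of a "fast path" decoder; not the webrev code.
final class FastPathDecodeSketch {
    private static final int[] TABLE = new int[256];
    static {
        Arrays.fill(TABLE, -1);   // -1 marks padding and illegal characters
        String alphabet =
            "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";
        for (int i = 0; i < alphabet.length(); i++) {
            TABLE[alphabet.charAt(i)] = i;
        }
    }

    static byte[] decode(byte[] src) {
        byte[] dst = new byte[(src.length / 4 + 1) * 3];   // upper bound
        int sp = 0, dp = 0;
        // Fast path: decode whole 4-char units into 3 bytes each.
        while (sp + 4 <= src.length) {
            int b1 = TABLE[src[sp] & 0xff];
            int b2 = TABLE[src[sp + 1] & 0xff];
            int b3 = TABLE[src[sp + 2] & 0xff];
            int b4 = TABLE[src[sp + 3] & 0xff];
            if ((b1 | b2 | b3 | b4) < 0) {
                break;   // padding or illegal char: leave it to the slow path
            }
            int bits = b1 << 18 | b2 << 12 | b3 << 6 | b4;
            dst[dp++] = (byte) (bits >> 16);
            dst[dp++] = (byte) (bits >> 8);
            dst[dp++] = (byte) bits;
            sp += 4;
        }
        // Slow path: delegate whatever is left to the JDK decoder.
        if (sp < src.length) {
            byte[] tail = Base64.getDecoder()
                                .decode(Arrays.copyOfRange(src, sp, src.length));
            System.arraycopy(tail, 0, dst, dp, tail.length);
            dp += tail.length;
        }
        return Arrays.copyOf(dst, dp);
    }
}
```

The single combined sign test on the four table lookups is what keeps the inner loop free of per-character branching in this sketch.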

Base64.Encoder.encode0()
It appears encode0() was fully optimized in 1.8. Can't get it faster :-) Tried to use Unsafe.getLong/putLong instead of byte-by-byte access, but it appears the 8-byte "vectorization" does not bring us enough speedup; the performance is the same as the current one. See encode00() at
    http://cr.openjdk.java.net/~sherman/8164278/webrev.00

Base64.Encoder.wrap(OutputStream)/EncOutputStream.write():
If my memory serves me right, the current implementation was written under the assumption that the underlying output stream is probably buffered (if the invoker cares), so it would be redundant for EncOutputStream to buffer bytes again. It appears this is a wrong assumption. It is much slower to write 4 bytes separately than to bundle them together in a byte[4] and write that into the underlying stream, even when the underlying output stream is a ByteArrayOutputStream. Again, the proposed change is to add a fast path loop, as we do in encode0(), to encode/write a block of 3-byte->4-byte units. It appears this fast loop can help the compiler optimize away some bounds checks, and is therefore much faster.
The jmh result Base64BM.encodeOS suggests the new implementation is almost 4 times faster and is almost the same as java.mail's stream encoder.
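
A rough sketch of the "bundle and write" idea follows. It is not the EncOutputStream code from the webrev: the class BlockWriteSketch, the 1024-byte buffer size, and the helper names are assumptions for illustration, and the final partial unit is delegated to the existing JDK encoder so that padding does not have to be spelled out here.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.Base64;

// Hypothetical sketch; names and buffer size are not from the webrev.
final class BlockWriteSketch {
    private static final byte[] ALPHABET =
        "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"
            .getBytes(StandardCharsets.US_ASCII);

    // Encodes src into out, writing encoded output in blocks rather than
    // one byte per write() call.
    static void encodeTo(OutputStream out, byte[] src) throws IOException {
        byte[] buf = new byte[1024];          // assumed size, a multiple of 4
        int bp = 0;
        int sp = 0;
        int fullEnd = src.length / 3 * 3;     // end of the last complete 3-byte unit
        while (sp < fullEnd) {
            int bits = (src[sp++] & 0xff) << 16
                     | (src[sp++] & 0xff) << 8
                     | (src[sp++] & 0xff);
            buf[bp++] = ALPHABET[(bits >>> 18) & 0x3f];
            buf[bp++] = ALPHABET[(bits >>> 12) & 0x3f];
            buf[bp++] = ALPHABET[(bits >>> 6) & 0x3f];
            buf[bp++] = ALPHABET[bits & 0x3f];
            if (bp == buf.length) {           // flush a full block in one call
                out.write(buf, 0, bp);
                bp = 0;
            }
        }
        if (bp > 0) {
            out.write(buf, 0, bp);
        }
        if (sp < src.length) {                // 1 or 2 leftover bytes, with padding
            out.write(Base64.getEncoder()
                            .encode(Arrays.copyOfRange(src, sp, src.length)));
        }
    }

    // Convenience wrapper for trying the sketch against an in-memory stream.
    static byte[] encode(byte[] src) {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try {
            encodeTo(bos, src);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bos.toByteArray();
    }
}
```

Whether a larger block or a plain byte[4] per unit performs better is a question for the jmh benchmark; the sketch makes no claim about that.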

Base64.Decoder.wrap(InputStream)/DecInputStream.read():
Same as the approach we take for decode0(): add a fast path that decodes a block of 4-byte units together.
The jmh result Base64BM.decodeOS (the name probably should be decodeIS, for InputStream, but anyway...) shows the proposed one is 4 times faster than the existing impl and twice as fast as the java.mail (Base64BM.decodeOS_javamail) implementation.
However, there is a side effect of adding a buffering mechanism into DecInputStream. The current implementation reads bytes from the underlying stream one by one; it never reads more bytes than it needs, which means it is supposed to stop at the last byte that it needs to decode when there is "=" present in the stream.
With buffering, it's possible that more bytes (after the "=", which indicates "end of base64 stream") might be read/consumed and buffered. A concern? If this is indeed a concern, the only alternative might be to add a separate method to support this "faster-buffered-decoder"?

How much buffering is needed to speed it up? Can the mark/reset functions of the underlying
stream be used to back up the stream if it overshoots?
If mark and reset are not supported, then read 1 byte at a time.
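
Roughly what I have in mind, as a sketch only. The class MarkResetSketch and the methods readBlock and probe are hypothetical, and it assumes the block starts on a 4-char unit boundary, the buffer length is a multiple of 4, and there are no line separators. It uses InputStream.readNBytes(byte[], int, int), which needs Java 9 or later.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Hypothetical sketch; not part of the webrev.
final class MarkResetSketch {
    // Fills buf with base64 chars and returns how many belong to the base64
    // data. The stream is left positioned right after the 4-char unit that
    // contains the first '=' (or after the last byte read if there is none).
    static int readBlock(InputStream in, byte[] buf) throws IOException {
        if (in.markSupported()) {
            in.mark(buf.length);
            int n = in.readNBytes(buf, 0, buf.length);   // block read
            for (int i = 0; i < n; i++) {
                if (buf[i] == '=') {
                    int end = Math.min(n, (i / 4 + 1) * 4);
                    if (end < n) {                        // overshoot: back up
                        in.reset();
                        in.readNBytes(buf, 0, end);       // re-consume only the data
                    }
                    return end;
                }
            }
            return n;
        }
        // No mark/reset: read one byte at a time and never overshoot.
        int n = 0;
        boolean pad = false;
        while (n < buf.length) {
            int b = in.read();
            if (b < 0) break;
            buf[n++] = (byte) b;
            pad |= (b == '=');
            if (pad && n % 4 == 0) break;                 // end of the padded unit
        }
        return n;
    }

    // Convenience for trying the sketch: returns {bytes consumed as base64,
    // next byte left in the stream (-1 at end of stream)}.
    static int[] probe(byte[] data, boolean allowMark) {
        InputStream in = new ByteArrayInputStream(data) {
            @Override public boolean markSupported() { return allowMark; }
        };
        try {
            int n = readBlock(in, new byte[16]);
            return new int[] { n, in.read() };
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

In both modes the bytes following the padded unit stay in the underlying stream, which is the property the current one-byte-at-a-time implementation provides.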

Regards, Roger


Thanks,
Sherman
