Re: RFR JDK-6321472: Add CRC-32C API

Staffan Friberg Fri, 17 Oct 2014 09:48:34 -0700

On 10/17/2014 01:46 AM, Peter Levart wrote:

On 10/17/2014 03:42 AM, Staffan Friberg wrote:
Hi,
This RFE adds a CRC-32C class. It implements Checksum so it will havethe same API CRC-32, but use a different polynomial when calculatingthe CRC checksum.
CRC-32C implementation uses slicing-by-8 to achieve high performancewhen calculating the CRC value.
A part from adding the new class, java.util.zip.CRC32C, I have alsoadded two default methods to Checksum. These are methods that wereadded to Adler32 and CRC32 in JDK 8 but before default methods wereadded, which was why they were only added to the implementors and notthe interface.
Bug: https://bugs.openjdk.java.net/browse/JDK-6321472
Webrev: http://cr.openjdk.java.net/~sfriberg/JDK-6321472/webrev.00
I have started a CCC request for the changes, but was asked to getfeedback from the core libs group before finalizing the request incase there are any API or Javadoc changes suggested.
Thanks,
Staffan
Hi Staffan,
I can see CRC32C.reflect(int) method reverses the bits in 32 bit intvalue. You could use Integer.reverse(int) instead.
The CRC32C.swap32(int) method is (almost) exactly the same asInteger.reverseBytes(int) and equivalent.
I wonder if handling ByteBuffer could be simplified. You couldleverage it's own byte order manipulation by temporarily setting (andresetting afterwards) ByteBuffer.order() and then useByteBuffer.getInt() to extract 32 bits at a time for your algorithm.This could get you the optimal variant of algorithm for both kinds ofbuffers (direct or byte[] based). Perhaps even the byte[] basedvariant of algorithm could be implemented by wrapping the array withByteBuffer, passing it to common private method, and relying on theescape analysis of Hotspot to allocate the HeapByteBuffer wrapperobject on stack.
Regards, Peter

Hi Peter,

Thanks for reviewing.

I have switched to the Integer methods. Was looking through that API butI was too stuck with the reflect and swap names so I missed the reversemethods... :)

As Vitaly noted in his email the wrapped case runs much slower. Goingthrough the generated code it looks like the getInt method actually readfour bytes and then builds and int from them, unless we have someintrinsic replacing that code.


Bits.java
    static int getIntL(long a) {
        return makeInt(_get(a + 3),
                       _get(a + 2),
                       _get(a + 1),
                       _get(a    ));
    }

    static private int makeInt(byte b3, byte b2, byte b1, byte b0) {
        return (((b3       ) << 24) |
                ((b2 & 0xff) << 16) |
                ((b1 & 0xff) <<  8) |
                ((b0 & 0xff)      ));
    }

It looks like the same holds true for DirectByteBuffers unless you areon x86 which supports unaligned reads. So I think aligning and usingUnsafe is the best option here for performance.


DirectByteBuffer.java
    private int getInt(long a) {
        if (unaligned) {
            int x = unsafe.getInt(a);
            return (nativeByteOrder ? x : Bits.swap(x));
        }
        return Bits.getInt(a, bigEndian);
    }

Bits.java
    static boolean unaligned() {
        if (unalignedKnown)
            return unaligned;
        String arch = AccessController.doPrivileged(
            new sun.security.action.GetPropertyAction("os.arch"));
        unaligned = arch.equals("i386") || arch.equals("x86")
            || arch.equals("amd64") || arch.equals("x86_64");
        unalignedKnown = true;
        return unaligned;
    }

Regards,
Staffan

Re: RFR JDK-6321472: Add CRC-32C API

Reply via email to