[ https://issues.apache.org/jira/browse/HADOOP-6148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12731631#action_12731631 ]
Todd Lipcon commented on HADOOP-6148:
-------------------------------------
Thanks for the improvements, Nicholas. I should have a chance to verify your
findings today or tomorrow.
At first glance, it seems odd that java.util.zip.CRC32 is faster than
PureJavaCrc32. Are you using the Sun JDK or OpenJDK? In my tests with large
block sizes, Sun's implementation is slower than PureJavaCrc32, but OpenJDK's
is faster.
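For anyone re-running the comparison: both java.util.zip.CRC32 and
PureJavaCrc32 implement java.util.zip.Checksum, so one harness can time
either. This is only a minimal sketch, not the attached
TestCrc32Performance.java; the class name Crc32Bench and the 64 MB buffer
size are placeholders I picked for illustration:
{code}
import java.util.Random;
import java.util.zip.CRC32;
import java.util.zip.Checksum;

public class Crc32Bench {
  public static void main(String[] args) {
    byte[] data = new byte[64 * 1024 * 1024]; // 64 MB test buffer (arbitrary)
    new Random(42).nextBytes(data);

    time("java.util.zip.CRC32", new CRC32(), data);
    // PureJavaCrc32 implements Checksum too, so the same harness applies
    // once the class is on the classpath:
    // time("PureJavaCrc32", new PureJavaCrc32(), data);
  }

  static void time(String name, Checksum sum, byte[] data) {
    long start = System.nanoTime();
    sum.update(data, 0, data.length);
    long elapsed = System.nanoTime() - start;
    System.out.printf("%s: crc=%x, %.1f MB/s%n",
        name, sum.getValue(), data.length / 1e6 / (elapsed / 1e9));
  }
}
{code}
Running the same jar under the Sun JDK and under OpenJDK should make it easy
to check which side of the discrepancy above a given setup falls on.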
> Implement a pure Java CRC32 calculator
> --------------------------------------
>
> Key: HADOOP-6148
> URL: https://issues.apache.org/jira/browse/HADOOP-6148
> Project: Hadoop Common
> Issue Type: Improvement
> Reporter: Owen O'Malley
> Assignee: Todd Lipcon
> Attachments: benchmarks20090714.txt, benchmarks20090715.txt,
> crc32-results.txt, hadoop-5598-evil.txt, hadoop-5598-hybrid.txt,
> hadoop-5598.txt, hadoop-5598.txt, hdfs-297.txt, PureJavaCrc32.java,
> PureJavaCrc32.java, PureJavaCrc32.java, PureJavaCrc32.java,
> TestCrc32Performance.java, TestCrc32Performance.java,
> TestCrc32Performance.java, TestPureJavaCrc32.java
>
>
> We've seen a reducer writing 200MB to HDFS with replication = 1 spending a
> long time in CRC calculation. In particular, it spent 5 seconds on CRC
> calculation out of a total of 6 for the write. I suspect that the Java-JNI
> boundary is what is causing us grief.