[
https://issues.apache.org/jira/browse/HBASE-14738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Purtell updated HBASE-14738:
-----------------------------------
Description:
Profiling 0.98.15 I see 20-30% of CPU time spent in Hadoop's PureJavaCrc32. Not
surprising given previous results described on HBASE-11927. Backport.
There are two issues with the backport:
# The patch on 11927 changes the default CRC type from CRC32 to CRC32C.
Although the changes are backwards compatible -files with either CRC type will
be handled correctly in a transparent manner - we should probably leave the
default alone in 0.98 and advise users on a site configuration change to use
CRC32C if desired, for potential hardware acceleration.
# Need a shim for differences between Hadoop's DataChecksum type.
was:
Profiling 0.98.15 I see 20-30% of CPU time spent in Hadoop's PureJavaCrc32. Not
surprising given previous results described on HBASE-11927. Backport.
There are two issues with the backport:
# The patch on 11927 changes the default CRC type from CRC32 to CRC32C.
Although the changes are backwards compatible -files with either CRC type will
be handled correctly in a transparent manner - we should probably leave the
default alone in 0.98 and advise users on a site configuration change to use
CRC32C if desired, for potential hardware acceleration.
# Need a shim for differences between Hadoop's DataChecksum type.
> Backport HBASE-11927 (Use Native Hadoop Library for HFile checksum) to 0.98
> ---------------------------------------------------------------------------
>
> Key: HBASE-14738
> URL: https://issues.apache.org/jira/browse/HBASE-14738
> Project: HBase
> Issue Type: Task
> Reporter: Andrew Purtell
> Assignee: Andrew Purtell
> Fix For: 0.98.16
>
>
> Profiling 0.98.15 I see 20-30% of CPU time spent in Hadoop's PureJavaCrc32.
> Not surprising given previous results described on HBASE-11927. Backport.
> There are two issues with the backport:
> # The patch on 11927 changes the default CRC type from CRC32 to CRC32C.
> Although the changes are backwards compatible -files with either CRC type
> will be handled correctly in a transparent manner - we should probably leave
> the default alone in 0.98 and advise users on a site configuration change to
> use CRC32C if desired, for potential hardware acceleration.
> # Need a shim for differences between Hadoop's DataChecksum type.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)