[ https://issues.apache.org/jira/browse/HBASE-5074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13201569#comment-13201569 ]
Phabricator commented on HBASE-5074:
------------------------------------
mbautin has commented on the revision "[jira] [HBASE-5074] Support checksums in
HBase block cache".
@dhruba: thanks for the fixes! Here are some more comments (I still have to
go through the last 25% of the new version of the patch).
INLINE COMMENTS
src/main/java/org/apache/hadoop/hbase/io/hfile/FixedFileTrailer.java:119
Please address this comment. The javadoc says "major" and the variable name
says "minor".
src/main/java/org/apache/hadoop/hbase/io/hfile/FixedFileTrailer.java:49
Please correct the misspelling.
src/main/java/org/apache/hadoop/hbase/io/hfile/FixedFileTrailer.java:352
I think this function needs to be renamed to expectAtLeastMajorVersion for
clarity.
src/main/java/org/apache/hadoop/hbase/io/hfile/HFileBlock.java:287
I think we should either consistently use the onDiskSizeWithHeader field or
get rid of it.
src/main/java/org/apache/hadoop/hbase/io/hfile/HFileWriterV1.java:220
Please do use a constant instead of "0" here for the minor version.
src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java:3551
Long line.
src/main/java/org/apache/hadoop/hbase/util/ChecksumType.java:60
This lazy initialization is not thread-safe. This also applies to other enum
members below. Can the meth field be initialized in the enum constructor, or
do we rely on some classes being loaded by the time this initialization is
invoked?
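To illustrate the constructor-based alternative: the JVM runs enum constructors exactly once, during class initialization, so a final field assigned there is safely published to all threads without extra locking. The sketch below is only a stand-in. SketchChecksumType, its constants, and newChecksum() are hypothetical names, not the code in the patch, and it assumes the checksum implementation is available when the enum class is loaded (which may not hold for the meth field in question).

```java
import java.util.function.Supplier;
import java.util.zip.CRC32;
import java.util.zip.Checksum;

// Hypothetical stand-in for the enum under review; all names are illustrative.
enum SketchChecksumType {
    // No checksum: the factory simply returns null.
    NONE(() -> null),
    // Checksum backed by the JDK implementation.
    JDK_CRC32(CRC32::new);

    // Assigned once in the constructor, so no lazy-init race is possible.
    private final Supplier<Checksum> factory;

    SketchChecksumType(Supplier<Checksum> factory) {
        this.factory = factory;
    }

    // Returns a fresh Checksum per call; Checksum objects are not thread-safe.
    Checksum newChecksum() {
        return factory.get();
    }
}
```

A caller would write SketchChecksumType.JDK_CRC32.newChecksum() and keep the returned object thread-local. If the lookup really must be deferred until some other class has been loaded, this approach does not apply as is.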
src/main/java/org/apache/hadoop/hbase/util/ChecksumType.java:63-67
Avoid repeating "org.apache.hadoop.util.PureJavaCrc32" three times in string
form.
src/main/java/org/apache/hadoop/hbase/util/ChecksumType.java:74-75
Avoid repeating the "java.util.zip.CRC32" string.
src/main/java/org/apache/hadoop/hbase/util/ChecksumType.java:98-99
Avoid repeating the string.
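As a rough sketch of what these three comments ask for, each class name can be defined once and reused for both the reflective lookup and any log messages. ChecksumClassNames and its methods are hypothetical helpers, not part of the patch. The fallback to java.util.zip.CRC32 when the Hadoop class is missing is an assumption made for the example.

```java
import java.util.zip.CRC32;
import java.util.zip.Checksum;

// Hypothetical helper; not the code in the patch under review.
final class ChecksumClassNames {
    // Defined once; may be absent from the classpath, hence kept as a string.
    static final String PURE_JAVA_CRC32 = "org.apache.hadoop.util.PureJavaCrc32";
    // Always present in the JDK, so the name can come from the class literal.
    static final String JDK_CRC32 = CRC32.class.getName();

    private ChecksumClassNames() {}

    // Instantiates a Checksum by class name via its no-argument constructor.
    static Checksum newInstance(String className) throws ReflectiveOperationException {
        Class<?> clazz = Class.forName(className);
        return (Checksum) clazz.getDeclaredConstructor().newInstance();
    }

    // Prefers the pure-Java CRC32 if it can be loaded, else uses the JDK one.
    static Checksum newCrc32() {
        try {
            return newInstance(PURE_JAVA_CRC32);
        } catch (ReflectiveOperationException | ClassCastException e) {
            return new CRC32();
        }
    }
}
```

With this shape, a log line such as "loaded " + ChecksumClassNames.PURE_JAVA_CRC32 refers to the same constant as the lookup, so the two cannot drift apart.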
src/test/java/org/apache/hadoop/hbase/io/hfile/TestChecksum.java:132
Fix indentation.
src/test/java/org/apache/hadoop/hbase/io/hfile/TestChecksum.java:174
Fix indentation.
src/test/java/org/apache/hadoop/hbase/io/hfile/TestChecksum.java:71
Inconsistent formatting: "1024 +980".
REVISION DETAIL
https://reviews.facebook.net/D1521
> support checksums in HBase block cache
> --------------------------------------
>
> Key: HBASE-5074
> URL: https://issues.apache.org/jira/browse/HBASE-5074
> Project: HBase
> Issue Type: Improvement
> Components: regionserver
> Reporter: dhruba borthakur
> Assignee: dhruba borthakur
> Attachments: D1521.1.patch, D1521.1.patch, D1521.2.patch,
> D1521.2.patch, D1521.3.patch, D1521.3.patch
>
>
> The current implementation of HDFS stores the data in one block file and the
> metadata (checksum) in another block file. This means that every read into the
> HBase block cache actually consumes two disk iops, one to the data file and
> one to the checksum file. This is a major problem for scaling HBase, because
> HBase is usually bottlenecked on the number of random disk iops that the
> storage hardware offers.