[ https://issues.apache.org/jira/browse/HBASE-1818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matteo Bertozzi updated HBASE-1818:
-----------------------------------
Attachment: binary-block-seek.patch
I've rewritten Scanner.blockSeek() to perform a binary search.
The new method behaves the same as the previous one, even with duplicate
keys.
Unfortunately, I've used an ArrayList to keep track of the keys' offsets. The
ArrayList can be replaced by an array if the number of keys in a block is known.
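
For readers without the patch at hand, here is a minimal sketch of the idea,
not the code in binary-block-seek.patch. It assumes a simplified, hypothetical
entry layout ([int keyLen][int valueLen][key][value]) and unsigned byte-wise
key ordering. The class BlockSeekSketch and its methods are illustrative names
only. On duplicate keys it lands on the first matching entry, which is one
plausible reading of "same behaviour as the previous one".

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class BlockSeekSketch {
    // Hypothetical entry layout: [int keyLen][int valueLen][key bytes][value bytes].

    /** Builds a small block from parallel arrays; keys must already be sorted. */
    static ByteBuffer buildBlock(String[] keys, String[] values) {
        ByteBuffer buf = ByteBuffer.allocate(1024);
        for (int i = 0; i < keys.length; i++) {
            byte[] k = keys[i].getBytes(StandardCharsets.UTF_8);
            byte[] v = values[i].getBytes(StandardCharsets.UTF_8);
            buf.putInt(k.length).putInt(v.length).put(k).put(v);
        }
        buf.flip();
        return buf;
    }

    /** One pass over the length headers, recording where each entry starts. */
    static List<Integer> indexOffsets(ByteBuffer block) {
        List<Integer> offsets = new ArrayList<>();
        int pos = 0;
        while (pos < block.limit()) {
            offsets.add(pos);
            int keyLen = block.getInt(pos);
            int valueLen = block.getInt(pos + 4);
            pos += 8 + keyLen + valueLen;
        }
        return offsets;
    }

    /** Copies out the key of the entry that starts at the given offset. */
    static byte[] keyAt(ByteBuffer block, int offset) {
        byte[] key = new byte[block.getInt(offset)];
        for (int i = 0; i < key.length; i++) {
            key[i] = block.get(offset + 8 + i);
        }
        return key;
    }

    /**
     * Binary search over the recorded offsets. Returns the index of the first
     * entry whose key is >= target, or offsets.size() if every key is smaller.
     * With duplicate keys this is the first of the duplicates.
     */
    static int lowerBound(ByteBuffer block, List<Integer> offsets, byte[] target) {
        int lo = 0;
        int hi = offsets.size();
        while (lo < hi) {
            int mid = (lo + hi) >>> 1;
            byte[] midKey = keyAt(block, offsets.get(mid));
            if (Arrays.compareUnsigned(midKey, target) < 0) {
                lo = mid + 1;
            } else {
                hi = mid;
            }
        }
        return lo;
    }
}
```

In this sketch, collecting the offsets is still a linear walk over the length
headers; only the number of key comparisons drops to O(log n). If the number
of keys in a block were known up front, the List<Integer> could be replaced by
a plain int[] sized once.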
> HFile code review and refinement
> --------------------------------
>
> Key: HBASE-1818
> URL: https://issues.apache.org/jira/browse/HBASE-1818
> Project: HBase
> Issue Type: Improvement
> Components: io
> Affects Versions: 0.20.0
> Reporter: Schubert Zhang
> Assignee: Schubert Zhang
> Priority: Minor
> Fix For: 0.20.1, 0.90.0
>
> Attachments: binary-block-seek.patch, HFile-v3.patch, HFile-v4.patch,
> HFile-v5.patch
>
>
> HFile closely mimics Google's SSTable file format, and we want HFile to
> become a common file format for Hadoop in the near future.
> We will review the HFile code, record our comments here, and then
> provide a fixed patch after the review.