[
https://issues.apache.org/jira/browse/HADOOP-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12636459#action_12636459
]
Hong Tang commented on HADOOP-3315:
-----------------------------------
Yes, Scanner is stateful: lowerBound() or upperBound() will get to the right
point of the compression block, and keep the block steram open.
Sorry forgot to mention about the change from Location to RowID. One school of
thoughts is that one cannot really do much with the Location object. So at some
time of the point, we may need to expose the fact that you can locate by RowID
anyways. At that point, the use of Location would be obsolete. So let's just
expose it now to keep the API set tight. With RowID, you can easily implement
some auxiliary index to remember the # of rows starting with the same "row key"
in HBase, and you can do seek(current()+n) to efficiently skip those rows.
> New binary file format
> ----------------------
>
> Key: HADOOP-3315
> URL: https://issues.apache.org/jira/browse/HADOOP-3315
> Project: Hadoop Core
> Issue Type: New Feature
> Components: io
> Reporter: Owen O'Malley
> Assignee: Amir Youssefi
> Attachments: HADOOP-3315_20080908_TFILE_PREVIEW_WITH_LZO_TESTS.patch,
> HADOOP-3315_20080915_TFILE.patch, TFile Specification Final.pdf
>
>
> SequenceFile's block compression format is too complex and requires 4 codecs
> to compress or decompress. It would be good to have a file format that only
> needs
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.