[
https://issues.apache.org/jira/browse/HBASE-613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12598702#action_12598702
]
stack commented on HBASE-613:
-----------------------------
Thanks for writing a unit test. Have you tried the original prescription
above: running PE sequentialWrite three times in a row? After each upload, run
a scan to find the latest timestamp for that upload, then confirm that a
scanner opened against that timestamp returns all cells. When I ran this
procedure three times, I found that the first upload was missing about half of
its records after the third upload (I had a bit of Jython for doing the scans
but seem to have mislaid it).
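
For reference, the procedure above can be sketched as a small self-contained
model. This is not HBase client code and not the attached unit test: the class
TimestampScanCheck and its methods upload, latestTimestamp, scanAt and
runProcedure are hypothetical names, and the in-memory TreeMap stands in for a
single column of a table. It assumes a timestamp-anchored scan should return,
for each row, the newest version at or before the given timestamp.

```java
import java.util.TreeMap;

public class TimestampScanCheck {
    // row key -> (timestamp -> value): an in-memory stand-in for one column.
    private final TreeMap<String, TreeMap<Long, String>> rows = new TreeMap<>();

    /** Writes rowCount rows at one timestamp, like a single upload pass. */
    void upload(int rowCount, long timestamp) {
        for (int i = 0; i < rowCount; i++) {
            String row = String.format("%010d", i);
            rows.computeIfAbsent(row, k -> new TreeMap<>())
                .put(timestamp, "value@" + timestamp);
        }
    }

    /** Full scan that reports the newest timestamp currently stored. */
    long latestTimestamp() {
        long latest = Long.MIN_VALUE;
        for (TreeMap<Long, String> versions : rows.values()) {
            latest = Math.max(latest, versions.lastKey());
        }
        return latest;
    }

    /** Anchored scan: counts rows that have a version at or before timestamp. */
    int scanAt(long timestamp) {
        int found = 0;
        for (TreeMap<Long, String> versions : rows.values()) {
            if (versions.floorKey(timestamp) != null) {
                found++;
            }
        }
        return found;
    }

    /**
     * Uploads three times, recording the latest timestamp after each upload,
     * then rescans at every recorded timestamp once all uploads are done.
     * Returns the number of rows found for each of the three timestamps.
     */
    public static int[] runProcedure(int rowCount) {
        TimestampScanCheck table = new TimestampScanCheck();
        long[] stamps = new long[3];
        for (int i = 0; i < 3; i++) {
            table.upload(rowCount, 1000L * (i + 1)); // made-up timestamps
            stamps[i] = table.latestTimestamp();
        }
        int[] counts = new int[3];
        for (int i = 0; i < 3; i++) {
            counts[i] = table.scanAt(stamps[i]);
        }
        return counts;
    }

    public static void main(String[] args) {
        int rowCount = 1000;
        int[] counts = runProcedure(rowCount);
        for (int i = 0; i < counts.length; i++) {
            System.out.println("upload " + (i + 1) + ": found " + counts[i]
                + " of " + rowCount + " rows");
        }
    }
}
```

In this model every anchored scan finds all rows, which is the behaviour the
check expects; the failure reported here would show up as a count for the first
upload that falls below the number of rows written.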
> Timestamp-anchored scanning fails to find all records
> -----------------------------------------------------
>
> Key: HBASE-613
> URL: https://issues.apache.org/jira/browse/HBASE-613
> Project: Hadoop HBase
> Issue Type: Bug
> Components: client
> Reporter: stack
> Assignee: Jim Kellerman
> Fix For: 0.2.0
>
> Attachments: TestTimestampScanning.java
>
>
> If I add 3 versions of a cell and then scan across the first set of added
> cells using a timestamp that should only get values from the first upload, a
> bunch are missing (I added 100k on each of the three uploads). I thought it
> was the fact that we set the number of cells found back to 1 in HStore when
> we move off the current row/column, but that doesn't seem to be it. I also
> tried upping the MAX_VERSIONs on my table and that seemed to have no effect.
> Need to look closer.
> Build a unit test because replicating on a cluster takes too much time.