[ 
https://issues.apache.org/jira/browse/HBASE-613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12598702#action_12598702
 ] 

stack commented on HBASE-613:
-----------------------------

Thanks for writing a unit test.  Have you tried the original prescription 
above, using PE sequentialWrite three times in a row?  After each upload, run a 
scan to find the latest timestamp for the upload... then confirm that a scanner 
against the latest timestamp returns all cells?  Doing this procedure three 
times, I found that the first upload was missing about half of its records 
after 3rd upload (I had a bit of jython for doing the scans but seem to have 
mislaid it).



> Timestamp-anchored scanning fails to find all records
> -----------------------------------------------------
>
>                 Key: HBASE-613
>                 URL: https://issues.apache.org/jira/browse/HBASE-613
>             Project: Hadoop HBase
>          Issue Type: Bug
>          Components: client
>            Reporter: stack
>            Assignee: Jim Kellerman
>             Fix For: 0.2.0
>
>         Attachments: TestTimestampScanning.java
>
>
> If I add 3 versions of a cell and then scan across the first set of added 
> cells using a timestamp that should only get values from the first upload, a 
> bunch are missing (I added 100k on each of the three uploads).  I thought it 
> the fact that we set the number of cells found back to 1 in HStore when we 
> move off current row/column but that doesn't seem to be it.  I also tried 
> upping the MAX_VERSIONs on my table and that seemed to have no effect.  Need 
> to look closer.
> Build a unit test because replicating on cluster takes too much time.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to