ivakegg opened a new issue, #5951:
URL: https://github.com/apache/accumulo/issues/5951

   I have conclusively proved that if you scan a table without a versioning 
iterator, and that table contains identical keys with different values, keys 
will be dropped.  I tried both batch scanners and single scanners, and I 
varied the buffer sizes, and the symptoms were the same: keys were 
lost.  I had to read the rfile directly to see all of the 
keys I needed for processing.  None of the keys have the delete flag set.
   
   Accumulo 2.1.4
   Red Hat 8
   
   I have an example of a table with only 1 file that demonstrates this issue.  
I have not attempted to create a test example as of yet.
   
   I expect that a scan of a table without any iterators or any delete keys 
would be equivalent to a direct scan of the rfiles.
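
   As a rough sketch of how that expectation could be checked once the keys 
from both reads have been collected, the snippet below counts entries per key 
and reports keys that appear more often in the direct rfile read than in the 
table scan.  It uses only the Java standard library; the class name 
`ScanDiff`, the method names, and the sample key strings are hypothetical, and 
keys are assumed to have already been rendered as strings (row, column, 
timestamp) by whatever code does the actual reading.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class ScanDiff {

    /** Counts how many entries were seen for each key string. */
    public static Map<String, Integer> countByKey(List<String> keys) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String key : keys) {
            counts.merge(key, 1, Integer::sum);
        }
        return counts;
    }

    /**
     * Returns, for each key, how many entries were seen in the rfile read
     * but not in the table scan. An empty map means nothing was dropped.
     */
    public static Map<String, Integer> missingFromScan(List<String> rfileKeys,
                                                       List<String> scanKeys) {
        Map<String, Integer> scanCounts = countByKey(scanKeys);
        Map<String, Integer> missing = new TreeMap<>();
        for (Map.Entry<String, Integer> e : countByKey(rfileKeys).entrySet()) {
            int diff = e.getValue() - scanCounts.getOrDefault(e.getKey(), 0);
            if (diff > 0) {
                missing.put(e.getKey(), diff);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Hypothetical sample data: the rfile holds two entries with an
        // identical key, while the scan returned only one of them.
        List<String> rfileKeys = List.of("row1 cf:cq 5", "row1 cf:cq 5", "row2 cf:cq 7");
        List<String> scanKeys = List.of("row1 cf:cq 5", "row2 cf:cq 7");
        System.out.println(missingFromScan(rfileKeys, scanKeys)); // prints {row1 cf:cq 5=1}
    }
}
```

   Counting per key, rather than comparing sets, matters here because the 
dropped entries share a key with entries that were returned.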
   
   I have noted that in the example I have, if I scan the separate rows 
directly then it is less likely to drop keys.  However, if I do a full scan of 
the table, it is more likely to drop keys.


