Geoffrey Jacoby created PHOENIX-5645:
----------------------------------------

             Summary: GlobalIndexChecker should prevent compaction from purging 
very recently deleted cells
                 Key: PHOENIX-5645
                 URL: https://issues.apache.org/jira/browse/PHOENIX-5645
             Project: Phoenix
          Issue Type: Improvement
            Reporter: Geoffrey Jacoby
            Assignee: Geoffrey Jacoby


IndexTool rebuilds and index scrutiny can both give strange, incorrect results 
if a major compaction occurs in the middle of their run. In the rebuild case, 
it's because we're rewriting "history" on the index at the same time that 
compaction is rewriting "history" by purging deleted and expired cells. 

In the case of scrutiny, it's because scrutiny does an SCN-based lookback: if 
versions are purged on the index before the equivalent data table rows are 
purged, the two sides no longer match and you can get false errors. 

Since in the new indexing path we already have a coprocessor on each index, it 
should override the compaction hook to shield rows newer than some configurable 
age from being purged during a major compaction.
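
As a rough illustration of the shielding rule, here is a minimal, 
self-contained sketch. It does not use the HBase or Phoenix APIs: 
MaxLookbackSketch, VersionedCell, compact and the maxLookbackMs parameter 
are all hypothetical names invented for this example. It also assumes one 
possible reading of "newer than some configurable age", namely that a deleted 
cell may only be purged once the delete marker covering it is older than the 
lookback window.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class MaxLookbackSketch {

    /** Simplified stand-in for a cell; NOT org.apache.hadoop.hbase.Cell. */
    public static final class VersionedCell {
        public final String row;
        public final long timestamp;
        public final boolean deleteMarker;

        public VersionedCell(String row, long timestamp, boolean deleteMarker) {
            this.row = row;
            this.timestamp = timestamp;
            this.deleteMarker = deleteMarker;
        }
    }

    /**
     * A cell is purgeable only if a delete marker on the same row covers it
     * (marker timestamp >= cell timestamp) AND that marker is older than the
     * start of the lookback window. A delete marker covers itself.
     */
    public static boolean purgeable(VersionedCell cell, List<VersionedCell> all,
                                    long windowStartMs) {
        for (VersionedCell m : all) {
            if (m.deleteMarker
                    && m.row.equals(cell.row)
                    && m.timestamp >= cell.timestamp
                    && m.timestamp < windowStartMs) {
                return true;
            }
        }
        return false;
    }

    /** Returns the cells that survive a simulated major compaction. */
    public static List<VersionedCell> compact(List<VersionedCell> cells,
                                              long nowMs, long maxLookbackMs) {
        long windowStartMs = nowMs - maxLookbackMs;
        List<VersionedCell> kept = new ArrayList<>();
        for (VersionedCell c : cells) {
            if (!purgeable(c, cells, windowStartMs)) {
                kept.add(c);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<VersionedCell> cells = Arrays.asList(
            new VersionedCell("a", 5_000L, false),
            new VersionedCell("a", 9_500L, true),   // deleted inside the window
            new VersionedCell("b", 2_000L, false),
            new VersionedCell("b", 3_000L, true),   // deleted long ago
            new VersionedCell("c", 4_000L, false)); // never deleted
        for (VersionedCell c : compact(cells, 10_000L, 1_000L)) {
            System.out.println(c.row + " @" + c.timestamp
                + (c.deleteMarker ? " (delete marker)" : ""));
        }
    }
}
```

With a lookback of zero the sketch behaves like an unshielded major 
compaction under this simplified model, so the lookback age is the only 
setting that changes what is retained. A real implementation would apply the 
same decision inside the index coprocessor's compaction hook rather than over 
an in-memory list.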

In the future, this should be contributed as a general feature to HBase for 
arbitrary tables. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
