Eli Levine created PHOENIX-3871:
-----------------------------------

             Summary: Incremental stats collection
                 Key: PHOENIX-3871
                 URL: https://issues.apache.org/jira/browse/PHOENIX-3871
             Project: Phoenix
          Issue Type: Improvement
            Reporter: Eli Levine


Phoenix automatically gathers statistics at [major compaction 
time|http://phoenix.apache.org/update_statistics.html]. Statistics gathered 
this way are accurate, but major compactions are infrequent (days can pass 
between them), so the statistics can become stale, which reduces their 
usefulness. 

This Jira asks the question: is it possible for Phoenix to collect statistics 
at a more granular level, say on every UPSERT (or a sampling of them) or at 
minor compaction? Since statistics are always approximations, it is OK for 
this incremental approach not to be 100% accurate.

The current stats collection mechanism at major compaction time should be kept 
so that it can accurately "fix up" the incrementally collected stats.
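
To make the idea concrete, here is a minimal sketch of how a sampled, 
incremental estimate could coexist with the exact major compaction pass. It is 
only an illustration, not Phoenix code: the class and method names are 
hypothetical, a single row count stands in for whatever statistics Phoenix 
actually keeps, and it assumes every UPSERT adds a new row (overwrites would 
make the estimate run high, which is acceptable for an approximation).

```java
import java.util.concurrent.ThreadLocalRandom;

// Hypothetical sketch: none of these names exist in Phoenix.
class IncrementalRowCountEstimate {
    private final double sampleRate; // fraction of UPSERTs that are counted
    private long baseline;           // exact row count from the last major compaction
    private long sampledUpserts;     // sampled UPSERTs seen since that compaction

    IncrementalRowCountEstimate(double sampleRate) {
        if (sampleRate <= 0.0 || sampleRate > 1.0) {
            throw new IllegalArgumentException("sampleRate must be in (0, 1]");
        }
        this.sampleRate = sampleRate;
    }

    // Called on the write path; only a sampled fraction of calls update state.
    synchronized void onUpsert() {
        if (ThreadLocalRandom.current().nextDouble() < sampleRate) {
            sampledUpserts++;
        }
    }

    // Approximate row count: exact baseline plus the scaled-up sample.
    synchronized long estimate() {
        return baseline + Math.round(sampledUpserts / sampleRate);
    }

    // The existing major compaction pass "fixes up" the estimate by
    // replacing it with the exact value and resetting the sample.
    synchronized void onMajorCompaction(long exactRowCount) {
        baseline = exactRowCount;
        sampledUpserts = 0;
    }
}
```

A lower sample rate costs less on the write path but makes the estimate 
noisier between major compactions; the fix-up step bounds how long any error 
can accumulate.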

[~jamestaylor], FYI. We talked about this in person a few weeks ago. Creating 
this Jira for posterity. Please add anything that I missed. Thanks!



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
