Eli Levine created PHOENIX-3871:
-----------------------------------

             Summary: Incremental stats collection
                 Key: PHOENIX-3871
                 URL: https://issues.apache.org/jira/browse/PHOENIX-3871
             Project: Phoenix
          Issue Type: Improvement
            Reporter: Eli Levine

Phoenix automatically gathers statistics at [major compaction time|http://phoenix.apache.org/update_statistics.html]. While this is useful and accurate, major compactions are infrequent (there can be days between them), so statistics can become stale, which reduces their usefulness.

This Jira asks: is it possible for Phoenix to collect statistics at a more granular level, say for every UPSERT (or a sampling of them), or at minor compaction? Since statistics are always approximations, it is OK for this incremental approach to not be 100% accurate. The current stats collection mechanism should be kept so that stats are accurately "fixed up" at major compaction time.

[~jamestaylor], FYI. We talked about this in person a few weeks ago. Creating this Jira for posterity. Please add anything that I missed. Thanks!
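
To make the idea concrete, here is a rough sketch of how sampled, incremental updates could coexist with an exact "fix up" at major compaction. This is not Phoenix code: the class `IncrementalStats`, its methods, and the `sample_rate` parameter are all hypothetical names made up for illustration, and the sketch tracks only an approximate row count and byte count rather than real guideposts.

```python
import random


class IncrementalStats:
    """Hypothetical approximate row/byte counters; illustration only, not Phoenix code."""

    def __init__(self, sample_rate=0.1, rng=None):
        if not 0.0 < sample_rate <= 1.0:
            raise ValueError("sample_rate must be in (0, 1]")
        self.sample_rate = sample_rate
        self.rng = rng or random.Random()
        self.est_rows = 0.0
        self.est_bytes = 0.0

    def on_upsert(self, row_bytes):
        # Look at only a fraction of writes, and scale each sampled write
        # by 1/sample_rate so the running totals stay unbiased estimates.
        if self.rng.random() < self.sample_rate:
            weight = 1.0 / self.sample_rate
            self.est_rows += weight
            self.est_bytes += weight * row_bytes

    def on_major_compaction(self, exact_rows, exact_bytes):
        # Major compaction sees all the data, so replace the drifting
        # estimates with the exact values (the "fix up" step).
        self.est_rows = float(exact_rows)
        self.est_bytes = float(exact_bytes)


# Usage: estimates drift between compactions and are reset by the exact pass.
stats = IncrementalStats(sample_rate=1.0)
for _ in range(5):
    stats.on_upsert(100)
stats.on_major_compaction(exact_rows=3, exact_bytes=250)
```

One known weakness of this sketch: an UPSERT that overwrites an existing row is still counted as a new row, so the estimate drifts upward until the next major compaction corrects it. That seems to fit the "OK to not be 100% accurate" assumption above, but a real implementation would have to decide how much drift is tolerable.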