Eli Levine created PHOENIX-3871:
-----------------------------------
Summary: Incremental stats collection
Key: PHOENIX-3871
URL: https://issues.apache.org/jira/browse/PHOENIX-3871
Project: Phoenix
Issue Type: Improvement
Reporter: Eli Levine
Phoenix automatically gathers statistics at [major compaction
time|http://phoenix.apache.org/update_statistics.html]. While this is useful
and accurate, it also means that statistics can become stale due to the
infrequency of major compactions (can be days between major compactions),
reducing their usefulness.
This jira asks the question: Is it possible for Phoenix to collects statistics
at a more granular level, say for every (or a sampling of) UPSERT, or minor
compaction. Since statistics are always approximations, it is OK for this
incremental approach to not be 100% accurate.
The current stats collection mechanism at major compaction time should be kept
to accurately "fix up" stats at major compaction time.
[~jamestaylor], FYI. We talked about this in person a few weeks ago. Creating
this Jira for posterity. Please add anything that I missed. Thanks!
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)