Evelyn Boland created HBASE-28837:
-------------------------------------

             Summary: Add row statistics coprocessor
                 Key: HBASE-28837
                 URL: https://issues.apache.org/jira/browse/HBASE-28837
             Project: HBase
          Issue Type: Improvement
    Affects Versions: 3.0.0-beta-1, 2.0.0
            Reporter: Evelyn Boland


Goal:

Add a coprocessor to HBase that allows administrators to track high level 
statistics on the rows and cells in their HBase tables. Administrators can load 
this coprocessor into their RegionServers if they wish to gain more visibility 
into the shape of their data in HBase.

At my day job, we've leveraged the statistics from this coprocessor to 
automatically configure more optimal block sizes and smarter compaction 
schedules for our fleet of nearly 200 HBase clusters.

Context:

Since HBase tables can store terabytes or even petabytes of data, HBase 
administrators often have incomplete information about the data stored in their 
HBase tables. Without a comprehensive understanding of the shape of their data, 
it can be difficult for administrators to configure clusters for a desired 
level of performance and/or reliability. Row statistics have the potential to 
supercharge HBase management.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to