[ https://issues.apache.org/jira/browse/HBASE-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lars George resolved HBASE-2181. -------------------------------- Resolution: Duplicate Duplicates HBASE-2165 > remove or provide config to completely disable all aspects of 'table > fragmentation' > ------------------------------------------------------------------------------------ > > Key: HBASE-2181 > URL: https://issues.apache.org/jira/browse/HBASE-2181 > Project: Hadoop HBase > Issue Type: Bug > Reporter: ryan rawson > Fix For: 0.21.0 > > > Given the potentially low and misleading value of the metric, and how much > effort must be expended to collect them, I would argue at least we should > allow users to disable the feature completely. > The first problem is the data the metric delivers is not very useful. On any > given busy system, this value is often 100%. On a sample system here, 12% of > the tables were at either 0 or 100%. Furthermore the 100% metric is not > particularly informative. If a table has 100% 'fragmentation' it does not > necessarily imply that this table is in dire need of compaction. The HBase > compaction code will generally keep at least 2 store files around - it > refuses to minor compact older and larger files, preferring to merge small > files. Thus on a table taking writes on all regions, the expected value of > fragmentation is in fact 100%. And this is not a bad thing either. > Considering that compacting a 500GB table will take an hour and hammer a > cluster, misleading users into striving to get to 0% is non ideal. > The other major problem of this feature is collecting the data is non-trivial > on larger clusters. I did a test where I did a lsr on a hadoop cluster, and > to generate 15k lines of output, it pegged the namenode at over 100% cpu for > a few seconds. On a cluster with 7000 regions, we can clearly easily have > 14,000 (2 store files per region is typical) files thus causing spikes > against the namenode to generate this statistic. > I would propose 3 courses of actions: > - allow complete disablement of the feature, including the background thread > and the UI display > - change the metric to mean '# of regions with > 5 store files' > - replacing the metric with a completely different one that attempts to > capture the spirit of the intent but with less load. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.