[ https://issues.apache.org/jira/browse/HADOOP-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12559938#action_12559938 ]
Jim Kellerman commented on HADOOP-2636: --------------------------------------- Better yet, move triggering of cache flush to the store level instead of the region level. Same for compactions. Split still has to happen at the region level because it is the region that embodies the concept of row range. However the split could be triggered by a single store reaching the split threshold. > [hbase] Make flusher less dumb > ------------------------------ > > Key: HADOOP-2636 > URL: https://issues.apache.org/jira/browse/HADOOP-2636 > Project: Hadoop > Issue Type: Improvement > Components: contrib/hbase > Reporter: stack > Priority: Minor > > When flusher runs -- its triggered when the sum of all Stores in a Region > a > configurable max size -- we flush all Stores though a Store memcache might > have but a few bytes. > I would think Stores should only dump their memcache disk if they have some > substance. > The problem becomes more acute, the more families you have in a Region. > Possible behaviors would be to dump the biggest Store only, or only those > Stores > 50% of max memcache size. Behavior would vary dependent on the > prompt that provoked the flush. Would also log why the flush is running: > optional or > max size. > This issue comes out of HADOOP-2621. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.