[ https://issues.apache.org/jira/browse/HADOOP-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
stack resolved HADOOP-1644. --------------------------- Resolution: Fixed Committed after testing on three different machines -- macosx and two linux machines one of which was an old K6 single-processor -- and checking no javadoc errors. Resolving. > [hbase] Compactions should not block updates > -------------------------------------------- > > Key: HADOOP-1644 > URL: https://issues.apache.org/jira/browse/HADOOP-1644 > Project: Hadoop > Issue Type: Improvement > Components: contrib/hbase > Affects Versions: 0.15.0 > Reporter: stack > Assignee: stack > Fix For: 0.15.0 > > Attachments: interlacing.patch, non-blocking-compaction-v2.patch, > non-blocking-compaction.patch > > > Currently, compactions take a long time. During compaction, updates are > carried by the HRegions' memcache (+ backing HLog). memcache is unable to > flush to disk until compaction completes. > Under sustained, substantial -- rows that contain multiple columns one of > which is a web page -- updates by multiple concurrent clients (10 in this > case), a common hbase usage scenario, the memcache grows fast and often to > orders of magnitude in excess of the configured 'flush-to-disk' threshold. > This throws the whole system out of kilter. When memcache does get to run > after compaction completes -- assuming you have sufficent RAM and the region > server doesn't OOME -- then the resulting on-disk file will be way larger > than any other on-disk HStoreFile bringing on a region split ..... but the > resulting split will produce regions that themselves need to be immediately > split because each half is beyond the configured limit, and so on... > In another issue yet to be posted, tuning and some pointed memcache flushes > makes the above condition less extreme but until compaction durations come > close to the memcache flush threshold compactions will remain disruptive. > Its allowed that compactions may never be fast enough as per bigtable paper > (This is a 'wish' issue). -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.