[ 
https://issues.apache.org/jira/browse/HBASE-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12656386#action_12656386
 ] 

Andrew Purtell commented on HBASE-1062:
---------------------------------------

One way to handle this is to extend the concept of safe mode to the 
regionservers. They should hold off on compactions and splits for some 
configurable interval, and then slowly ramp up compactions/splits with 
randomized hold intervals to avoid thundering herd behavior. 

> Compactions at (re)start on a very large table can overwhelm DFS
> ----------------------------------------------------------------
>
>                 Key: HBASE-1062
>                 URL: https://issues.apache.org/jira/browse/HBASE-1062
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: Andrew Purtell
>
> Given a large table, > 1000 regions for example, if a cluster restart is 
> necessary, the compactions undertaken by the regionservers when the master 
> makes initial region assignments can overwhelm DFS, leading to file errors 
> and data loss. This condition is exacerbated if write load was heavy before 
> restart and so many regions want to split as soon as they are opened. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to