[ https://issues.apache.org/jira/browse/HBASE-7849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Harsh J updated HBASE-7849: --------------------------- Description: Given the current mechanism, it is possible for users to flood a single region with 1k+ store files via the bulkload API and basically cause the region to become a flying dutchman - never getting assigned successfully again. Ideally, an administrative limit could solve this. If the bulkload RPC call can check if the region already has X store files, then it can reject the request to add another and throw a failure at the client with an appropriate message. This may be an intrusive change, but seems necessary in perfecting the gap between devs and ops in managing a HBase clusters. This would especially prevent abuse in form of unaware devs not pre-splitting tables before bulkloading things in. Currently, this leads to ops pain, as the devs think HBase has gone non-functional and begin complaining. was: Given the current mechanism, it is possible for users to flood a single region with 1k+ store files via the bulkload API and basically causes the region to become a flying dutchman - never getting assigned successfully again. Ideally, an administrative limit could solve this. If the bulkload RPC call can check if the region already has X store files, then it can reject the request to add another and throw a failure at the client with an appropriate message. This may be an intrusive change, but seems necessary in perfecting the gap between devs and ops in managing a HBase clusters. This would especially prevent abuse in form of unaware devs not pre-splitting tables before bulkloading things in. Currently, this leads to ops pain, as the devs think HBase has gone non-functional and begin complaining. > Provide administrative limits around bulkloads of files into a single region > ---------------------------------------------------------------------------- > > Key: HBASE-7849 > URL: https://issues.apache.org/jira/browse/HBASE-7849 > Project: HBase > Issue Type: Improvement > Components: regionserver > Reporter: Harsh J > > Given the current mechanism, it is possible for users to flood a single > region with 1k+ store files via the bulkload API and basically cause the > region to become a flying dutchman - never getting assigned successfully > again. > Ideally, an administrative limit could solve this. If the bulkload RPC call > can check if the region already has X store files, then it can reject the > request to add another and throw a failure at the client with an appropriate > message. > This may be an intrusive change, but seems necessary in perfecting the gap > between devs and ops in managing a HBase clusters. This would especially > prevent abuse in form of unaware devs not pre-splitting tables before > bulkloading things in. Currently, this leads to ops pain, as the devs think > HBase has gone non-functional and begin complaining. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira