Catalog Janitor logic bug causes region leackage
------------------------------------------------
Key: HBASE-4799
URL: https://issues.apache.org/jira/browse/HBASE-4799
Project: HBase
Issue Type: Bug
Components: master
Affects Versions: 0.90.4
Reporter: Max Lapan
Priority: Critical
When region split takes a significant amount of time, CatalogJanitor can
cleanup one of SPLIT records, but left another in META. When another split
finish, janitor cleans left SPLIT record, but parent regions haven't removed
from FS and META not cleared.
The race condition is follows:
1. region split started
2. one of regions splitted, i.e. A (have no reference storefiles) but other (B)
doesn't
3. janitor started and in routine checkDaughter removes SPLITA from meta, but
see that SPLITB has references and does nothing.
4. region B completes split
5. janitor wakes up, removes SPLITB, but see that there is no records for A and
does nothing again.
Result - parent region hangs forever.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira