Duo Zhang created HBASE-21325:
---------------------------------

             Summary: Add a max wait time for waitOnAllRegionsToClose
                 Key: HBASE-21325
                 URL: https://issues.apache.org/jira/browse/HBASE-21325
             Project: HBase
          Issue Type: Improvement
            Reporter: Duo Zhang


When testing sync replication, I found that, if I transit the remote cluster to 
DA, while the local cluster is still in A, the region server will hang when 
shutdown. As the fsOk flag only test the local cluster(which is reasonable), we 
will enter the waitOnAllRegionsToClose, and since the WAL is broken(the remote 
wal directory is gone)  so we will never succeed. And this lead to an infinite 
wait inside waitOnAllRegionsToClose.

So I think here we should have an upper bound for the wait time in 
waitOnAllRegionsToClose method.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to