John created AMBARI-7782:
----------------------------

             Summary: Stop RegionServer may fail during enabling NameNode HA
                 Key: AMBARI-7782
                 URL: https://issues.apache.org/jira/browse/AMBARI-7782
             Project: Ambari
          Issue Type: Bug
          Components: ambari-server
    Affects Versions: 1.7.0
            Reporter: John


Ambari will stop and restart all service during the period of enable Namenode 
HA. Sometimes, stop HBase RegionServer may fail which causes the failure of 
"Stop all service" operation. At this moment, one or more retries can overcome 
this problem.

The possible root cause of this issue may be that: command 
"/usr/lib/hbase/bin/hbase-daemon.sh --config /etc/hbase/conf stop regionserver" 
consumes 30 + epsilon seconds; timeout handler is triggered since deadline is 
missed; begin to force kill process (this operation fails because its start 
time is after 30 + epsilon at which HBase RegionServer is offline).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to