Hari Sekhon created AMBARI-9197:
-----------------------------------

             Summary: Ambari gets stuck / not able to cancel timed out operation
                 Key: AMBARI-9197
                 URL: https://issues.apache.org/jira/browse/AMBARI-9197
             Project: Ambari
          Issue Type: Bug
          Components: ambari-server, ambari-web
    Affects Versions: 1.7.0
         Environment: HDP 2.2
            Reporter: Hari Sekhon


Ambari server recently gained the ability to cancel operations 
(AMBARI-1897), but it cannot cancel operations that are timing out (shown in 
yellow), and it stays stuck in this state for several minutes.

I've attached a screenshot showing that the stalled operations in yellow have 
no X next to them.

This is the result of a hang on a client but highlights that Ambari server's 
ability to cancel operations needs hardening. I've raised the scenario for the 
agent hang in AMBARI-8768.

For expedience, I had to forcibly mount HDFS NFS to recover the Ambari agent and 
then bounce Ambari server to clear its blocked operations queue so I could 
continue restarting services with updated configs.

Regards,

Hari Sekhon
http://www.linkedin.com/in/harisekhon



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
