Dmitry Lysnichenko created AMBARI-9836:
------------------------------------------

             Summary: Install Packages breaks all running cluster operations
                 Key: AMBARI-9836
                 URL: https://issues.apache.org/jira/browse/AMBARI-9836
             Project: Ambari
          Issue Type: Bug
          Components: ambari-server
    Affects Versions: 2.0.0
            Reporter: Dmitry Lysnichenko
            Assignee: Dmitry Lysnichenko
             Fix For: 2.0.0


- installed my cluster (hdfs, yarn, tez,hive,zk,pig,ams)
- stop all
- issue the "start all"
- since that command takes a while, I went to Admin > Stack and Versions > 
Version
- clicked manage versions, added a new version
- went and clicked Install Packages
- the Install version operation is queued, not started
- I attempt to cancel Install version, which doesn't seem to work.
- I see exceptions in ambari-server.log
- now my start all services is busted and not making progress either. 
Everything is at a standstill.

{code}
01:46:38,994 ERROR [Thread-20] DistributeRepositoriesActionListener:102 - 
Command report is null, marking action as INSTALL_FAILED
01:46:38,996 ERROR [Thread-20] DistributeRepositoriesActionListener:102 - 
Command report is null, marking action as INSTALL_FAILED
01:46:38,997 ERROR [Thread-20] DistributeRepositoriesActionListener:102 - 
Command report is null, marking action as INSTALL_FAILED
01:46:39,001  WARN [Thread-20] ActionScheduler:199 - Exception received
java.lang.NullPointerException
        at java.util.TreeMap.getEntry(TreeMap.java:342)
        at java.util.TreeMap.containsKey(TreeMap.java:227)
        at 
org.apache.ambari.server.state.cluster.ClusterImpl.getService(ClusterImpl.java:840)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.transitionToFailedState(ActionScheduler.java:655)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.abortOperationsForStage(ActionScheduler.java:628)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.processCancelledRequestsList(ActionScheduler.java:843)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.doWork(ActionScheduler.java:213)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.run(ActionScheduler.java:194)
        at java.lang.Thread.run(Thread.java:745)
01:46:49,003 ERROR [Thread-20] DistributeRepositoriesActionListener:102 - 
Command report is null, marking action as INSTALL_FAILED
01:46:49,005 ERROR [Thread-20] DistributeRepositoriesActionListener:102 - 
Command report is null, marking action as INSTALL_FAILED
01:46:49,007 ERROR [Thread-20] DistributeRepositoriesActionListener:102 - 
Command report is null, marking action as INSTALL_FAILED
01:46:49,010  WARN [Thread-20] ActionScheduler:199 - Exception received
java.lang.NullPointerException
        at java.util.TreeMap.getEntry(TreeMap.java:342)
        at java.util.TreeMap.containsKey(TreeMap.java:227)
        at 
org.apache.ambari.server.state.cluster.ClusterImpl.getService(ClusterImpl.java:840)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.transitionToFailedState(ActionScheduler.java:655)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.abortOperationsForStage(ActionScheduler.java:628)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.processCancelledRequestsList(ActionScheduler.java:843)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.doWork(ActionScheduler.java:213)
        at 
org.apache.ambari.server.actionmanager.ActionScheduler.run(ActionScheduler.java:194)
        at java.lang.Thread.run(Thread.java:745)

{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to