-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/40139/
-----------------------------------------------------------

(Updated Nov. 10, 2015, 7:30 p.m.)


Review request for Ambari, Alejandro Fernandez, Jonathan Hurley, Jayush Luniya, 
Nate Cole, and Yusaku Sako.


Summary (updated)
-----------------

SKIPPED_FAILED state should not be bubbled up to the Upgrade level


Bugs: AMBARI-13818
    https://issues.apache.org/jira/browse/AMBARI-13818


Repository: ambari


Description (updated)
-------

When there is a skipped failure, the "upgrade" state itself comes 
SKIPPED_FAILED. Even when the upgrade is running or paused, it is returning 
"SKIPPED_FAILED". The API should not roll this up to the "upgrade" level as the 
current behavior is confusing. At the top level, it should just be HOLDING, 
IN_PROGRESS, COMPLETED, etc. SKIPPED_FAILED should be bubbled up to the upgrade 
group level and stop there.


Also fixes another blocker:
STR:
1) Install and deploy cluster with older HDP version
2) Enable NameNode HA
3) Register, install new HDP version
4) Start Rolling Upgrade with "Skip all Service Check failures" and "Skip all 
Slave Component failures" options
5) Break datanode_upgrade.py script and wait for Core Slaves failures
6) Click "Pause upgrade" on "Core Slaves - >Verifying Skipped Failures" step
Result:
Button "Resume upgrade" doesn't work. After clicking on this button I've got 
next http response
{
  "status" : 400,
  "message" : "java.lang.IllegalArgumentException: Can only set status to 
PENDING when the upgrade is ABORTED (currently SKIPPED_FAILED)"
}


Diffs
-----

  
ambari-server/src/main/java/org/apache/ambari/server/controller/internal/CalculatedStatus.java
 f87c32c 

Diff: https://reviews.apache.org/r/40139/diff/


Testing (updated)
-------

checked on live cluster

mvn clean test in progress


Thanks,

Dmitro Lisnichenko

Reply via email to