-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/30603/
-----------------------------------------------------------

(Updated Feb. 4, 2015, 10:38 p.m.)


Review request for Ambari, Dmitro Lisnichenko, Jonathan Hurley, Nate Cole, and 
Yurii Shylov.


Changes
-------

Unit tests passed.


Bugs: AMBARI-9467
    https://issues.apache.org/jira/browse/AMBARI-9467


Repository: ambari


Description
-------

UpgradeHelper somehow calls the active Namenode first, but this ends up being 
the standby namenode by the time it gets called; investigate why.

We will abide by the order in the runbook to first upgrade the standby then the 
active namenode, which then causes a flip.
In rare cases, if a namenode fails for whatever reason, ZKFC will initiate a 
failover, which explains why sometimes the order may be flipped by the time 
that the Namenode prepare happens. However, the namenode_upgrade.py script 
works in both cases (active first, or standby first). So this explains the rare 
behavior.
There's another Jira to run the namenode_upgrade script as part of the 
Pre-Cluster group to make the backup, so this should reduce the likelyhood of a 
flip happening after the calculation was made.


Diffs
-----

  
ambari-server/src/main/java/org/apache/ambari/server/serveraction/upgrades/FinalizeUpgradeAction.java
 fceb44d 
  ambari-server/src/main/java/org/apache/ambari/server/state/UpgradeHelper.java 
0c6f68a 
  
ambari-server/src/main/java/org/apache/ambari/server/state/cluster/ClusterImpl.java
 4a8c020 
  
ambari-server/src/main/resources/common-services/ZOOKEEPER/3.4.5.2.0/package/scripts/params.py
 2484463 
  
ambari-server/src/main/resources/common-services/ZOOKEEPER/3.4.5.2.0/package/scripts/service_check.py
 338de32 
  
ambari-server/src/main/resources/common-services/ZOOKEEPER/3.4.5.2.0/package/scripts/zookeeper_server.py
 a7ca335 
  
ambari-server/src/test/java/org/apache/ambari/server/state/UpgradeHelperTest.java
 396a91c 
  
ambari-server/src/test/java/org/apache/ambari/server/state/cluster/ClusterTest.java
 bb6a713 

Diff: https://reviews.apache.org/r/30603/diff/


Testing (updated)
-------

Verified Rolling Upgrade a 3-node cluster with HDFS, ZK, and Namenode HA. The 
flip happens rarely, but ambari must be robust to handle it.

Unit tests passed.
[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 30:57.087s
[INFO] Finished at: Wed Feb 04 14:38:29 PST 2015
[INFO] Final Memory: 60M/390M
[INFO] ------------------------------------------------------------------------


Thanks,

Alejandro Fernandez

Reply via email to