[
https://issues.apache.org/jira/browse/AMBARI-10029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14518734#comment-14518734
]
Hudson commented on AMBARI-10029:
---------------------------------
SUCCESS: Integrated in Ambari-trunk-Commit #2468 (See
[https://builds.apache.org/job/Ambari-trunk-Commit/2468/])
AMBARI-10029. Node recovery support - phase 2 (smohanty:
http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=a3a0ae041a43aaa1ffdc8f9247338bb68dd5675f)
* ambari-agent/src/test/python/ambari_agent/TestRecoveryManager.py
*
ambari-server/src/main/java/org/apache/ambari/server/actionmanager/ActionScheduler.java
*
ambari-server/src/test/java/org/apache/ambari/server/agent/TestHeartbeatMonitor.java
* ambari-server/src/main/java/org/apache/ambari/server/agent/ActionQueue.java
* ambari-agent/src/main/python/ambari_agent/ActionQueue.py
* ambari-agent/src/test/python/ambari_agent/TestController.py
* ambari-server/src/main/java/org/apache/ambari/server/agent/StatusCommand.java
*
ambari-server/src/main/java/org/apache/ambari/server/agent/HeartBeatResponse.java
*
ambari-server/src/main/java/org/apache/ambari/server/agent/HeartbeatMonitor.java
* ambari-agent/src/main/python/ambari_agent/RecoveryManager.py
* ambari-agent/src/test/python/ambari_agent/TestActionQueue.py
*
ambari-server/src/test/java/org/apache/ambari/server/agent/TestActionQueue.java
* ambari-agent/src/main/python/ambari_agent/LiveStatus.py
* ambari-agent/src/main/python/ambari_agent/Controller.py
* ambari-agent/src/test/python/ambari_agent/TestLiveStatus.py
*
ambari-server/src/main/java/org/apache/ambari/server/agent/HeartBeatHandler.java
> Node auto-recovery
> ------------------
>
> Key: AMBARI-10029
> URL: https://issues.apache.org/jira/browse/AMBARI-10029
> Project: Ambari
> Issue Type: New Feature
> Components: ambari-agent, ambari-server
> Affects Versions: 2.0.0
> Reporter: Sumit Mohanty
> Assignee: Sumit Mohanty
> Fix For: 2.1.0
>
> Attachments: AMBARI-10029.p-II.patch, AMBARI-10029.patch,
> NodeRecovery.pdf
>
>
> Using blue-print, it is possible to perform a zero-touch install of hadoop
> clusters using Ambari. This is especially useful in the cloud environment.
> However, cloud environment also can be dynamic in the sense that nodes will
> get rebooted or reset to the original image.
> Reset, being that the node (usually VM) gets reverted to original state where
> it joined the cluster. It is assumed that a reset node has ambari-agent
> installed and configured to communicate with the server. The node may also
> have all packages pre-instaled.
> Node recovery is the feature to bring back a rebooted/reset online by
> starting or installing and then starting the host components that are already
> on the host.
> In general, temporarily losing a node and then performing node recovery on a
> slave host should not affect the whole cluster. If its is a master node then
> there can be some disruption based on what is deployed on the master host and
> if HA is enabled for the master services or not.
> Node recovery, discussed in this JIRA, only addresses the ability to
> automatically INSTALL/CONFIGURE/START host components on the node so that the
> desired state of the host component matches the actual state.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)