[
https://issues.apache.org/jira/browse/HAMA-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14578201#comment-14578201
]
Edward J. Yoon commented on HAMA-939:
-------------------------------------
{code}
Index: core/src/main/java/org/apache/hama/bsp/sync/ZooKeeperSyncClientImpl.java
===================================================================
--- core/src/main/java/org/apache/hama/bsp/sync/ZooKeeperSyncClientImpl.java
(revision 1684294)
+++ core/src/main/java/org/apache/hama/bsp/sync/ZooKeeperSyncClientImpl.java
(working copy)
@@ -82,7 +82,13 @@
initialize(this.zk, bspRoot);
peerAddress = new InetSocketAddress(bindAddress, bindPort);
- LOG.info("Start connecting to Zookeeper! At " + peerAddress);
+ LOG.info("Quorum servers: " + quorumServers);
+ while(!zk.getState().isConnected()) {
+ LOG.info("connecting to Zookeeper..");
+ Thread.sleep(1000);
+ }
+ LOG.info("connected to Zookeeper: " + zk.getState().isConnected());
+
numBSPTasks = conf.getInt("bsp.peers.num", 1);
}
{code}
With above trace code, I chekced that zk sync server is ok, and every tasks can
be connected to zk. I doubt the taskid mismatch during write zk nodes.
> Refactoring which was implement using out-of-date status response
> -----------------------------------------------------------------
>
> Key: HAMA-939
> URL: https://issues.apache.org/jira/browse/HAMA-939
> Project: Hama
> Issue Type: Improvement
> Components: yarn
> Affects Versions: 0.7.0
> Reporter: Minho Kim
> Assignee: Minho Kim
> Fix For: 0.7.0
>
> Attachments: HAMA-939_v1.patch, HAMA-939_v2.patch, HAMA-939_v3.patch
>
>
> Status response handler of current YARN module is out of date.
> So I'm planning to modify status response handler using callback handler of
> RM and NM.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)