[ 
https://issues.apache.org/jira/browse/HAMA-939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14578201#comment-14578201
 ] 

Edward J. Yoon commented on HAMA-939:
-------------------------------------

{code}
Index: core/src/main/java/org/apache/hama/bsp/sync/ZooKeeperSyncClientImpl.java
===================================================================
--- core/src/main/java/org/apache/hama/bsp/sync/ZooKeeperSyncClientImpl.java    
(revision 1684294)
+++ core/src/main/java/org/apache/hama/bsp/sync/ZooKeeperSyncClientImpl.java    
(working copy)
@@ -82,7 +82,13 @@
     initialize(this.zk, bspRoot);
 
     peerAddress = new InetSocketAddress(bindAddress, bindPort);
-    LOG.info("Start connecting to Zookeeper! At " + peerAddress);
+    LOG.info("Quorum servers: " + quorumServers);
+    while(!zk.getState().isConnected()) {
+      LOG.info("connecting to Zookeeper..");
+      Thread.sleep(1000);
+    }
+    LOG.info("connected to Zookeeper: " + zk.getState().isConnected());
+    
     numBSPTasks = conf.getInt("bsp.peers.num", 1);
   }
 {code}

With above trace code, I chekced that zk sync server is ok, and every tasks can 
be connected to zk. I doubt the taskid mismatch during write zk nodes.

> Refactoring which was implement using out-of-date status response
> -----------------------------------------------------------------
>
>                 Key: HAMA-939
>                 URL: https://issues.apache.org/jira/browse/HAMA-939
>             Project: Hama
>          Issue Type: Improvement
>          Components: yarn
>    Affects Versions: 0.7.0
>            Reporter: Minho Kim
>            Assignee: Minho Kim
>             Fix For: 0.7.0
>
>         Attachments: HAMA-939_v1.patch, HAMA-939_v2.patch, HAMA-939_v3.patch
>
>
> Status response handler of current YARN module is out of date.
> So I'm planning to modify status response handler using callback handler of 
> RM and NM.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to