[
https://issues.apache.org/jira/browse/AIRAVATA-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14498214#comment-14498214
]
Eroma commented on AIRAVATA-1651:
---------------------------------
The same connection error occurred on BR2 for Amber Sander. It does not occur
every time; on the second attempt the experiment completed successfully.
Error in log:
2015-04-16 11:12:21,913 [pool-13-thread-6] ERROR org.apache.airavata.gfac.core.handler.GFacHandlerException - KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-AmberSander-BR2_a66220fe-a1fe-48fd-b148-5b2398a9c40e+IDontNeedaNode_ccccd401-8e6f-4497-8f89-043b9b3bbc75/org.apache.airavata.gfac.ssh.handler.SSHInputHandler
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-AmberSander-BR2_a66220fe-a1fe-48fd-b148-5b2398a9c40e+IDontNeedaNode_ccccd401-8e6f-4497-8f89-043b9b3bbc75/org.apache.airavata.gfac.ssh.handler.SSHInputHandler
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
    at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1228)
    at org.apache.airavata.gfac.core.utils.GFacUtils.savePluginData(GFacUtils.java:1222)
    at org.apache.airavata.gfac.ssh.handler.SSHInputHandler.invoke(SSHInputHandler.java:108)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeInFlowHandlers(BetterGfacImpl.java:901)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.launch(BetterGfacImpl.java:690)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:481)
    at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:210)
    at org.apache.airavata.gfac.core.utils.InputHandlerWorker.call(InputHandlerWorker.java:49)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)
Error in PGA:
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-AmberSander-BR2_a66220fe-a1fe-48fd-b148-5b2398a9c40e+IDontNeedaNode_ccccd401-8e6f-4497-8f89-043b9b3bbc75/org.apache.airavata.gfac.ssh.handler.SSHInputHandler
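Since the failure is intermittent and the second run succeeded, one possible
mitigation is to retry the ZooKeeper call a few times before failing the
experiment. The following is only a minimal sketch of that idea and is not
Airavata's code: the nested ConnectionLossException is a hypothetical stand-in
for org.apache.zookeeper.KeeperException.ConnectionLossException so that the
example compiles without the ZooKeeper jar, and callWithRetry, the attempt
count, and the delays are illustrative assumptions.

```java
import java.util.concurrent.Callable;

public class RetryOnConnectionLoss {

    /** Hypothetical stand-in for ZooKeeper's KeeperException.ConnectionLossException. */
    public static class ConnectionLossException extends Exception {
        public ConnectionLossException(String message) {
            super(message);
        }
    }

    /**
     * Runs op, retrying only when it fails with ConnectionLossException.
     * The delay doubles after each failed attempt. Any other exception,
     * or a connection loss on the last attempt, is propagated to the caller.
     */
    public static <T> T callWithRetry(Callable<T> op, int maxAttempts, long initialDelayMs)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        long delayMs = initialDelayMs;
        for (int attempt = 1; ; attempt++) {
            try {
                return op.call();
            } catch (ConnectionLossException e) {
                if (attempt >= maxAttempts) {
                    throw e; // out of attempts: surface the original error
                }
                Thread.sleep(delayMs);
                delayMs *= 2; // simple exponential backoff
            }
        }
    }

    public static void main(String[] args) throws Exception {
        int[] calls = {0};
        // Simulated operation: the first call loses the connection, the second succeeds.
        String result = callWithRetry(() -> {
            if (++calls[0] == 1) {
                throw new ConnectionLossException("KeeperErrorCode = ConnectionLoss");
            }
            return "saved";
        }, 3, 10);
        System.out.println(result + " after " + calls[0] + " attempts");
        // prints "saved after 2 attempts"
    }
}
```

With a real ZooKeeper client, a ConnectionLoss does not tell the caller whether
the request reached the server, so a retry like this is only appropriate for
operations that are safe to repeat.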
> Zookeeper connection lost error; Experiment failed
> --------------------------------------------------
>
> Key: AIRAVATA-1651
> URL: https://issues.apache.org/jira/browse/AIRAVATA-1651
> Project: Airavata
> Issue Type: Bug
> Environment: http://test-drive.airavata.org/pga/public
> Reporter: Eroma
> Assignee: Lahiru Gunathilake
>
> Two experiments have the same error message in the log.
> One experiment FAILED at the experiment level, with no job status recorded.
> The other experiment failed even though the job reached COMPLETE. The error
> occurs randomly; I was unable to reproduce it.
> Error messages retrieved from the log:
> 2015-03-26 09:33:34,693 [main-SendThread(gw127.iu.xsede.org:9181)] INFO org.apache.zookeeper.ClientCnxn - Opening socket connection to server gw127.iu.xsede.org/149.165.228.125:9181
> ...skipping...
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-WRF-Stampede_c0697813-a8f4-4d8a-b0f3-6808f8538b18+IDontNeedaNode_a3b6133f-f8af-435d-9b2a-76838db535f6/org.apache.airavata.gfac.gsissh.handler.GSISSHInputHandler/state
>     at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>     at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>     at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1228)
>     at org.apache.airavata.gfac.core.utils.GFacUtils.updatePluginState(GFacUtils.java:1013)
>     at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeInFlowHandlers(BetterGfacImpl.java:902)
>     at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.launch(BetterGfacImpl.java:690)
>     at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:481)
>     at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:210)
>     at org.apache.airavata.gfac.core.utils.InputHandlerWorker.call(InputHandlerWorker.java:49)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>     at java.lang.Thread.run(Thread.java:745)
> and
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-Trinity-Stampede_0bd73a38-6931-498f-af7b-d700dc177c43+IDontNeedaNode_db287294-796d-43c1-896d-e3b412b4c8a7/org.apache.airavata.gfac.ssh.handler.AdvancedSCPOutputHandler
>     at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>     at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>     at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1003)
>     at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1031)
>     at org.apache.airavata.gfac.core.utils.GFacUtils.createPluginZnode(GFacUtils.java:935)
>     at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeOutFlowHandlers(BetterGfacImpl.java:939)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)