[ 
https://issues.apache.org/jira/browse/HAMA-359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13035230#comment-13035230
 ] 

Edward J. Yoon commented on HAMA-359:
-------------------------------------

I tested after changing 'Thread.sleep(100);' to 'Thread.sleep(10000);' at 
BSPPeer.sync() method and finally, my job successfully done.

{code}

2011-05-18 15:15:27,147 INFO org.apache.hadoop.ipc.Server: IPC Server listener 
on 40000: starting
2011-05-18 15:15:27,149 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0 
on 40000: starting
2011-05-18 15:15:27,151 INFO org.apache.hama.bsp.BSPMaster: Starting RUNNING
2011-05-18 15:22:48,062 DEBUG org.apache.hama.bsp.JobInProgress: numBSPTasks: 16
2011-05-18 15:22:48,065 DEBUG org.apache.hama.bsp.JobInProgress: Job is 
initialized.
2011-05-18 16:29:27,582 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000005_0' has finished successfully.
2011-05-18 16:29:27,583 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000005' has completed.
2011-05-18 16:29:27,806 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000004_0' has finished successfully.
2011-05-18 16:29:27,806 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000004' has completed.
2011-05-18 16:29:28,336 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000015_0' has finished successfully.
2011-05-18 16:29:28,336 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000015' has completed.
2011-05-18 16:29:28,517 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000000_0' has finished successfully.
2011-05-18 16:29:28,517 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000000' has completed.
2011-05-18 16:29:28,524 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000013_0' has finished successfully.
2011-05-18 16:29:28,524 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000013' has completed.
2011-05-18 16:29:28,589 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000012_0' has finished successfully.
2011-05-18 16:29:28,589 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000012' has completed.
2011-05-18 16:29:28,602 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000001_0' has finished successfully.
2011-05-18 16:29:28,602 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000001' has completed.
2011-05-18 16:29:28,775 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000014_0' has finished successfully.
2011-05-18 16:29:28,775 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000014' has completed.
2011-05-18 16:29:28,909 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000010_0' has finished successfully.
2011-05-18 16:29:28,909 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000010' has completed.
2011-05-18 16:29:28,914 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000007_0' has finished successfully.
2011-05-18 16:29:28,914 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000007' has completed.
2011-05-18 16:29:28,960 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000006_0' has finished successfully.
2011-05-18 16:29:28,960 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000006' has completed.
2011-05-18 16:29:29,148 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000011_0' has finished successfully.
2011-05-18 16:29:29,148 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000011' has completed.
2011-05-18 16:29:29,199 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000008_0' has finished successfully.
2011-05-18 16:29:29,199 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000008' has completed.
2011-05-18 16:29:29,244 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000009_0' has finished successfully.
2011-05-18 16:29:29,244 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000009' has completed.
2011-05-18 16:29:29,274 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000002_0' has finished successfully.
2011-05-18 16:29:29,274 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000002' has completed.
2011-05-18 16:29:29,392 INFO org.apache.hama.bsp.JobInProgress: Taskid 
'attempt_201105181515_0001_000003_0' has finished successfully.
2011-05-18 16:29:29,392 INFO org.apache.hama.bsp.TaskInProgress: Task 
'task_201105181515_0001_000003' has completed.
2011-05-18 16:29:29,395 DEBUG org.apache.hama.bsp.JobInProgress: Job 
successfully done.
2011-05-18 16:32:48,365 DEBUG org.apache.hama.bsp.BSPMaster: returns all jobs: 1
{code}

>> Is this still related to the barrier sync?

Yes. the problem related with zk node creation/deletion logic in enterBarrier() 
and leaveBarrier() methods. Sometimes they occurs at the same time.

>> Increasing the timeout won't fix the problem with it.

As i mentioned on chat, JVM garbage collection pause causes zk session time-out 
errors.

> Development of Shortest Path Finding Algorithm
> ----------------------------------------------
>
>                 Key: HAMA-359
>                 URL: https://issues.apache.org/jira/browse/HAMA-359
>             Project: Hama
>          Issue Type: New Feature
>          Components: examples
>    Affects Versions: 0.2.0
>            Reporter: Edward J. Yoon
>            Assignee: Thomas Jungblut
>              Labels: gsoc, gsoc2011, mentor
>             Fix For: 0.3.0
>
>         Attachments: HAMA-359-v2.patch, HAMA-359-v3.patch, HAMA-359-v4.patch, 
> HAMA-359.patch, eddie.patch
>
>   Original Estimate: 2016h
>  Remaining Estimate: 2016h
>
> The goal of this project is development of parallel algorithm for finding a 
> Shortest Path using Hama BSP.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to