[
https://issues.apache.org/jira/browse/HAMA-359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13035230#comment-13035230
]
Edward J. Yoon commented on HAMA-359:
-------------------------------------
I tested after changing 'Thread.sleep(100);' to 'Thread.sleep(10000);' at
BSPPeer.sync() method and finally, my job successfully done.
{code}
2011-05-18 15:15:27,147 INFO org.apache.hadoop.ipc.Server: IPC Server listener
on 40000: starting
2011-05-18 15:15:27,149 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0
on 40000: starting
2011-05-18 15:15:27,151 INFO org.apache.hama.bsp.BSPMaster: Starting RUNNING
2011-05-18 15:22:48,062 DEBUG org.apache.hama.bsp.JobInProgress: numBSPTasks: 16
2011-05-18 15:22:48,065 DEBUG org.apache.hama.bsp.JobInProgress: Job is
initialized.
2011-05-18 16:29:27,582 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000005_0' has finished successfully.
2011-05-18 16:29:27,583 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000005' has completed.
2011-05-18 16:29:27,806 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000004_0' has finished successfully.
2011-05-18 16:29:27,806 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000004' has completed.
2011-05-18 16:29:28,336 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000015_0' has finished successfully.
2011-05-18 16:29:28,336 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000015' has completed.
2011-05-18 16:29:28,517 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000000_0' has finished successfully.
2011-05-18 16:29:28,517 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000000' has completed.
2011-05-18 16:29:28,524 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000013_0' has finished successfully.
2011-05-18 16:29:28,524 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000013' has completed.
2011-05-18 16:29:28,589 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000012_0' has finished successfully.
2011-05-18 16:29:28,589 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000012' has completed.
2011-05-18 16:29:28,602 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000001_0' has finished successfully.
2011-05-18 16:29:28,602 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000001' has completed.
2011-05-18 16:29:28,775 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000014_0' has finished successfully.
2011-05-18 16:29:28,775 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000014' has completed.
2011-05-18 16:29:28,909 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000010_0' has finished successfully.
2011-05-18 16:29:28,909 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000010' has completed.
2011-05-18 16:29:28,914 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000007_0' has finished successfully.
2011-05-18 16:29:28,914 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000007' has completed.
2011-05-18 16:29:28,960 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000006_0' has finished successfully.
2011-05-18 16:29:28,960 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000006' has completed.
2011-05-18 16:29:29,148 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000011_0' has finished successfully.
2011-05-18 16:29:29,148 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000011' has completed.
2011-05-18 16:29:29,199 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000008_0' has finished successfully.
2011-05-18 16:29:29,199 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000008' has completed.
2011-05-18 16:29:29,244 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000009_0' has finished successfully.
2011-05-18 16:29:29,244 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000009' has completed.
2011-05-18 16:29:29,274 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000002_0' has finished successfully.
2011-05-18 16:29:29,274 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000002' has completed.
2011-05-18 16:29:29,392 INFO org.apache.hama.bsp.JobInProgress: Taskid
'attempt_201105181515_0001_000003_0' has finished successfully.
2011-05-18 16:29:29,392 INFO org.apache.hama.bsp.TaskInProgress: Task
'task_201105181515_0001_000003' has completed.
2011-05-18 16:29:29,395 DEBUG org.apache.hama.bsp.JobInProgress: Job
successfully done.
2011-05-18 16:32:48,365 DEBUG org.apache.hama.bsp.BSPMaster: returns all jobs: 1
{code}
>> Is this still related to the barrier sync?
Yes. the problem related with zk node creation/deletion logic in enterBarrier()
and leaveBarrier() methods. Sometimes they occurs at the same time.
>> Increasing the timeout won't fix the problem with it.
As i mentioned on chat, JVM garbage collection pause causes zk session time-out
errors.
> Development of Shortest Path Finding Algorithm
> ----------------------------------------------
>
> Key: HAMA-359
> URL: https://issues.apache.org/jira/browse/HAMA-359
> Project: Hama
> Issue Type: New Feature
> Components: examples
> Affects Versions: 0.2.0
> Reporter: Edward J. Yoon
> Assignee: Thomas Jungblut
> Labels: gsoc, gsoc2011, mentor
> Fix For: 0.3.0
>
> Attachments: HAMA-359-v2.patch, HAMA-359-v3.patch, HAMA-359-v4.patch,
> HAMA-359.patch, eddie.patch
>
> Original Estimate: 2016h
> Remaining Estimate: 2016h
>
> The goal of this project is development of parallel algorithm for finding a
> Shortest Path using Hama BSP.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira