[ 
https://issues.apache.org/jira/browse/HAMA-387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13065764#comment-13065764
 ] 

Edward J. Yoon commented on HAMA-387:
-------------------------------------

HAMA-387_v04.patch is still problematic.

The job hung as soon as it was started.

{code}
root@hnode01:/usr/local/src/hama-trunk# bin/hama jar hama-examples-0.4.0-incubating-SNAPSHOT.jar bench 2 2 200
11/07/15 17:00:05 DEBUG bsp.BSPJobClient: BSPJobClient.submitJobDir: hdfs://hnode15:9000/tmp/hadoop-root/bsp/system/submit_157sln
11/07/15 17:00:06 INFO bsp.BSPJobClient: Running job: job_201107151659_0001
11/07/15 17:00:09 INFO bsp.BSPJobClient: Current supersteps number: 0

----
2011-07-15 17:32:25,637 INFO org.apache.hama.bsp.TaskRunner: attempt_201107151659_0001_000011_0 11/07/15 17:32:25 DEBUG bsp.BSPPeer: hnode08:61000 is in superstep 0
2011-07-15 17:32:25,638 INFO org.apache.hama.bsp.TaskRunner: attempt_201107151659_0001_000011_0 11/07/15 17:32:25 DEBUG bsp.BSPPeer: hnode15:61000 is in superstep 46
2011-07-15 17:32:25,639 INFO org.apache.hama.bsp.TaskRunner: attempt_201107151659_0001_000011_0 11/07/15 17:32:25 DEBUG bsp.BSPPeer: hnode06:61000 is in superstep 0
2011-07-15 17:32:25,640 INFO org.apache.hama.bsp.TaskRunner: attempt_201107151659_0001_000011_0 11/07/15 17:32:25 DEBUG bsp.BSPPeer: hnode10:61000 is in superstep 0
2011-07-15 17:32:25,641 INFO org.apache.hama.bsp.TaskRunner: attempt_201107151659_0001_000011_0 11/07/15 17:32:25 DEBUG bsp.BSPPeer: hnode16:61000 is in superstep 0
2011-07-15 17:32:25,642 INFO org.apache.hama.bsp.TaskRunner: attempt_201107151659_0001_000011_0 11/07/15 17:32:25 DEBUG bsp.BSPPeer: [cnode05:61000] enter the enterbarrier: 0
{code}

> Add task ID and superstep count informations to lock file
> ---------------------------------------------------------
>
>                 Key: HAMA-387
>                 URL: https://issues.apache.org/jira/browse/HAMA-387
>             Project: Hama
>          Issue Type: Improvement
>          Components: bsp
>    Affects Versions: 0.3.0
>            Reporter: Edward J. Yoon
>            Assignee: Edward J. Yoon
>             Fix For: 0.4.0
>
>         Attachments: HAMA-387_v02.patch, HAMA-387_v03.patch, 
> HAMA-387_v04.patch, new.patch, sleepless.patch
>
>
> I think the lock file must include:
>  * the job ID
>  * the task ID of the lock file owner
>  * the current superstep count
> so that ownership and validity of the lock can be checked.
> Currently lock files are named by hostname, but multiple tasks may run on a 
> single groomserver in the future. 
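
For illustration only, here is a minimal sketch of how the three fields listed above could be packed into a lock file name and checked later. The class name LockFileName, the "-" separator, and the isOwnedBy helper are all hypothetical; none of them is taken from the attached patches or from the Hama code base. The sketch assumes that job and task IDs never contain the separator.

```java
/**
 * Hypothetical helper (not part of Hama): encodes job ID, task ID and
 * superstep count into a single lock file name and decodes it again.
 */
public final class LockFileName {
    // Assumed separator; job/task IDs such as "job_201107151659_0001"
    // use underscores, so "-" is assumed not to occur inside them.
    private static final String SEP = "-";

    public final String jobId;
    public final String taskId;
    public final long superstep;

    public LockFileName(String jobId, String taskId, long superstep) {
        if (jobId.isEmpty() || taskId.isEmpty()
                || jobId.contains(SEP) || taskId.contains(SEP)) {
            throw new IllegalArgumentException("invalid job or task ID");
        }
        if (superstep < 0) {
            throw new IllegalArgumentException("superstep must be >= 0");
        }
        this.jobId = jobId;
        this.taskId = taskId;
        this.superstep = superstep;
    }

    /** Builds the name: jobId, taskId and superstep joined by the separator. */
    public String format() {
        return jobId + SEP + taskId + SEP + superstep;
    }

    /** Parses a name produced by format(); rejects anything else. */
    public static LockFileName parse(String name) {
        String[] parts = name.split(SEP, -1);
        if (parts.length != 3) {
            throw new IllegalArgumentException("malformed lock name: " + name);
        }
        // Long.parseLong throws NumberFormatException, which is itself
        // an IllegalArgumentException, for a non-numeric superstep.
        return new LockFileName(parts[0], parts[1], Long.parseLong(parts[2]));
    }

    /** Ownership check: same job and same task. */
    public boolean isOwnedBy(String job, String task) {
        return jobId.equals(job) && taskId.equals(task);
    }
}
```

With a name of this shape, a peer could parse every lock file it finds at the barrier and ignore (or report) entries whose job ID or superstep count does not match its own, instead of relying on the hostname alone.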

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
