[ 
https://issues.apache.org/jira/browse/HAMA-642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Edward J. Yoon updated HAMA-642:
--------------------------------

    Fix Version/s:     (was: 0.6.0)

Job never finishes. 

I doubt there's a JDBM's bug (concurrent access or structural bug might be a 
part of the problem) or bug in this patch.

{code}
[oracle@bda01 hama-0.6.0-MEM]$ bin/hama jar hama-examples-0.6.0-SNAPSHOT.jar 
sssp 3 /user/oracle/edward/rangraph/part-r-00000 testout
12/09/23 19:20:12 INFO bsp.FileInputFormat: Total input paths to process : 1
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.12:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.14:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.7:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.8:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.3:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.2:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.17:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.18:50010
12/09/23 19:20:12 INFO net.NetworkTopology: Adding a new node: 
/switch1/192.168.10.5:50010
12/09/23 19:20:12 INFO bsp.FileInputFormat: Total # of splits: 85
12/09/23 19:20:12 INFO bsp.BSPJobClient: Running job: job_201209231919_0001
12/09/23 19:20:15 INFO bsp.BSPJobClient: Current supersteps number: 0
12/09/23 19:20:18 INFO bsp.BSPJobClient: Current supersteps number: 2
12/09/23 19:20:27 INFO bsp.BSPJobClient: Current supersteps number: 3
12/09/23 19:20:36 INFO bsp.BSPJobClient: Current supersteps number: 8
12/09/23 19:20:39 INFO bsp.BSPJobClient: Current supersteps number: 10
12/09/23 19:20:42 INFO bsp.BSPJobClient: Current supersteps number: 12
12/09/23 19:20:48 INFO bsp.BSPJobClient: Current supersteps number: 13
12/09/23 19:21:00 INFO bsp.BSPJobClient: Current supersteps number: 14
12/09/23 19:21:06 INFO bsp.BSPJobClient: Current supersteps number: 15
12/09/23 19:21:15 INFO bsp.BSPJobClient: Current supersteps number: 16
12/09/23 19:21:21 INFO bsp.BSPJobClient: Current supersteps number: 17
12/09/23 19:21:30 INFO bsp.BSPJobClient: Current supersteps number: 18
12/09/23 19:21:36 INFO bsp.BSPJobClient: Current supersteps number: 19
12/09/23 19:21:42 INFO bsp.BSPJobClient: Current supersteps number: 20
12/09/23 19:21:48 INFO bsp.BSPJobClient: Current supersteps number: 21
12/09/23 19:21:54 INFO bsp.BSPJobClient: Current supersteps number: 22
12/09/23 19:21:57 INFO bsp.BSPJobClient: Current supersteps number: 23
12/09/23 19:22:03 INFO bsp.BSPJobClient: Current supersteps number: 24
12/09/23 19:22:06 INFO bsp.BSPJobClient: Current supersteps number: 26
12/09/23 19:22:09 INFO bsp.BSPJobClient: Current supersteps number: 28
12/09/23 19:22:12 INFO bsp.BSPJobClient: Current supersteps number: 31
12/09/23 19:22:15 INFO bsp.BSPJobClient: Current supersteps number: 33
12/09/23 19:22:18 INFO bsp.BSPJobClient: Current supersteps number: 37
12/09/23 19:22:21 INFO bsp.BSPJobClient: Current supersteps number: 40
12/09/23 19:22:24 INFO bsp.BSPJobClient: Current supersteps number: 48
12/09/23 19:22:27 INFO bsp.BSPJobClient: Current supersteps number: 49
12/09/23 19:22:27 INFO bsp.BSPJobClient: The total number of supersteps: 49
12/09/23 19:22:27 INFO bsp.BSPJobClient: Counters: 8
12/09/23 19:22:27 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
12/09/23 19:22:27 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=85
12/09/23 19:22:27 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
12/09/23 19:22:27 INFO bsp.BSPJobClient:     SUPERSTEPS=49
12/09/23 19:22:27 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=4165
12/09/23 19:22:27 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=1692428
12/09/23 19:22:27 INFO bsp.BSPJobClient:     IO_BYTES_READ=1167771205
12/09/23 19:22:27 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_SENT=1157989750
12/09/23 19:22:27 INFO bsp.BSPJobClient:     TASK_INPUT_RECORDS=10000000
12/09/23 19:22:27 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_RECEIVED=578994875
Job Finished in 135.463 seconds

----

[oracle@bda01 hama-0.6.0-JDBM]$ bin/hama jar 
../hama-0.6.0-MEM/hama-examples-0.6.0-SNAPSHOT.jar sssp 3 
/user/oracle/edward/rangraph/part-r-00000 testout
....
12/09/23 20:43:48 INFO bsp.BSPJobClient: Current supersteps number: 485
12/09/23 20:44:00 INFO bsp.BSPJobClient: Current supersteps number: 486
12/09/23 20:44:09 INFO bsp.BSPJobClient: Current supersteps number: 487
12/09/23 20:44:18 INFO bsp.BSPJobClient: Current supersteps number: 488
12/09/23 20:44:30 INFO bsp.BSPJobClient: Current supersteps number: 489
12/09/23 20:44:39 INFO bsp.BSPJobClient: Current supersteps number: 490
12/09/23 20:44:48 INFO bsp.BSPJobClient: Current supersteps number: 491
12/09/23 20:45:00 INFO bsp.BSPJobClient: Current supersteps number: 492
12/09/23 20:45:09 INFO bsp.BSPJobClient: Current supersteps number: 493
12/09/23 20:45:18 INFO bsp.BSPJobClient: Current supersteps number: 494
12/09/23 20:45:30 INFO bsp.BSPJobClient: Current supersteps number: 495
12/09/23 20:45:39 INFO bsp.BSPJobClient: Current supersteps number: 496
12/09/23 20:45:48 INFO bsp.BSPJobClient: Current supersteps number: 497
12/09/23 20:46:00 INFO bsp.BSPJobClient: Current supersteps number: 498
12/09/23 20:46:09 INFO bsp.BSPJobClient: Current supersteps number: 499
12/09/23 20:46:18 INFO bsp.BSPJobClient: Current supersteps number: 500
12/09/23 20:46:30 INFO bsp.BSPJobClient: Current supersteps number: 501
^C[oracle@bda01 hama-0.6.0-JDBM]$ 
[oracle@bda01 hama-0.6.0-JDBM]$ bin/hama job -list all
1 jobs submitted
States are:
        Running : 1     Succeded : 2    Failed : 3      Prep : 4
JobId   State   StartTime       UserName
job_201209231923_0001   1       1348453426073   oracle
[oracle@bda01 hama-0.6.0-JDBM]$ bin/hama job -kill job_201209231923_0001
Killed job job_201209231923_0001
{code}
                
> Make GraphRunner disk based
> ---------------------------
>
>                 Key: HAMA-642
>                 URL: https://issues.apache.org/jira/browse/HAMA-642
>             Project: Hama
>          Issue Type: Improvement
>          Components: graph
>    Affects Versions: 0.5.0
>            Reporter: Thomas Jungblut
>            Assignee: Edward J. Yoon
>         Attachments: HAMA-642_unix_1.patch, HAMA-642_unix_2.patch, 
> HAMA-scale_1.patch, HAMA-scale_2.patch, HAMA-scale_3.patch, HAMA-scale_4.patch
>
>
> To improve scalability we can improve the graph runner to be disk based.
> Which basically means:
> - We have just a single Vertex instance that get's refilled.
> - We directly write vertices to disk after partitioning
> - In every superstep we iterate over the vertices on disk, fill the vertex 
> instance and call the users compute functions
> Problems:
> - State other than vertex value can't be stored easy
> - How do we deal with random access after messages have arrived?
> So I think we should make the graph runner more hybrid, like using the queues 
> we have implemented in the messaging. So the graphrunner can be configured to 
> run completely on disk, in cached mode or in in-memory mode.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to