[ 
https://issues.apache.org/jira/browse/HAMA-557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13452725#comment-13452725
 ] 

Edward J. Yoon commented on HAMA-557:
-------------------------------------

And here's my quick performance test results

{code}
== FT disabled ==

[oracle@kr1-bda01 hama-0.6.0-SNAPSHOT]$ bin/hama jar 
hama-examples-0.6.0-SNAPSHOT.jar bench 100 100 50
12/09/10 20:09:43 INFO bsp.BSPJobClient: Running job: job_201209102009_0001
12/09/10 20:09:46 INFO bsp.BSPJobClient: Current supersteps number: 8
12/09/10 20:09:49 INFO bsp.BSPJobClient: Current supersteps number: 49
12/09/10 20:09:52 INFO bsp.BSPJobClient: Current supersteps number: 50
12/09/10 20:09:52 INFO bsp.BSPJobClient: The total number of supersteps: 50
12/09/10 20:09:52 INFO bsp.BSPJobClient: Counters: 8
12/09/10 20:09:52 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
12/09/10 20:09:52 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=85
12/09/10 20:09:52 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
12/09/10 20:09:52 INFO bsp.BSPJobClient:     SUPERSTEPS=50
12/09/10 20:09:52 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=4250
12/09/10 20:09:52 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_SENT=20677838
12/09/10 20:09:52 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=101865
12/09/10 20:09:52 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_RECEIVED=20677838
12/09/10 20:09:52 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_SENT=850000
12/09/10 20:09:52 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_RECEIVED=425000
Job Finished in 9.317 seconds

[oracle@kr1-bda01 hama-0.6.0-SNAPSHOT]$ bin/hama jar 
hama-examples-0.6.0-SNAPSHOT.jar bench 100 10000 50
12/09/10 20:10:24 INFO bsp.BSPJobClient: Running job: job_201209102009_0002
12/09/10 20:10:27 INFO bsp.BSPJobClient: Current supersteps number: 2
12/09/10 20:10:30 INFO bsp.BSPJobClient: Current supersteps number: 17
12/09/10 20:10:33 INFO bsp.BSPJobClient: Current supersteps number: 33
12/09/10 20:10:36 INFO bsp.BSPJobClient: Current supersteps number: 50
12/09/10 20:10:39 INFO bsp.BSPJobClient: The total number of supersteps: 50
12/09/10 20:10:39 INFO bsp.BSPJobClient: Counters: 8
12/09/10 20:10:39 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
12/09/10 20:10:39 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=85
12/09/10 20:10:39 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
12/09/10 20:10:39 INFO bsp.BSPJobClient:     SUPERSTEPS=50
12/09/10 20:10:39 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=4250
12/09/10 20:10:39 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_SENT=234746978
12/09/10 20:10:39 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=204266
12/09/10 20:10:39 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_RECEIVED=234746978
12/09/10 20:10:39 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_SENT=85000000
12/09/10 20:10:39 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_RECEIVED=42500000
Job Finished in 15.279 seconds

[oracle@kr1-bda01 hama-0.6.0-SNAPSHOT]$ bin/hama jar 
hama-examples-0.6.0-SNAPSHOT.jar pagerank /user/oracle/edward/web-Google.txt 
edward/testout
12/09/10 20:13:58 INFO bsp.FileInputFormat: Total input paths to process : 1
12/09/10 20:13:58 INFO bsp.FileInputFormat: Total # of splits: 85
12/09/10 20:13:58 INFO bsp.BSPJobClient: Running job: job_201209102013_0001
12/09/10 20:14:01 INFO bsp.BSPJobClient: Current supersteps number: 2
12/09/10 20:14:04 INFO bsp.BSPJobClient: Current supersteps number: 4
12/09/10 20:14:07 INFO bsp.BSPJobClient: Current supersteps number: 11
12/09/10 20:14:07 INFO bsp.BSPJobClient: The total number of supersteps: 11
12/09/10 20:14:07 INFO bsp.BSPJobClient: Counters: 10
12/09/10 20:14:07 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
12/09/10 20:14:07 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=85
12/09/10 20:14:07 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
12/09/10 20:14:07 INFO bsp.BSPJobClient:     SUPERSTEPS=11
12/09/10 20:14:07 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=935
12/09/10 20:14:07 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_SENT=96714132
12/09/10 20:14:07 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=96246
12/09/10 20:14:07 INFO bsp.BSPJobClient:     IO_BYTES_READ=75380115
12/09/10 20:14:07 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_RECEIVED=96714132
12/09/10 20:14:07 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_SENT=14662796
12/09/10 20:14:07 INFO bsp.BSPJobClient:     TASK_INPUT_RECORDS=5105043
12/09/10 20:14:07 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_RECEIVED=7331398
Job Finished in 9.343 seconds


== FT enabled ==

[oracle@kr1-bda01 hama-0.6.0-SNAPSHOT]$ bin/hama jar 
hama-examples-0.6.0-SNAPSHOT.jar bench 100 100 50
12/09/10 19:26:28 INFO bsp.BSPJobClient: Running job: job_201209101921_0004
12/09/10 19:26:31 INFO bsp.BSPJobClient: Current supersteps number: 1
12/09/10 19:26:34 INFO bsp.BSPJobClient: Current supersteps number: 5
12/09/10 19:26:37 INFO bsp.BSPJobClient: Current supersteps number: 13
12/09/10 19:26:40 INFO bsp.BSPJobClient: Current supersteps number: 17
12/09/10 19:26:43 INFO bsp.BSPJobClient: Current supersteps number: 26
12/09/10 19:26:46 INFO bsp.BSPJobClient: Current supersteps number: 30
12/09/10 19:26:49 INFO bsp.BSPJobClient: Current supersteps number: 38
12/09/10 19:26:52 INFO bsp.BSPJobClient: Current supersteps number: 42
12/09/10 19:26:55 INFO bsp.BSPJobClient: Current supersteps number: 50
12/09/10 19:26:58 INFO bsp.BSPJobClient: The total number of supersteps: 50
12/09/10 19:26:58 INFO bsp.BSPJobClient: Counters: 8
12/09/10 19:26:58 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
12/09/10 19:26:58 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=85
12/09/10 19:26:58 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
12/09/10 19:26:58 INFO bsp.BSPJobClient:     SUPERSTEPS=50
12/09/10 19:26:58 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=4250
12/09/10 19:26:58 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_SENT=20682590
12/09/10 19:26:58 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=168254
12/09/10 19:26:58 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_RECEIVED=20682590
12/09/10 19:26:58 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_SENT=850000
12/09/10 19:26:58 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_RECEIVED=425000
Job Finished in 30.278 seconds

[oracle@kr1-bda01 hama-0.6.0-SNAPSHOT]$ bin/hama jar 
hama-examples-0.6.0-SNAPSHOT.jar bench 100 10000 50
12/09/10 20:11:23 INFO bsp.BSPJobClient: Running job: job_201209102011_0001
12/09/10 20:11:26 INFO bsp.BSPJobClient: Current supersteps number: 0
12/09/10 20:11:29 INFO bsp.BSPJobClient: Current supersteps number: 2
12/09/10 20:11:32 INFO bsp.BSPJobClient: Current supersteps number: 5
12/09/10 20:11:35 INFO bsp.BSPJobClient: Current supersteps number: 12
12/09/10 20:11:38 INFO bsp.BSPJobClient: Current supersteps number: 15
12/09/10 20:11:41 INFO bsp.BSPJobClient: Current supersteps number: 21
12/09/10 20:11:44 INFO bsp.BSPJobClient: Current supersteps number: 23
12/09/10 20:11:47 INFO bsp.BSPJobClient: Current supersteps number: 29
12/09/10 20:11:50 INFO bsp.BSPJobClient: Current supersteps number: 32
12/09/10 20:11:53 INFO bsp.BSPJobClient: Current supersteps number: 37
12/09/10 20:11:56 INFO bsp.BSPJobClient: Current supersteps number: 40
12/09/10 20:11:59 INFO bsp.BSPJobClient: Current supersteps number: 47
12/09/10 20:12:02 INFO bsp.BSPJobClient: Current supersteps number: 49
12/09/10 20:12:05 INFO bsp.BSPJobClient: Current supersteps number: 50
12/09/10 20:12:05 INFO bsp.BSPJobClient: The total number of supersteps: 50
12/09/10 20:12:05 INFO bsp.BSPJobClient: Counters: 8
12/09/10 20:12:05 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
12/09/10 20:12:05 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=85
12/09/10 20:12:05 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
12/09/10 20:12:05 INFO bsp.BSPJobClient:     SUPERSTEPS=50
12/09/10 20:12:05 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=4250
12/09/10 20:12:05 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_SENT=234747503
12/09/10 20:12:05 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=174675
12/09/10 20:12:05 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_RECEIVED=234747503
12/09/10 20:12:05 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_SENT=85000000
12/09/10 20:12:05 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_RECEIVED=42500000
Job Finished in 42.34 seconds

[oracle@kr1-bda01 hama-0.6.0-SNAPSHOT]$ bin/hama jar 
hama-examples-0.6.0-SNAPSHOT.jar pagerank /user/oracle/edward/web-Google.txt 
edward/testout
12/09/10 20:12:47 INFO bsp.FileInputFormat: Total input paths to process : 1
12/09/10 20:12:47 INFO bsp.FileInputFormat: Total # of splits: 85
12/09/10 20:12:48 INFO bsp.BSPJobClient: Running job: job_201209102011_0002
12/09/10 20:12:51 INFO bsp.BSPJobClient: Current supersteps number: 0
12/09/10 20:12:54 INFO bsp.BSPJobClient: Current supersteps number: 2
12/09/10 20:13:00 INFO bsp.BSPJobClient: Current supersteps number: 7
12/09/10 20:13:03 INFO bsp.BSPJobClient: Current supersteps number: 11
12/09/10 20:13:06 INFO bsp.BSPJobClient: The total number of supersteps: 11
12/09/10 20:13:06 INFO bsp.BSPJobClient: Counters: 10
12/09/10 20:13:06 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.JobInProgress$JobCounter
12/09/10 20:13:06 INFO bsp.BSPJobClient:     LAUNCHED_TASKS=85
12/09/10 20:13:06 INFO bsp.BSPJobClient:   
org.apache.hama.bsp.BSPPeerImpl$PeerCounter
12/09/10 20:13:06 INFO bsp.BSPJobClient:     SUPERSTEPS=11
12/09/10 20:13:06 INFO bsp.BSPJobClient:     SUPERSTEP_SUM=935
12/09/10 20:13:06 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_SENT=96730041
12/09/10 20:13:06 INFO bsp.BSPJobClient:     TIME_IN_SYNC_MS=205603
12/09/10 20:13:06 INFO bsp.BSPJobClient:     IO_BYTES_READ=75380115
12/09/10 20:13:06 INFO bsp.BSPJobClient:     COMPRESSED_BYTES_RECEIVED=96730041
12/09/10 20:13:06 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_SENT=14662796
12/09/10 20:13:06 INFO bsp.BSPJobClient:     TASK_INPUT_RECORDS=5105043
12/09/10 20:13:06 INFO bsp.BSPJobClient:     TOTAL_MESSAGES_RECEIVED=7331398
Job Finished in 18.297 seconds
{code}
                
> Implement Checkpointing service in Hama
> ---------------------------------------
>
>                 Key: HAMA-557
>                 URL: https://issues.apache.org/jira/browse/HAMA-557
>             Project: Hama
>          Issue Type: Sub-task
>          Components: bsp core
>    Affects Versions: 0.6.0
>            Reporter: Suraj Menon
>            Assignee: Suraj Menon
>             Fix For: 0.6.0
>
>         Attachments: HAMA-505-557-610-611-v1.patch, 
> HAMA-505-557-610-611-v2.patch, HAMA-557-ft-framework.patch
>
>
> Implement checkpointing service in Apache Hama. My patches for HAMA-533 and 
> HAMA-534 are blocked on this.
> - Checkpointing should be done as messages are either sent or received. I 
> prefer while receiving messages, as we can achieve some parallelism with 
> asynchronous messages. Please comment if you differ.
> - BSPMaster should hold the checkpoint status for each task. Checkpoint 
> status includes superstep count and file information for which checkpointing 
> is complete
> - MessageManager should notify Checkpointer of a new message at BSPPeer.
> - Implement/Reuse MessageBundle class as splitClass in BSPPeerImpl for 
> recovery in initInput.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to