[ 
https://issues.apache.org/jira/browse/HAMA-511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205365#comment-13205365
 ] 

Suraj Menon commented on HAMA-511:
----------------------------------

Nice to see some views expressed. There is no way we should be giving up the 
simple or rather the new simple fault tolerant BSP API.
The reason, I rushed to express this idea, is for us to keep it in mind while 
we design and implement fault tolerance, which is our current focus and there 
are people already working on it.

As an example, when I am making the checkpointing configurable with a simple 
modulo logic today, I was making it a modulo function of a counter for number 
of times sync() function is called. Now with selective superstep 
synchronization in mind, I have to make the checkpointing logic a function of 
current superstep number.

Regarding the changes to be made, I was encouraged by the design that Thomas 
had in his github repo. 
https://github.com/thomasjungblut/thomasjungblut-common/blob/master/src/de/jungblut/bsp/ft/FaultTolerantBSP.java
I think It already necessitates sending the Superstep array to BSPPeer. For 
selective synchronization, we would need multidimensional array to be sent with 
the column of the array to be executed. The column number would also have to be 
part of the identity of Zookeeper node for synchronization. I agree this is not 
a small task.

In offline mode, we can always implement task precedence constraints with 
multiple batch processing (like Oozie for hadoop-mapreduce). However, I think 
having this flexibility would be really useful for real-time Hama tasks. I feel 
this would give Hama capability to be a framework for implementing distributed 
real-time computation tasks as well. We can evaluate the design of S4(Y!), 
EarlyBird(Twitter) and others to verify this.
                
> Submitting heterogenous supersteps with precedence constraints on Hama
> ----------------------------------------------------------------------
>
>                 Key: HAMA-511
>                 URL: https://issues.apache.org/jira/browse/HAMA-511
>             Project: Hama
>          Issue Type: New Feature
>            Reporter: Suraj Menon
>            Priority: Minor
>         Attachments: Defining supersteps for BSP.pdf
>
>
> Hama should support submission of jobs with support for:
> 1) Skipping unwanted superstep synchronization.
> 2) Run supersteps with heterogenous nature of computation
> 3) Scheduling supersteps with precedence constraints.
> An explanation of these is provided in the attachment.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to