[ 
https://issues.apache.org/jira/browse/HAMA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13103601#comment-13103601
 ] 

Vinod Kumar Vavilapalli commented on HAMA-431:
----------------------------------------------

ChiaHung,

bq. With polling, it seems that chances are the polling would not reach the 
agreement (there could always have 1 process missing) in an unfortunate timing 
case. Also, as the processes increase probably it would increase the loading 
for master to deal with polling tasks. 
Regarding the missing processes, which we call stragglers in MapReduce: isn't 
the API such that no process makes progress until all of the processes have 
performed the barrier sync?
Regarding the load, even the MR AM, which uses a Hadoop RPC server, has similar 
requirements, on the order of tens of thousands of tasks. That amount of 
scalability should be enough for Hama's case as well. And as Thomas mentioned, 
each BSPMaster only needs to serve its own job's BSPPeers, so that should help 
too.
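
To make the barrier point concrete, here is a minimal sketch of a master-side 
barrier that peers poll over RPC. It is only an illustration under assumptions: 
the class PollingBarrier and its methods enter() and isReleased() are 
hypothetical names, not part of Hama's or Hadoop's API. A straggler simply 
delays the release; a polling peer cannot observe agreement before every peer 
has arrived.

```java
import java.util.HashSet;
import java.util.Set;

/** Hypothetical master-side barrier that peers poll; not Hama's actual API. */
public class PollingBarrier {
    private final int expectedPeers;                  // number of peers in this job
    private final Set<String> arrived = new HashSet<>();
    private long superstep = 0;                       // superstep currently being synchronized

    public PollingBarrier(int expectedPeers) {
        this.expectedPeers = expectedPeers;
    }

    /** A peer reports that it has reached the barrier of the given superstep. */
    public synchronized void enter(String peerId, long peerSuperstep) {
        if (peerSuperstep != superstep) {
            return;                                   // stale or early report is ignored
        }
        arrived.add(peerId);                          // repeated reports from one peer count once
        if (arrived.size() == expectedPeers) {
            arrived.clear();                          // everyone is here: release and advance
            superstep++;
        }
    }

    /** Polled by peers: true only after all peers have entered that superstep. */
    public synchronized boolean isReleased(long peerSuperstep) {
        return superstep > peerSuperstep;
    }
}
```

Each peer would call enter() once and then poll isReleased() with the same 
superstep number until it returns true. Since one such master serves only the 
peers of its own job, the polling load grows with the job size, not the 
cluster size.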

bq. In addition, my understanding is the integration with MRV2 would be just an 
additional support so that MR job/ application can be submitted without 
rewriting to use hama for computation.
That is not clear to me. But if you mean the ability to run the current 
BSP jobs without rewriting them, then yes, we will support API-level 
compatibility.

> MapReduce NG integration
> ------------------------
>
>                 Key: HAMA-431
>                 URL: https://issues.apache.org/jira/browse/HAMA-431
>             Project: Hama
>          Issue Type: New Feature
>            Reporter: Thomas Jungblut
>            Assignee: Thomas Jungblut
>
> We should take a look at how to integrate Hama's BSP Engine to Hadoop's 
> nextGen application platform.
> Can be currently found in the 0.23 branch.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
