[
https://issues.apache.org/jira/browse/HAMA-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13009128#comment-13009128
]
Edward J. Yoon commented on HAMA-370:
-------------------------------------
Just curious, what's the benefits of using a phi accrual detection compared w/
heartbeat detection? (GroomServer Failure)
And, when BSP task failed during processing, we can simply re-start the task to
provides fault tolerance.
> Failure detector for Hama
> -------------------------
>
> Key: HAMA-370
> URL: https://issues.apache.org/jira/browse/HAMA-370
> Project: Hama
> Issue Type: New Feature
> Components: bsp
> Affects Versions: 0.3.0
> Environment: GNU/ Debian, JDK 1.6.0_22-b04
> Reporter: ChiaHung Lin
> Assignee: ChiaHung Lin
> Labels: patch
> Fix For: 0.3.0
>
> Attachments: HAMA-370.patch, HAMA-370.patch
>
>
> In order to enable fault tolerance service, BSPMaster requires to have
> ability in determining GroomServers' status. This generally can be achieved
> through failure detector. The attached file contains source for such patch.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira