[
https://issues.apache.org/jira/browse/HAMA-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13079889#comment-13079889
]
ChiaHung Lin commented on HAMA-370:
-----------------------------------
This probably provides some information that can be applied to improve with
hadoop's failure detection e.g. static threshold value.
http://research.microsoft.com/en-us/um/people/srikanth/netdb11/netdb11papers/netdb11-final15.pdf
> Failure detector for Hama
> -------------------------
>
> Key: HAMA-370
> URL: https://issues.apache.org/jira/browse/HAMA-370
> Project: Hama
> Issue Type: New Feature
> Components: bsp
> Affects Versions: 0.3.0
> Environment: GNU/ Debian, JDK 1.6.0_22-b04
> Reporter: ChiaHung Lin
> Assignee: ChiaHung Lin
> Labels: patch
> Attachments: HAMA-370.patch, HAMA-370.patch
>
>
> In order to enable fault tolerance service, BSPMaster requires to have
> ability in determining GroomServers' status. This generally can be achieved
> through failure detector. The attached file contains source for such patch.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira