[ 
https://issues.apache.org/jira/browse/HAMA-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13063815#comment-13063815
 ] 

ChiaHung Lin commented on HAMA-363:
-----------------------------------

My test on my current vms looks work. 

But after reading parts of the paper[1], I discover the current implementation 
lacks of features such as federation, data transport, etc. The federation can 
provide aggregative information so that e.g. BSPMaster can proceed further by 
deciding if a worker fails, which is required by monitor service. In addition, 
plugin part may probably need to be improved as well, though at the moment from 
the developer's viewpoint it looks ok. 

[1]. The ganglia distributed monitoring system: design, implementation, and 
experience. http://ganglia.info/papers/science.pdf

> Add network condition monitoring function to BSPMaster
> ------------------------------------------------------
>
>                 Key: HAMA-363
>                 URL: https://issues.apache.org/jira/browse/HAMA-363
>             Project: Hama
>          Issue Type: New Feature
>          Components: bsp
>    Affects Versions: 0.3.0
>            Reporter: Edward J. Yoon
>             Fix For: 0.4.0
>
>         Attachments: HAMA-363.patch, HAMA-363.patch, HAMA-363.patch
>
>
> There's few reason why this issue is important. Basically Master server 
> should know the status of cluster:
>  * to optimize the network usage
>  * to handle network connectivity problems
>  * to handle different network conditions
> And, I would like to see some network usage statistics.
> In this issue, we implement only basic function which is collect network 
> usage.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to