[ 
https://issues.apache.org/jira/browse/HAMA-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13925659#comment-13925659
 ] 

Anastasis Andronidis commented on HAMA-629:
-------------------------------------------

Yes, I agree.

I was also keep in mind that we do a local aggregation in each peer. My commend 
is more on really large deployment (e.g. 1500 peers) with heavy use of the 
aggregation framework (heavy use = large chunks of data in the aggregation 
messages). 

> Improve RPC Scalability Part 2
> ------------------------------
>
>                 Key: HAMA-629
>                 URL: https://issues.apache.org/jira/browse/HAMA-629
>             Project: Hama
>          Issue Type: Sub-task
>          Components: graph
>    Affects Versions: 0.5.0
>            Reporter: Thomas Jungblut
>             Fix For: 0.7.0
>
>
> There is a problem when all 1k peers would attempt to send to a single peer 
> (let's say a master task in a graph algorithm that aggregates). In this case 
> the peer will start 1k-threads which is using enourmous amount of memory. 
> I think we can coordinate the message sending either with Zookeeper or by 
> using the task id and do a smarter sending chain.
> By the last, I mean, that each task can start at a different offset in the 
> peer array to start sending messages to the other peers. But this won't solve 
> the problem DDoS'ing a single master task.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to