[ https://issues.apache.org/jira/browse/GIRAPH-273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13446215#comment-13446215 ]
Maja Kabiljo commented on GIRAPH-273: ------------------------------------- I still don't see how constant difference of one connection per worker can make a problem, if the problem wasn't already there. I do agree that in most of the applications the approach which I described is not needed, but those are the cases in which even storing aggregators on ZooKeeper was working fine, and basically what ever we do won't matter much. I already have the implementation for this, there are just a few smaller bugs I have to fix, and also I have to wait for RPC to be removed first (I didn't want to leave a big mess with two completely different code paths). In my implementation there is no need for another barrier before sending aggregators, since they can be aggregated on the worker owner even before the computation there is done. And worker waits to receive aggregated values from all the others before sending them to master. > Aggregators shouldn't use Zookeeper > ----------------------------------- > > Key: GIRAPH-273 > URL: https://issues.apache.org/jira/browse/GIRAPH-273 > Project: Giraph > Issue Type: Improvement > Reporter: Maja Kabiljo > Assignee: Maja Kabiljo > > We use Zookeeper znodes to transfer aggregated values from workers to master > and back. Zookeeper is supposed to be used for coordination, and it also has > a memory limit which prevents users from having aggregators with large value > objects. These are the reasons why we should implement aggregators gathering > and distribution in a different way. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira