[ 
https://issues.apache.org/jira/browse/SPARK-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ankur Dave updated SPARK-2011:
------------------------------

    Description: In the iteration loop, Pregel currently performs an innerJoin 
to apply messages to vertices followed by an outerJoinVertices to join the 
resulting subset of vertices back to the graph. These two operations could be 
merged into a single call to joinVertices, which should be reimplemented in a 
more efficient manner. This would allow us to examine only the vertices that 
received messages.  (was: In the iteration loop, Pregel currently performs an 
innerJoin to apply messages to vertices followed by an outerJoinVertices to 
join the resulting subset of vertices back to the graph. These two operations 
could be merged into a single specialized operation called updateVertices. This 
would allow us to examine only the vertices that received messages, and it 
might eliminate a stage in every iteration.)

> Eliminate duplicate join in Pregel
> ----------------------------------
>
>                 Key: SPARK-2011
>                 URL: https://issues.apache.org/jira/browse/SPARK-2011
>             Project: Spark
>          Issue Type: Improvement
>          Components: GraphX
>            Reporter: Ankur Dave
>            Assignee: Ankur Dave
>
> In the iteration loop, Pregel currently performs an innerJoin to apply 
> messages to vertices followed by an outerJoinVertices to join the resulting 
> subset of vertices back to the graph. These two operations could be merged 
> into a single call to joinVertices, which should be reimplemented in a more 
> efficient manner. This would allow us to examine only the vertices that 
> received messages.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to