[ 
https://issues.apache.org/jira/browse/FLINK-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martin Kiefer closed FLINK-1597.
--------------------------------
    Resolution: Not a Problem

The execution plans are fine. There is no bug.

> VertexCentricIterations create inefficient execution plans
> ----------------------------------------------------------
>
>                 Key: FLINK-1597
>                 URL: https://issues.apache.org/jira/browse/FLINK-1597
>             Project: Flink
>          Issue Type: Bug
>          Components: Gelly
>    Affects Versions: master
>            Reporter: Martin Kiefer
>
> I did experiments with optimized versions of a graph algorithm that should 
> utilize a secondary sort on the edges and a trade off between superstep 
> numbers and I/O. To my surprise the optimizations did barely affect the 
> execution times. I narrowed it down to inefficient execution plans.
> I assumed that edge sets would be partitioned once at the beginning of a 
> VertexCentricIteration and never be touched again because they can not change 
> during the iteration. I think this should be the desired behavior. What 
> actually happens is that UDFs creating the edge set are pulled inside the 
> iteration and are executed every superstep. This harms the performance of 
> graph algorithms significantly. 
> As a simple example have a look at the execution plan generated for the 
> PageRankExample:
> https://gist.github.com/martinkiefer/28a63f953477e3987b5d



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to