Hello to my squirrels,

I've started looking into FLINK-1943
<https://issues.apache.org/jira/browse/FLINK-1943> and I need some help to
understand what to test and how to do it properly.

In the corresponding Spargel compiler test, the following functionality is
checked:

1. sink: the ship strategy is FORWARD and the parallelism is correct
2. iteration: degree of parallelism
3. solution set join: parallelism and input1 ship strategy is PARTITION_HASH
4. workset join: parallelism, input1 (edges) ship strategy is
PARTITION_HASH and cached, input2 (workset) ship strategy is FORWARD
5. check that the initial partitioning is pushed out of the loop
6. check that the initial workset sort is outside the loop

I have been able to verify 1-4 of the above for the GSA iteration plan, but
I'm not sure how to check (5) and (6) or whether they are expected to hold
in the GSA case.

In [1] you can see what the GSA iteration operators looks like and in [2]
you can see what the visualizer tools generates the GSA connected
components.

Any pointers would be greatly appreciated!

Cheers,
Vasia.

[1]:
https://docs.google.com/drawings/d/1tiNQeOphWtkNXTGlnDJ3Ipanh0Tm2R8sHe8XNyTnf98/edit?usp=sharing
[2]: http://imgur.com/GQZ48ZI

Reply via email to