I think running the program multiple times is a reasonable way to start
working on this.

I would also try to see whether this can be rewritten as a non-nested
iteration. Algorithms with nested iterations may have much more overhead
to start with.
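
To make the "run it multiple times" idea concrete, here is a minimal sketch
in plain Python (not Flink code) of the algorithm described in the quoted
message below. The outer loop lives in ordinary driver code, and only the
inner extension step would be a single iteration per program run. All names
(extend_region, find_dense_regions, dense_threshold, min_density) are
hypothetical, and the 4-neighborhood and the mean-density stopping rule are
assumptions; the "other conditions" mentioned in the thread are not modeled.

```python
from typing import Dict, List, Set, Tuple

Cell = Tuple[int, int]  # assumed: a cell is identified by integer grid coordinates


def neighbors(cell: Cell) -> List[Cell]:
    """Assumed 4-neighborhood on the discretized grid."""
    x, y = cell
    return [(x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)]


def extend_region(seed: Cell, counts: Dict[Cell, int], min_density: float) -> Set[Cell]:
    """Inner iteration: grow a region from `seed` while its mean density
    (points per cell) stays at or above `min_density`."""
    region = {seed}
    total = counts[seed]
    while True:
        # Candidate cells: known cells adjacent to the region, not yet in it.
        frontier = {n for c in region for n in neighbors(c)
                    if n in counts and n not in region}
        if not frontier:
            break
        # Pick the densest candidate; ties are broken by coordinates.
        best = max(frontier, key=lambda c: (counts[c], c))
        if (total + counts[best]) / (len(region) + 1) < min_density:
            break
        region.add(best)
        total += counts[best]
    return region


def find_dense_regions(counts: Dict[Cell, int], dense_threshold: int,
                       min_density: float) -> List[Set[Cell]]:
    """Outer loop, kept in the driver: each pass stands for one program run."""
    remaining = dict(counts)
    regions = []
    while remaining:
        # Start from the densest remaining cell.
        seed = max(remaining, key=lambda c: (remaining[c], c))
        if remaining[seed] < dense_threshold:
            break  # no dense cells left
        region = extend_region(seed, remaining, min_density)
        regions.append(region)
        # Remove the used cells before the next pass.
        for c in region:
            del remaining[c]
    return regions
```

In a real job, the `remaining` set would have to be written out and read
back between runs, which is the cost Benoît asks about below.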

Stephan


On Tue, Apr 14, 2015 at 3:53 PM, Benoît Hanotte <
benoit.jc.hano...@campus.tu-berlin.de> wrote:

>  Thanks for your quick answers!
>
> The algorithm is the following: I've got a spatial set of data and I want
> to find dense regions. The space is beforehand discretized into "cells" of
> a fixed size. Then, for each dense cell (1st iteration), starting with the
> most dense, the algorithm tries to extend it with neighboring cells (the
> nested iteration) until the density of the obtained extended cell is under
> a threshold (and some other conditions). When that's done, the used cells
> are removed from the input set, and the next most dense cell is extended.
>
> Would running the program multiple times be efficient in that case (I
> would probably have to save the datasets and reload them)? As for using
> the mapPartition operator, I think it wouldn't be possible, since the
> nested iteration needs the entire cells dataset.
>
> Best,
>
> Benoît
>
>
> On Tue, 14 Apr 2015 15:16:48 +0200, Till Rohrmann <till.rohrm...@gmail.com>
> wrote:
>
> If your inner iteration happens to work only on the data of a single
> partition, then you can also implement this iteration as part of a
> mapPartition operator. The only problem there would be that you have to
> keep all the partition's data on the heap, if you need access to it.
>
> Cheers,
>
> Till
>
> On Tue, Apr 14, 2015 at 3:11 PM, Stephan Ewen <se...@apache.org> wrote:
>
>> Hi Benoît!
>>
>> You are right, the nested iterations are currently not supported.
>>
>> The test you found actually checks that the Optimizer gives a good error
>> message when encountering nested iterations.
>>
>> Can you write your program as one iteration (the inner one) and start
>> the program multiple times to simulate the nesting?
>>
>> Greetings,
>> Stephan
>>
>>
>> On Tue, Apr 14, 2015 at 8:11 AM, Benoît Hanotte <
>> benoit.jc.hano...@campus.tu-berlin.de> wrote:
>>
>>> Hello,
>>>
>>> I'm implementing an algorithm which requires nested iterations, and,
>>> from what I understood, this feature was not yet available in Flink [1],
>>> and my experiments with 2 nested bulk iterations seem to confirm that.
>>> However I came across a Flink unit test [2] using nested iterations, so I'm
>>> now a bit confused. Could you please give me some insight on what is
>>> supported or not in the current state of the project?
>>>
>>> Thanks!
>>>
>>> Benoît.
>>>
>>> [1]
>>> http://mail-archives.apache.org/mod_mbox/flink-user/201411.mbox/%3CCANC1h_tpNFWwrnm14+Et1yBvkQPQ1-pfk=iupp5un90zgug...@mail.gmail.com%3E
>>> [2]
>>> https://github.com/apache/flink/blob/master/flink-optimizer/src/test/java/org/apache/flink/optimizer/NestedIterationsTest.java
>>>
>>
>>
>
>
>
>
