The two pull requests do not go all the way, unfortunately. They cover only the runtime, the API integration part is missing still, unfortunately...
On Mon, Jul 20, 2015 at 5:53 PM, Maximilian Michels <m...@apache.org> wrote: > You could do that but you might run into merge conflicts. Also keep in > mind that it is work in progress :) > > On Mon, Jul 20, 2015 at 4:15 PM, Maximilian Alber < > alber.maximil...@gmail.com> wrote: > >> Thanks! >> >> Ok, cool. If I would like to test it, I just need to merge those two pull >> requests into my current branch? >> >> Cheers, >> Max >> >> On Mon, Jul 20, 2015 at 4:02 PM, Maximilian Michels <m...@apache.org> >> wrote: >> >>> Now that makes more sense :) I thought by "nested iterations" you meant >>> iterations in Flink that can be nested, i.e. starting an iteration inside >>> an iteration. >>> >>> The caching/pinning of intermediate results is still a work in progress >>> in Flink. It is actually in a state where it could be merged but some >>> pending pull requests got delayed because priorities changed a bit. >>> >>> Essentially, we need to merge these two pull requests: >>> >>> https://github.com/apache/flink/pull/858 >>> This introduces a session management which allows to keep the >>> ExecutionGraph for the session. >>> >>> https://github.com/apache/flink/pull/640 >>> Implements the actual backtracking and caching of the results. >>> >>> Once these are in, we can change the Java/Scala API to support >>> backtracking. I don't exactly know how Spark's API does it but, essentially >>> it should work then by just creating new operations on an existing DataSet >>> and submit to the cluster again. >>> >>> Cheers, >>> Max >>> >>> On Mon, Jul 20, 2015 at 3:31 PM, Maximilian Alber < >>> alber.maximil...@gmail.com> wrote: >>> >>>> Oh sorry, my fault. When I wrote it, I had iterations in mind. >>>> >>>> What I actually wanted to say, how "resuming from intermediate >>>> results" will work with (non-nested) "non-Flink" iterations? And with >>>> iterations I mean something like this: >>>> >>>> while(...): >>>> - change params >>>> - submit to cluster >>>> >>>> where the executed Flink-program is more or less the same at each >>>> iterations. But with changing input sets, which are reused between >>>> different loop iterations. >>>> >>>> I might got something wrong, because in our group we mentioned caching >>>> a lá Spark for Flink and someone came up that "pinning" will do that. Is >>>> that somewhat right? >>>> >>>> Thanks and Cheers, >>>> Max >>>> >>>> On Mon, Jul 20, 2015 at 1:06 PM, Maximilian Michels <m...@apache.org> >>>> wrote: >>>> >>>>> "So it is up to debate how the support for resuming from intermediate >>>>> results will look like." -> What's the current state of that debate? >>>>> >>>>> Since there is no support for nested iterations that I know of, the >>>>> debate how intermediate results are integrated has not started yet. >>>>> >>>>> >>>>>> "Intermediate results are not produced within the iterations cycles." >>>>>> -> Ok, if there are none, what does it have to do with that debate? :-) >>>>>> >>>>> >>>>> I was referring to the existing support for intermediate results >>>>> within iterations. If we were to implement nested iterations, this could >>>>> (possibly) change. This is all very theoretical because there are no plans >>>>> to support nested iterations. >>>>> >>>>> Hope this clarifies. Otherwise, please restate your question because I >>>>> might have misunderstood. >>>>> >>>>> Cheers, >>>>> Max >>>>> >>>>> >>>>> On Mon, Jul 20, 2015 at 12:11 PM, Maximilian Alber < >>>>> alber.maximil...@gmail.com> wrote: >>>>> >>>>>> Thanks for the answer! But I need some clarification: >>>>>> >>>>>> "So it is up to debate how the support for resuming from intermediate >>>>>> results will look like." -> What's the current state of that debate? >>>>>> "Intermediate results are not produced within the iterations cycles." >>>>>> -> Ok, if there are none, what does it have to do with that debate? :-) >>>>>> >>>>>> Cheers, >>>>>> Max >>>>>> >>>>>> On Mon, Jul 20, 2015 at 10:50 AM, Maximilian Michels <m...@apache.org> >>>>>> wrote: >>>>>> >>>>>>> Hi Max, >>>>>>> >>>>>>> You are right, there is no support for nested iterations yet. As far >>>>>>> as I know, there are no concrete plans to add support for it. So it is >>>>>>> up >>>>>>> to debate how the support for resuming from intermediate results will >>>>>>> look >>>>>>> like. Intermediate results are not produced within the iterations >>>>>>> cycles. >>>>>>> Same would be true for nested iterations. So the behavior for resuming >>>>>>> from >>>>>>> intermediate results should be alike for nested iterations. >>>>>>> >>>>>>> Cheers, >>>>>>> Max >>>>>>> >>>>>>> On Fri, Jul 17, 2015 at 4:26 PM, Maximilian Alber < >>>>>>> alber.maximil...@gmail.com> wrote: >>>>>>> >>>>>>>> Hi Flinksters, >>>>>>>> >>>>>>>> as far as I know, there is still no support for nested iterations >>>>>>>> planned. Am I right? >>>>>>>> >>>>>>>> So my question is how such use cases should be handled in the >>>>>>>> future. >>>>>>>> More specific: when pinning/caching will be available, you suggest >>>>>>>> to use that feature and program in "Spark" style? Or is there some >>>>>>>> other, >>>>>>>> more flexible, mechanism planned for loops? >>>>>>>> >>>>>>>> Cheers, >>>>>>>> Max >>>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> >