[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2017-12-05 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 well..major runtime changes are coming with FLIP-6, 15 and 16 so I would suggest you watch those. Loop FT will be included in one of these along with other loop redesign features. ---

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2017-03-27 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 sweet! thanks @StefanRRichter --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-27 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r108166093 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationHead.java --- @@ -17,100 +17,164

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-27 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r108152267 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationTail.java --- @@ -64,6 +70,22 @@ public void init

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2017-03-27 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Thanks for the review @gyfora and @StephanEwen , these are very good points. @StephanEwen makes sense to not really index/keep metadata of individual records in log slices, it is extra

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2017-03-24 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Hey. Any update/opinion/something anyone? Just a gentle reminder, sorry if this sounds a bit desperate :) --- If your project is set up for it, you can reply to this email and have your

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2017-03-15 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 I have just rebased to the current master. Please let me wrap this up. It has been over a year :) Unregistering state in the OperatorStateStore is very tiny fix. @StephanEwen

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2017-03-10 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Hey @addisonj. Sure! You could perhaps review the changes and maybe see how to discard empty operator states if you are motivated. This is the only pending issue for this PR. thanks

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2017-01-19 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 The last update implements a variant of what @StephanEwen proposes. We have put some more thought on this offline too thanks to @gyfora ! The idea is that instead of putting records to each

[GitHub] flink issue #3088: [FLINK-5407] Fix savepoints for iterative jobs

2017-01-13 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/3088 Hey! In this upcoming [PR ](https://github.com/apache/flink/pull/1668) there is a custom operator responsible for the logging at the iteration head. Hope this does not break your tests in case

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-12-23 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 @StephanEwen could you check my question above? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-12-16 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Working on it atm . I decided to make the following optimisations but want to very quickly make sure that async checkpointing works the way I believe it does: - Most importantly, I am

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-12-13 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 agreed @StephanEwen! I will do that. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-12-13 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 These are some good points @StephanEwen, thanks for checking it. How about the following, regarding each issue: - `Concurrent Checkpoints`: Looks like an improvement but I can sure

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-12-12 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Exactly, these two issues do not depend on each other. No doubt loop FT is the first thing that can enable iterations in a production deployment so I would merge that first. Thank you

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-12-12 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Hey again, @StephanEwen @uce . When you have 10 min can you take a look to see if this is acceptable? I would not like to leave this here for months again, it has been out way too long

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-11-22 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Ok, so I am progressing this a bit independently from the termination stuff and then we rebase to the first PR that is merged. I just changed everything and rebased to the current master

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-09-28 Thread senorcarbone
Github user senorcarbone commented on the issue: https://github.com/apache/flink/pull/1668 Hey! Good to be back :) . Let's fix this properly, as @StephanEwen recommended it now that there is some time. We are writing together with @FouadMA a FLIP to address major loop fixes

[GitHub] flink pull request: [FLINK-3257] Add Exactly-Once Processing Guara...

2016-04-26 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1668#issuecomment-214883298 ok good to know @uce! Let me get back to it in a couple of weeks and make it right, now it is a bit impossible to find time. --- If your project is set up

[GitHub] flink pull request: [FLINK-3257] Add Exactly-Once Processing Guara...

2016-03-29 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1668#issuecomment-202818056 Thanks @StephanEwen and @uce for looking into it! I really appreciate it. How about the following: 1. I update this PR with the patch that uses ListState

[GitHub] flink pull request: [FLINK-3257] Add Exactly-Once Processing Guara...

2016-02-29 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1668#issuecomment-190279441 You can find an alternative version using `ListState` in the following branch: https://github.com/senorcarbone/flink/commits/ftloopsalt So I noticed

[GitHub] flink pull request: [FLINK-3257] Add Exactly-Once Processing Guara...

2016-02-23 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1668#issuecomment-187825999 Thanks @tillrohrmann for the feedback! I merged `ForwardingOneInputTask` into `StreamIterationSink`. One more think I missed pointing out is that when

[GitHub] flink pull request: [FLINK-3257] Add Exactly-Once Processing Guara...

2016-02-18 Thread senorcarbone
GitHub user senorcarbone opened a pull request: https://github.com/apache/flink/pull/1668 [FLINK-3257] Add Exactly-Once Processing Guarantees for Iterative DataStream Jobs # **[WIP]** This is a first version of the adapted snapshot algorithm to support iterations

[GitHub] flink pull request: [FLINK-3261] Allow Task to decline checkpoint ...

2016-01-25 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1537#issuecomment-174500162 Looks cool. Just so I understand exactly, what is wrong again if the Coordinator simply aborts expired checkpoint attempts? Wouldn't the protocol be the same

[GitHub] flink pull request: [FLINK-3256] Fix colocation group re-instantia...

2016-01-25 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1526#issuecomment-174486113 I guess this fix ok to merge. Any objections? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] flink pull request: [FLINK-3256] Fix colocation group re-instantia...

2016-01-21 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1526#issuecomment-173575376 I removed the groups from the ExecutionGraph. Hope it's good now. --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] flink pull request: [FLINK-3256] Fix colocation group re-instantia...

2016-01-20 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1526#issuecomment-173260776 Sure, I will do it on the fly. Thanks for checking it! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] flink pull request: [FLINK-3256] Fix colocation group re-instantia...

2016-01-20 Thread senorcarbone
GitHub user senorcarbone opened a pull request: https://github.com/apache/flink/pull/1526 [FLINK-3256] Fix colocation group re-instantiation This PR deals with the problem of inconsistent colocation groups upon reconfiguration. The problem was that we were removing shared

[GitHub] flink pull request: [FLINK-2808] Rework state abstraction and clea...

2015-10-08 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/1239#discussion_r41493576 --- Diff: flink-staging/flink-streaming/flink-streaming-core/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamTask.java --- @@ -322,73

[GitHub] flink pull request: [FLINK-2808] Rework state abstraction and clea...

2015-10-08 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1239#issuecomment-146472181 on a related note, is anyone planning to complete snapshotting for cyclc graphs for the 0.10 release? Unfortunately I do not have time to work on it this week

[GitHub] flink pull request: [FLINK-2808] Rework state abstraction and clea...

2015-10-08 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1239#issuecomment-146511645 @ktzoumas that's understandable! We could consider it for the maintenance release maybe if there is enough time --- If your project is set up for it, you can

[GitHub] flink pull request: [FLINK-2779][FLINK-2794][streaming][docs] New ...

2015-10-02 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1208#issuecomment-145032778 Ah Indeed. I just reviewed this with the prospect of custom pre-aggregations in mind and it seems like pre-aggregation strategies operate on bucket-granularity

[GitHub] flink pull request: [FLINK-2779][FLINK-2794][streaming][docs] New ...

2015-10-02 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1208#issuecomment-145025577 Nice doc so far ^^ One tiny fix on the *Advanced window constructs* subsection: [0,1000], [100,1100], **[200,1200]**, ..., [1000, 2000

[GitHub] flink pull request: [FLINK-2779][FLINK-2794][streaming][docs] New ...

2015-10-02 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1208#issuecomment-145028978 From a quick read on the documentation (and prior knowledge from google dataflow) it is easy to get a full picture of the new semantics. Even though I like it from

[GitHub] flink pull request: [FLINK-2713] [streaming] Set state restore to ...

2015-09-21 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1154#issuecomment-141934683 That's much better ^^ I approve :+1: --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] flink pull request: [FLINK-2713] [streaming] Include custom StateC...

2015-09-21 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1150#issuecomment-141907506 At first I found it a bit odd to include serialized checkpointers on every single statehandle but on a second look maybe that is the only way to generalize

[GitHub] flink pull request: [FLINK-2683] [runtime] Add dedicated operator ...

2015-09-20 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/1147#issuecomment-141817879 wow! I also want to look into this in more detail, it looks promising. @StephanEwen if you find a bit of time, can you write some more details to guide

[GitHub] flink pull request: [FLINK-2419] Add test for sinks after keyBy an...

2015-07-29 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/947#issuecomment-126054868 :+1: from me. looks that it does the job --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] flink pull request: [FLINK-2324] [streaming] Partitioned state che...

2015-07-29 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/937#issuecomment-126067262 That would be good ^^ then it's :+1: from me, at least for now. It's generally good performance-wise to have less serialised states. This means that we

[GitHub] flink pull request: Make streaming iterations more robust

2015-07-10 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/900#discussion_r34352021 --- Diff: flink-staging/flink-streaming/flink-streaming-core/src/main/java/org/apache/flink/streaming/api/datastream/IterativeDataStream.java

[GitHub] flink pull request: Make streaming iterations more robust

2015-07-10 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/900#issuecomment-120405183 There are quite a few changes in this PR! In general I am very keen to eliminating DataStream mutations and adding that logic into the jobgraph building phase

[GitHub] flink pull request: Make streaming iterations more robust

2015-07-10 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/900#discussion_r34359293 --- Diff: flink-staging/flink-streaming/flink-streaming-core/src/main/java/org/apache/flink/streaming/api/datastream/IterativeDataStream.java

[GitHub] flink pull request: [FLINK-2279] [streaming] Add Connected Iterati...

2015-06-26 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/870#issuecomment-115724842 no more wrapper types :+1: I approve --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] flink pull request: New operator state interfaces

2015-06-24 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/747#discussion_r33166608 --- Diff: flink-staging/flink-streaming/flink-streaming-core/src/main/java/org/apache/flink/streaming/api/datastream/DataStream.java --- @@ -301,16

[GitHub] flink pull request: New operator state interfaces

2015-06-22 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/747#issuecomment-114180337 We have added a `KeyedDataStream` type to the API as you can see in the last commit. It is basically a supertype of `GroupDataStream` and represents a `DataStream

[GitHub] flink pull request: [release-0.9] [streaming] Fixed streaming exam...

2015-06-10 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/816#issuecomment-110691858 :+1: --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] flink pull request: [streaming] Optional iteration feedback partit...

2015-06-09 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/810#issuecomment-110406047 :+1: Thanks Gyula, this is pretty handy for ML tasks. If possible I would recommend including this functionality to the undergoing release. --- If your project

[GitHub] flink pull request: New operator state interfaces

2015-05-29 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/747#discussion_r31325009 --- Diff: flink-runtime/src/main/java/org/apache/flink/runtime/jobgraph/tasks/CheckpointCommittingOperator.java --- @@ -18,7 +18,10

[GitHub] flink pull request: Pluggable state backend support for checkpoint...

2015-05-19 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/676#discussion_r30636159 --- Diff: flink-runtime/src/main/java/org/apache/flink/runtime/state/ByteStreamStateHandle.java --- @@ -0,0 +1,77 @@ +/* + * Licensed

[GitHub] flink pull request: Pluggable state backend support for checkpoint...

2015-05-19 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/676#issuecomment-103640808 Apart from my comment on the transient field I think this is ok to merge. Perhaps we can look into standardising the task manager backend configuration afterwards

[GitHub] flink pull request: [FLINK-1808] Send barrier requests only when t...

2015-04-03 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/551#discussion_r27763870 --- Diff: flink-runtime/src/main/scala/org/apache/flink/runtime/jobmanager/StreamCheckpointCoordinator.scala --- @@ -82,15 +81,18 @@ class

[GitHub] flink pull request: [FLINK-1808] Send barrier requests only when t...

2015-03-31 Thread senorcarbone
GitHub user senorcarbone opened a pull request: https://github.com/apache/flink/pull/551 [FLINK-1808] Send barrier requests only when the execution graph is running This is a simple optimisation for the current StreamCheckpointCoordinator that makes it skip barriers whent he

[GitHub] flink pull request: [Flink-1686] Streaming Iterations Slot Sharing...

2015-03-23 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/524#issuecomment-85223415 bad style habits ^^ thanks for reminding me. I also made a minor fix to the test too. --- If your project is set up for it, you can reply to this email and have

[GitHub] flink pull request: [Flink-1686] Streaming Iterations Slot Sharing...

2015-03-23 Thread senorcarbone
GitHub user senorcarbone opened a pull request: https://github.com/apache/flink/pull/524 [Flink-1686] Streaming Iterations Slot Sharing Fix Iteration head instances were using the same identifier for slot sharing which resulted into incorrect execution for streaming iterations. I

[GitHub] flink pull request: FLINK-1740

2015-03-19 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/501#issuecomment-83774008 @StephanEwen @uce I definitely agree, thanks for the feedback! --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] flink pull request: Flink 1740

2015-03-18 Thread senorcarbone
GitHub user senorcarbone opened a pull request: https://github.com/apache/flink/pull/501 Flink 1740 We now use the config parameter for numberofExecutionRetries on the generated job graph. Also piggypacked a minor fix for a compilation error that currently occurs in flink

[GitHub] flink pull request: FLINK-1740

2015-03-18 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/501#issuecomment-83197775 Indeed, cool. Good to know --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] flink pull request: FLINK-1740

2015-03-18 Thread senorcarbone
Github user senorcarbone commented on a diff in the pull request: https://github.com/apache/flink/pull/501#discussion_r26711053 --- Diff: flink-tests/src/test/java/org/apache/flink/test/javaApiOperators/GroupCombineITCase.java --- @@ -58,7 +58,7 @@ */ public class

[GitHub] flink pull request: FLINK-1740

2015-03-18 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/501#issuecomment-83196555 Thanks @mbalassi! If you merge can you remove the quickfix (2872308)? It looks like @hsaputra pushed it already --- If your project is set up for it, you can

[GitHub] flink pull request: [streaming] [wip] Fault tolerance prototype (f...

2015-03-08 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/459#issuecomment-77781300 thanks for the review Stephan! The synchronous call to executionGraph restart was a very sneaky thing, I committed it by mistake while trying some things out

[GitHub] flink pull request: [streaming] [wip] Fault tolerance prototype (f...

2015-03-06 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/459#issuecomment-77538649 hold on, will commit an update very soon. :P --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] flink pull request: [FLINK-1586] [streaming] Add support for itera...

2015-02-22 Thread senorcarbone
GitHub user senorcarbone opened a pull request: https://github.com/apache/flink/pull/432 [FLINK-1586] [streaming] Add support for iterative streaming graphs on JSON generation I refactored a bit the current JSON generation for streaming job graphs to further allow iterations

[GitHub] flink pull request: StreamWindow abstraction + modular window comp...

2015-02-14 Thread senorcarbone
Github user senorcarbone commented on the pull request: https://github.com/apache/flink/pull/395#issuecomment-74393323 +1 for merging asap. It's an improvement on both semantics and performance prospects for windowing. --- If your project is set up for it, you can reply

[GitHub] flink pull request: [FLINK-1344] [streaming] Added static StreamEx...

2015-01-26 Thread senorcarbone
GitHub user senorcarbone opened a pull request: https://github.com/apache/flink/pull/341 [FLINK-1344] [streaming] Added static StreamExecutionEnvironment initialisation and Implicits for scala sources This PR addresses the ticket [1] for further scala constructs interoperability