[GitHub] flink issue #5096: [FLINK-8165] ParameterTool serialization fix

2017-11-29 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/5096 @aljoscha , im having a hard time setting up my git credentials properly on my work laptop could you please push this for me? ---

[GitHub] flink issue #5096: [FLINK-8165] ParameterTool serialization fix

2017-11-29 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/5096 yes it fails without it, will merge this ---

[GitHub] flink pull request #5096: [FLINK-8165] ParameterTool serialization fix

2017-11-29 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/5096 [FLINK-8165] ParameterTool serialization fix *Thank you very much for contributing to Apache Flink - we are happy that you want to help us improve Flink. To help the community review your

[GitHub] flink pull request #4083: [FLINK-6742] Improve savepoint migration failure e...

2017-06-25 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/4083#discussion_r123895607 --- Diff: flink-runtime/src/main/java/org/apache/flink/runtime/checkpoint/savepoint/SavepointV2.java --- @@ -168,10 +168,27 @@ public static Savepoint

[GitHub] flink issue #2796: [FLINK-4873] Configurable ship path for YARN cluster reso...

2017-05-24 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2796 I am travelling right now, will try to look into this over the weekend. This is in no way a blocker for the release I think. :) --- If your project is set up for it, you can reply to this email

[GitHub] flink issue #3901: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-05-18 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3901 Thanks @tzulitai I don't feel very strongly about either way, I am just concerned for other users. I leave this decision to you I know you are already flooded with other stuff around the release

[GitHub] flink issue #3901: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-05-18 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3901 On the other hand this is potentially causing major data skew or errors for any people who are using the dynamic topics (they might not even be aware of it). --- If your project is set up

[GitHub] flink issue #3901: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-05-18 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3901 Well, I can understand it but will mean that we have to keep running with a custom build because there is no way to work around this nicely. --- If your project is set up for it, you can reply

[GitHub] flink issue #1800: [FLINK-3620] [streaming] Remove DbStateBackend

2017-05-15 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/1800 @xhook You should look at https://issues.apache.org/jira/browse/FLINK-4266 for the most up to date discussion on this topic :) --- If your project is set up for it, you can reply to this email

[GitHub] flink issue #3870: [Flink 6537] Fixes and improvements for incremental check...

2017-05-11 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3870 Hi, I am sending this because I will be away for the weekend so won't have time to test further until monday. I pulled these changes and Stephan's classloader fix but I still get some errors

[GitHub] flink issue #3856: [FLINK-5998] Fix shaded Hadoop Jar

2017-05-11 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3856 @aljoscha Yes, that might work although I prefer this fix compared to having to mess with the maven versions on deploy servers. --- If your project is set up for it, you can reply to this email

[GitHub] flink issue #3856: [FLINK-5998] Fix shaded Hadoop Jar

2017-05-10 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3856 This fix seems to work for me as well 👍 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] flink issue #3766: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-05-08 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3766 @tzulitai should we try to get this in the release? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] flink issue #3801: [FLINK-6364] [checkpoints] Implement incremental checkpoi...

2017-05-02 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3801 sweet --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] flink issue #3801: [FLINK-6364] [checkpoints] Implement incremental checkpoi...

2017-05-02 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3801 @StefanRRichter Do you think we should try to figure out which SST files have been compacted and exclude them from recovery? --- If your project is set up for it, you can reply to this email

[GitHub] flink pull request #3766: [FLINK-6288] fix FlinkKafkaProducer's custom Parti...

2017-05-02 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/3766#discussion_r114282783 --- Diff: flink-connectors/flink-connector-kafka-base/src/main/java/org/apache/flink/streaming/connectors/kafka/FlinkKafkaProducerBase.java --- @@ -281,6

[GitHub] flink pull request #3801: [FLINK-6364] [checkpoints] Implement incremental c...

2017-05-02 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/3801#discussion_r114282104 --- Diff: flink-contrib/flink-statebackend-rocksdb/src/main/java/org/apache/flink/contrib/streaming/state/RocksDBKeyedStateBackend.java --- @@ -621,6

[GitHub] flink pull request #3801: [FLINK-6364] [checkpoints] Implement incremental c...

2017-05-02 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/3801#discussion_r114270508 --- Diff: flink-contrib/flink-statebackend-rocksdb/src/main/java/org/apache/flink/contrib/streaming/state/RocksDBKeyedStateBackend.java --- @@ -621,6

[GitHub] flink issue #3801: [FLINK-6364] [checkpoints] Implement incremental checkpoi...

2017-04-29 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3801 Hi Thanks for the nice effort! I only skimmed through the changes to get the main idea (I will do a more thorough review in the next days) but I have some initial questions :) 1

[GitHub] flink issue #3766: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-04-27 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3766 I liked the proposed API and I agree that it's probably best to keep the old behaviour for the deprecated API. I don't think the Kafka partition info fetching should be a huge problem

[GitHub] flink issue #3766: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-04-26 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3766 I think this is reasonable as the current implementation doesnt work for dynamic new topics. (we should also deprecate the current one) But let's hear what others say :) --- If your project

[GitHub] flink issue #3766: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-04-26 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3766 How would this new API map to the current one in terms of backwards compatibility? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] flink issue #3766: [FLINK-6288] fix FlinkKafkaProducer's custom Partitioner ...

2017-04-25 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3766 Hi, Thanks for the PR! The first problem I noticed with this approach is that it will not work if users want to partition dynamically created topics (my use case actually). We

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-24 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r107968436 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationHead.java --- @@ -119,13 +183,154 @@ protected void

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-24 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r107968935 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationHead.java --- @@ -17,100 +17,164 @@ package

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-24 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r107969496 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationHead.java --- @@ -17,100 +17,164 @@ package

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-24 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r107967910 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationHead.java --- @@ -119,13 +183,154 @@ protected void

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-24 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r107968567 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationHead.java --- @@ -119,13 +183,154 @@ protected void

[GitHub] flink pull request #1668: [FLINK-3257] Add Exactly-Once Processing Guarantee...

2017-03-24 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1668#discussion_r107967886 --- Diff: flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/StreamIterationHead.java --- @@ -119,13 +183,154 @@ protected void

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-15 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 Ah, the reason is probably that I didnt change my job jar, and this relies on changes in the rocks backend --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-15 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 @StefanRRichter It seems to work correctly locally, I am trying to see what went wrong with my yarn tests, but this shouldnt block you --- If your project is set up for it, you can reply

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-14 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 It also doesnt seem to work starting from a clean state and then savepoint redeploy with changed topology so maybe I am really screwing up something --- If your project is set up for it, you can

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-14 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 Hm, doesnt seem to work for the first try. What I did is I updated the client with the new jar based on your backport branch. Redeployed the job with a savepoint (to get the new Flink version), took

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-14 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 There seems to have been some changes in the StreamTask and some tests so I couldn't rebase this nicely. Do you have a minute to take a look and maybe push a branch with the backport please

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-14 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 Im gonna try to cherry-pick this on 1.2 and run it today --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-13 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 I could only try the backported version on the topology that caused the problem initally (that is running 1.2.0) --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] flink issue #3523: [FLINK-5985] Report no task states for stateless tasks in...

2017-03-13 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3523 The changes look reasonable :) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] flink issue #3335: [FLINK-5818][Security]change checkpoint dir permission to...

2017-02-17 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3335 I agree with @StephanEwen that people probably manage the directory permissions directly when configuring the Flink jobs. It would be quite annoying if the Flink job changed the permissions you set

[GitHub] flink issue #3346: [FLINK-5763] [checkpoints] Add CheckpointOptions for self...

2017-02-17 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3346 So will the same savepoint logic apply to externalized checkpoints? I think it would be good to have a similar way of restoring from checkpoints and savepoints from a usability perspective

[GitHub] flink pull request #3199: [FLINK-5627][streaming] [checkpoints] Allow job sp...

2017-01-24 Thread gyfora
Github user gyfora closed the pull request at: https://github.com/apache/flink/pull/3199 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] flink issue #3199: [FLINK-5627][streaming] [checkpoints] Allow job specific ...

2017-01-24 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/3199 Thanks for looking into this Ufuk, I missed the previous discussion. I guess you can work around this as long as you are running on single yarn sessions and always generate the flink-conf files

[GitHub] flink pull request #2129: [FLINK-1003] [WIP] Spread out scheduling of tasks

2017-01-24 Thread gyfora
Github user gyfora closed the pull request at: https://github.com/apache/flink/pull/2129 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] flink pull request #3199: [streaming] [checkpoints] Allow job specific exter...

2017-01-24 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/3199 [streaming] [checkpoints] Allow job specific external checkpoint dir You can merge this pull request into a Git repository by running: $ git pull https://github.com/gyfora/flink external

[GitHub] flink issue #1668: [FLINK-3257] Add Exactly-Once Processing Guarantees for I...

2016-12-12 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/1668 I think the PR looks pretty good, and it sounds fair to to address termination in a later PR as this will still greatly improve the current guarantees without making the backpressure/termination

[GitHub] flink pull request #2990: Backport [FLINK-5071]

2016-12-12 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/2990 Backport [FLINK-5071] You can merge this pull request into a Git repository by running: $ git pull https://github.com/gyfora/flink FLINK-5071 Alternatively you can review and apply

[GitHub] flink issue #2839: [FLINK-5071][yarn] adjust vcore validation check

2016-11-21 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2839 looks good +1 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] flink pull request #2796: [FLINK-4873] Configurable ship path for YARN clust...

2016-11-13 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/2796 [FLINK-4873] Configurable ship path for YARN cluster resources This PR makes the resource shipping directory configurable for YARN clusters by adding a new config option: `yarn.resource.ship.path

[GitHub] flink issue #2796: [FLINK-4873] Configurable ship path for YARN cluster reso...

2016-11-13 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2796 We can also easily backport this to 1.1.* We are actually using this change in production there. --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] flink issue #2509: [FLINK-4280][kafka-connector] Explicit start position con...

2016-09-26 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2509 @tzulitai makes sense ! As for for the Map<Int, Long> you are right, the multiple topic case slipped my mind :) --- If your project is set up for it, you can reply to this email and have your

[GitHub] flink issue #2509: [FLINK-4280][kafka-connector] Explicit start position con...

2016-09-26 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2509 Hi, I like the proposed changes, do you think it would make sense to add the possibility to set specific offsets on a per partition basis? ``` kafka.setStartOffsets(Map<Inte

[GitHub] flink pull request #2413: [FLINK-4471] Use user code classloader to deserial...

2016-08-25 Thread gyfora
Github user gyfora closed the pull request at: https://github.com/apache/flink/pull/2413 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] flink issue #2413: [FLINK-4471] Use user code classloader to deserialize sta...

2016-08-25 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2413 Done, we can close this I guess if you are fixing it in your PR --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] flink issue #2413: [FLINK-4471] Use user code classloader to deserialize sta...

2016-08-25 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2413 Should I push this on 1.1 then? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] flink pull request #2413: [FLINK-4471] Use user code classloader to deserial...

2016-08-24 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/2413 [FLINK-4471] Use user code classloader to deserialize state descriptor in Rocks backend Thanks for contributing to Apache Flink. Before you open your pull request, please take the following check

[GitHub] flink pull request #2399: [FLINK-4441] Make RocksDB backend return null on e...

2016-08-21 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/2399 [FLINK-4441] Make RocksDB backend return null on empty state Thanks for contributing to Apache Flink. Before you open your pull request, please take the following check list into consideration

[GitHub] flink issue #2345: [FLINK-4340] Remove RocksDB Semi-Async Checkpoint Mode

2016-08-11 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2345 I can see the merits of both checkpointing approaches but Stephan is right in the sense that allowing semi-async snapshots with dynamic scaling would need a completely new implementation and would

[GitHub] flink issue #2345: [FLINK-4340] Remove RocksDB Semi-Async Checkpoint Mode

2016-08-09 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2345 Hi, Isn't this way of checkpointing is much much slower then the semi async version? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] flink issue #2345: [FLINK-4340] Remove RocksDB Semi-Async Checkpoint Mode

2016-08-09 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2345 But I wonder what would happen in a scenario with a lot of states: Semi async: short local copy time at every snapshot + very fast restore Fully async: no copy time + very slow restore

[GitHub] flink issue #2345: [FLINK-4340] Remove RocksDB Semi-Async Checkpoint Mode

2016-08-09 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2345 Good thing about the way fully async checkpoints are restored though is that it is very trivial to insert some state adaptor code :) --- If your project is set up for it, you can reply

[GitHub] flink issue #2345: [FLINK-4340] Remove RocksDB Semi-Async Checkpoint Mode

2016-08-09 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2345 But you are right it is probably more important to keep the latency down for the running programs, and for that the fully async seems to be strictly better --- If your project is set up for it, you

[GitHub] flink issue #2345: [FLINK-4340] Remove RocksDB Semi-Async Checkpoint Mode

2016-08-09 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2345 Some of the benefits we lose on restore. Especially for very large states this can be pretty serious. Maybe this is required for the sharding to some extent but I don't see

[GitHub] flink issue #2129: [FLINK-1003] [WIP] Spread out scheduling of tasks

2016-07-06 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2129 Thanks Stephan, I agree that there is a lot to think about/improve when it comes to scheduling and dynamic scaling. Should we add this to the Key groups design doc or there is going

[GitHub] flink issue #2168: [FLINK-3964] add hint to job submission timeout exception...

2016-06-27 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2168 I think it is often the case that the timeout is caused by some failure. For instance the savepoint restore failes due to something, then you get this timeout in the command line. Maybe it's

[GitHub] flink issue #2143: [FLINK-4040] [dist] Allow JobManager/TaskManager-specific...

2016-06-21 Thread gyfora
Github user gyfora commented on the issue: https://github.com/apache/flink/pull/2143 Looks good, this addresses the issue 👍 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] flink pull request #2129: [FLINK-1003] [WIP] Spread out scheduling of tasks

2016-06-18 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/2129 [FLINK-1003] [WIP] Spread out scheduling of tasks This is a working progress PR with the core functionality implemented but no tests yet. As this is a highly critical part of the system I

[GitHub] flink pull request: [FLINK-3779] Add support for queryable state

2016-05-31 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/2051#issuecomment-222627001 You are probably right that I am little biased regarding (1), sorry :) --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] flink pull request: [FLINK-3779] Add support for queryable state

2016-05-31 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/2051#issuecomment-222606282 I will gradually add some questions/comments as I go :) 1. Do we really need a QueryableStateStream exposed in the API? As you said this is just a pretty basic

[GitHub] flink pull request: [FLINK-3779] Add support for queryable state

2016-05-31 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/2051#issuecomment-222600015 Thanks for the explanation. I will start testing/reviewing this later today. --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] flink pull request: [FLINK-3779] Add support for queryable state

2016-05-30 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/2051#issuecomment-222548679 Awesome feature Ufuk, I am very excited to try this out and give some feedback :) So if I understand correctly in order to query the state I can use

[GitHub] flink pull request: [FLINK-3761] Introduction of key groups

2016-05-18 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1988#issuecomment-21205 So if I understand currently there is no way to scale jobs with non-partitioned states. This also means that window operations (that register timers

[GitHub] flink pull request: [FLINK-2829] Confusing error message when Flin...

2016-05-17 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1993#issuecomment-219704691 This is Jira issue that I opened a qhile back and forgot about it :) The issue was not really that the Slot name was not well formatted etc, it's more thatn this type

[GitHub] flink pull request: [FLINK-3761] Introduction of key groups

2016-05-13 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1988#issuecomment-219064791 Very cool stuff! I was wondering did you do any benchmarks for the performance impact of this change? For instance it would be good to know how well RocksDB behaves

[GitHub] flink pull request: [FLINK-3581] [FLINK-3582] State Iterator and A...

2016-05-02 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1957#issuecomment-216354689 Is that a problem? Maybe we could do some periodic garbage collection on the empty column families. --- If your project is set up for it, you can reply to this email

[GitHub] flink pull request: [FLINK-3581] [FLINK-3582] State Iterator and A...

2016-05-02 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1957#issuecomment-216257051 Hi, So just a quick question regarding the namespace dropping in rocks. I though you said it would be possible to do this by using prefixes in rocks. Are there some

[GitHub] flink pull request: [FLINK-3581] [FLINK-3582] State Iterator and A...

2016-05-02 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1957#issuecomment-216259593 The other possibility would be to store them in different column families. Not sure about the performance there though --- If your project is set up for it, you can

[GitHub] flink pull request: [FLINK-3798] [streaming] Clean up RocksDB back...

2016-04-21 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1918#issuecomment-212834630 ahh the actual classes were not public either, I'm so stupid to have missed that lol... :d --- If your project is set up for it, you can reply to this email and have

[GitHub] flink pull request: [FLINK-3790] [streaming] Use proper hadoop con...

2016-04-21 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1919#issuecomment-212827419 We confirmed that this works in our production environment --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well

[GitHub] flink pull request: [FLINK-3798] [streaming] Clean up RocksDB back...

2016-04-21 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1918#discussion_r60549764 --- Diff: flink-contrib/flink-statebackend-rocksdb/src/main/java/org/apache/flink/contrib/streaming/state/RocksDBValueState.java --- @@ -64,7 +64,7

[GitHub] flink pull request: [FLINK-3790] [streaming] Use proper hadoop con...

2016-04-21 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1919#issuecomment-212816048 I need about an hour :) but working on it --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] flink pull request: [FLINK-3798] [streaming] Clean up RocksDB back...

2016-04-21 Thread gyfora
Github user gyfora commented on a diff in the pull request: https://github.com/apache/flink/pull/1918#discussion_r60544472 --- Diff: flink-contrib/flink-statebackend-rocksdb/src/main/java/org/apache/flink/contrib/streaming/state/RocksDBValueState.java --- @@ -64,7 +64,7

[GitHub] flink pull request: [FLINK-3790] [streaming] Use proper hadoop con...

2016-04-20 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/1919 [FLINK-3790] [streaming] Use proper hadoop config in rolling sink You can merge this pull request into a Git repository by running: $ git pull https://github.com/gyfora/flink rolling-conf

[GitHub] flink pull request: [FLINK-3798] [streaming] Clean up RocksDB back...

2016-04-20 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/1918 [FLINK-3798] [streaming] Clean up RocksDB backend field/method access The RocksDB state backend uses a lot package private methods and fields which makes it very hard to subclass the different parts

[GitHub] flink pull request: [FLINK-3659] Allow ConnectedStreams to Be Keye...

2016-03-24 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1831#issuecomment-200907475 @StephanEwen This PR does not change the behaviour of any existing Flink applications. It now allows though that users only specify key of one input

[GitHub] flink pull request: [FLINK-3659] Allow ConnectedStreams to Be Keye...

2016-03-24 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1831#issuecomment-200886203 I think you can go ahead merging this if no-one has any objections :) --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] flink pull request: [FLINK-3658][Kafka] Allow producing into multi...

2016-03-23 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1832#issuecomment-200354048 This is a very useful feature :) The code looks good and straightforward :+1: --- If your project is set up for it, you can reply to this email and have your

[GitHub] flink pull request: [FLINK-3659] Allow ConnectedStreams to Be Keye...

2016-03-23 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1831#issuecomment-200339097 Thanks Aljoscha, this seems to work :+1: --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] flink pull request: [FLINK-3620] [streaming] Remove DbStateBackend

2016-03-19 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/1800 [FLINK-3620] [streaming] Remove DbStateBackend You can merge this pull request into a Git repository by running: $ git pull https://github.com/gyfora/flink remove-dbbackend Alternatively you

[GitHub] flink pull request: [WIP] [streaming] OutOfCoreKvState added + maj...

2016-03-19 Thread gyfora
Github user gyfora closed the pull request at: https://github.com/apache/flink/pull/1627 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] flink pull request: [FLINK-3566] [FLINK-3563] Input type validatio...

2016-03-03 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1759#issuecomment-191821834 I tested it in my application where I was previously having both issues. Now it works perfectly, thanks! +1 from me :+1: --- If your project is set up

[GitHub] flink pull request: [FLINK-3257] Add Exactly-Once Processing Guara...

2016-02-23 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1668#issuecomment-187827823 I can't really add anything to the timeout question :D As for the snapshotting. I would go with the ListState as that potentially provides very efficient

[GitHub] flink pull request: [FLINK-3257] Add Exactly-Once Processing Guara...

2016-02-19 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1668#issuecomment-186142276 Thanks Paris, I like the idea, it's a correct modification of the original algorithm to make it much easier to implement on Flink for the price of buffering more records

[GitHub] flink pull request: [FLINK-3368][Kafka 0.8] Handle leader changes ...

2016-02-17 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1623#issuecomment-185260566 We have tested this and it worked correctly. Should we go ahead a merge it? --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] flink pull request: [FLINK-3299] Remove ApplicationID from Environ...

2016-02-16 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1642#issuecomment-184683583 In general the DbStateBackend should not misbehave under the current assumptions. What Ufuk means is that the DbStateBackend does not make any assumption about the job

[GitHub] flink pull request: [FLINK-3354] Determine correct size for RocksD...

2016-02-15 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1638#issuecomment-184196183 Sure --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] flink pull request: [FLINK-3354] Determine correct size for RocksD...

2016-02-15 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1638#issuecomment-184195205 It counts the size without HDFS replication I think that is better. --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] flink pull request: [FLINK-3354] Determine correct size for RocksD...

2016-02-15 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/1638 [FLINK-3354] Determine correct size for RocksDB snapshots You can merge this pull request into a Git repository by running: $ git pull https://github.com/gyfora/flink FLINK-3354 Alternatively

[GitHub] flink pull request: [WIP] [streaming] OutOfCoreKvState added + maj...

2016-02-12 Thread gyfora
GitHub user gyfora opened a pull request: https://github.com/apache/flink/pull/1627 [WIP] [streaming] OutOfCoreKvState added + major improvements to DbStateBackend This PR has 2 main parts: 1. A new abstract class for encapsulating functionality for OutOfCore lazy states

[GitHub] flink pull request: [FLINK-3355] [rocksdb backend] Allow passing o...

2016-02-09 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1608#issuecomment-181800676 Looks good :+1: --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] flink pull request: [FLINK-3334] [conf] Include year-month-day in ...

2016-02-08 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1604#issuecomment-181503516 Looks good :) Thanks Stephan! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] flink pull request: Enhance Partitioned State and use it in Window...

2016-01-31 Thread gyfora
Github user gyfora commented on the pull request: https://github.com/apache/flink/pull/1562#issuecomment-177611308 Hey, First of all, great work I am looking forward to having this is :) I think it would be good if we added default implementations of the List, Reducing

  1   2   3   4   >