State in flink jobs

2017-01-15 Thread Janardhan Reddy
Hi, Is the value state persisted across job restart/deployment without savepoint if checkpointing is disabled.

Measuring Execution Time

2017-01-15 Thread Charith Wickramarachchi
Hi All, I have a program that executes multiple (data dependent) iterative jobs one after another in the same execution plan. It has the following structure ExecutionEnvironment env = .. LoadData Iterative Process 1 Iterative Process 2 ... Iterative Process N env.execute("...") I

Restart the job from a checkpoint

2017-01-15 Thread tao xiao
Hi team, Can we restart a flink job from previous successful checkpoint? I know we can start a flink from a savepoint but I wonder if I can do it similar by passing the checkpoint path to the flink run command to restore the job from checkpoint.

Re: Kafka KeyedStream source

2017-01-15 Thread Tzu-Li (Gordon) Tai
Hi Niels, If it’s only for simple data filtering that does not depend on the key, a simple “flatMap” or “filter" directly after the source can be chained to the source instances. What that does is that the filter processing will be done within the same thread as the one fetching data from a

Re: Deduplicate messages from Kafka topic

2017-01-15 Thread Tzu-Li (Gordon) Tai
Hi, You’re correct that the FlinkKafkaProducer may emit duplicates to Kafka topics, as it currently only provides at-least-once guarantees. Note that this isn’t a restriction only in the FlinkKafkaProducer, but a general restriction for Kafka's message delivery. This can definitely be improved

Re: 答复: [DISCUSS] Apache Flink 1.2.0 RC0 (Non-voting testing release candidate)

2017-01-15 Thread Till Rohrmann
I found two problematic issues with Mesos HA mode which breaks it: https://issues.apache.org/jira/browse/FLINK-5495 https://issues.apache.org/jira/browse/FLINK-5496 On Fri, Jan 13, 2017 at 11:29 AM, Fabian Hueske wrote: > I tested the Table API / SQL a bit. > > I implemented

Re: Terminology: Split, Group and Partition

2017-01-15 Thread Fabian Hueske
Hi Robert, thanks for opening the ticket. Regarding injecting grouping or partitioning information, semantic annotations (forward fields) [1] is probably what you are looking for. Best, Fabian [1]

Re: some questions about submit flink job on flink-yarn

2017-01-15 Thread Till Rohrmann
Hi Huang, the reason why you cannot use the IP address to send messages to your YARN JobManager is that we no longer resolve the hostname into an IP address. Instead we start the ActorSystem with the unresolved hostname. You can see this in the following log line: `Actor system bound to hostname

Re: Flink on YARN: Cannot connect to JobManager

2017-01-15 Thread Till Rohrmann
Hi Malte, can it be that you’re trying to request more resources from your yarn cluster than there are currently available? It depends a little bit on your other settings but -yn 2 says that you request 2 TaskManagers. Additionally, Flink will also allocate another container for the JobManager.