Re: Has the "Stop" Button next to the "Cancel" Button been removed in Flink's 1.9 web interface?

2020-03-02 Thread Tzu-Li Tai
Hi Tobi, In this case, the job would indeed continue from the last offset that has been committed in Kafka (assuming that you are using the `startFromGroupOffsets` start position) for the specified group id. However, do keep in mind that those offsets are not consistent with the offsets written in

Re: Please help, I need to bootstrap keyed state into a stream

2020-08-09 Thread Tzu-Li Tai
Hi, For the NullPointerException, what seems to be happening is that you are setting NULL values in your MapState, that is not allowed by the API. Otherwise, the code that you showed for bootstrapping state seems to be fine. > I have yet to find a working example that shows how to do both > (boo

Re: is Flink supporting pre-loading of a compacted (reference) topic for a join ?

2020-01-27 Thread Tzu-Li Tai
Hi Dominique, FLIP-17 (Side Inputs) is not yet implemented, AFAIK. One possible way to overcome this right now if your reference data is static and not continuously changing, is to use the State Processor API to bootstrap a savepoint with the reference data. Have you looked into that and see if i

Re: KafkaFetcher closed before end of stream is received for all partitions.

2020-02-16 Thread Tzu-Li Tai
Hi, Your observations are correct. It is expected that the result of `KafkaDeserializationSchema#isEndOfStream` triggers a single subtask to escape its fetch loop. Therefore, if a subtask is assigned multiple partitions, as soon as one record (regardless of which partition it came from) signals en

Re: Flink Savepoint error

2020-02-16 Thread Tzu-Li Tai
Hi, Could you let me know what data source / connector you are using? My gut feeling is that perhaps some sources have already reached EOF and terminated, which may explain the exception (which is expected behaviour). Cheers, Gordon -- Sent from: http://apache-flink-user-mailing-list-archive.2

Re: [Flink 1.10] How do I use LocalCollectionOutputFormat now that writeUsingOutputFormat is deprecated?

2020-02-16 Thread Tzu-Li Tai
Hi, To collect the elements of a DataStream (usually only meant for testing purposes), you can take a look at `DataStreamUtils#collect(DataStream)`. Cheers, Gordon -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Wikiedit QuickStart with Kinesis

2016-09-01 Thread Tzu-Li Tai
I’m afraid the “AUTO” option on the Kinesis producer is actually bugged, so the internally used KPL library correctly pick up credentials with the default credential provider chain. I’ve just filed a JIRA for this: https://issues.apache.org/jira/browse/FLINK-4559

Re: Flink is looking for Kafka topic "n/a"

2018-03-06 Thread Tzu-Li Tai
Hi Mu, You mentioned that the job stopped after the "n/a" topic error, but the job failed to recover. What exception did you encounter in the restart executions? Was it the same error? This would verify if we actually should be removing more than one of these special MARKER partition states. On t

Re: Serialization and Deserialization of Avro messages stored in Kafka

2018-03-06 Thread Tzu-Li Tai
Hi Filipe, What Gordon mentioned is correct. Did you manage to fix the issue? >From your code snippet, it looks like that the `Schema` field may not be serializable. Could you double check that? Cheers, Gordon -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Restart hook and checkpoint

2018-03-06 Thread Tzu-Li Tai
Hi Ashish, Could you elaborate a bit more on why you think the restart of all operators lead to data loss? When restart occurs, Flink will restart the job from the latest complete checkpoint. All operator states will be reloaded with state written in that checkpoint, and the position of the input