Hi Junguk,

I'll try to answer your questions, but I'm also looping in Ufuk who might know more about the network internals:

1. Yes, every operator/operator chain has a "setParallelism()" method to specify its parallelism. The overall parallelism of the job can be set when submitting the job. The parallelism per TaskManager is determined by the number of slots.
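For example (a minimal made-up job; the values are arbitrary):

import org.apache.flink.api.common.functions.MapFunction;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class ParallelismExample {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.setParallelism(4); // default parallelism for all operators of this job

        env.fromElements("a", "b", "c")
           .map(new MapFunction<String, String>() {
               @Override
               public String map(String value) {
                   return value.toUpperCase();
               }
           })
           .setParallelism(2) // overrides the default for this operator only
           .print();

        env.execute("parallelism-example");
    }
}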

2. From a user's perspective, you only see the "real data". Internally, different types of records flow through the topology (namely watermarks, checkpoint barriers, latency markers, and records with or without timestamp metadata).
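Timestamps and watermarks, for instance, only become visible through the API. A rough sketch (field layout and the out-of-orderness bound are just placeholders):

import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.TimeCharacteristic;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.streaming.api.functions.timestamps.BoundedOutOfOrdernessTimestampExtractor;
import org.apache.flink.streaming.api.windowing.time.Time;

public class WatermarkExample {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.setStreamTimeCharacteristic(TimeCharacteristic.EventTime);

        env.fromElements(new Tuple2<>("a", 1000L), new Tuple2<>("b", 2000L))
           // attaches a timestamp to each record and emits watermarks;
           // both travel through the topology as internal record types
           .assignTimestampsAndWatermarks(
               new BoundedOutOfOrdernessTimestampExtractor<Tuple2<String, Long>>(Time.seconds(5)) {
                   @Override
                   public long extractTimestamp(Tuple2<String, Long> element) {
                       return element.f1; // assumption: second field holds the event time
                   }
               })
           .print();

        env.execute("watermark-example");
    }
}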

3. See my last comment.

4. Flink also uses heartbeat messages between the JobManager and TaskManagers. In case of a failure, the JobManager restores the entire topology to the last successful checkpoint. See [1] for more explanation. In the future, it is planned to support more fine-grained recovery.
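Checkpointing and the restart behavior are configured on the environment, e.g. (the interval and retry values are arbitrary):

import org.apache.flink.api.common.restartstrategy.RestartStrategies;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class CheckpointingExample {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        // take a checkpoint every 10 seconds
        env.enableCheckpointing(10000);

        // on failure, restart the whole job up to 3 times with a 5 second delay
        env.setRestartStrategy(RestartStrategies.fixedDelayRestart(3, 5000));

        env.generateSequence(1, 100).print();
        env.execute("checkpointing-example");
    }
}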

5. Source workers should not be connected directly to the devices but through systems like Kafka or Pravega. This is useful not only for replaying records in case of failures but also for using the log as the single source of truth when your processing logic needs to be adapted. E.g., if a bug in your application has produced invalid state, you want to be able to correct the mistake and rebuild the state from the log in a batch. The folks from Drivetribe showed a very nice architecture for this [2]. I don't know if replaying from your IoT devices would make sense; in theory, you could implement your own connector with similar logic to Flink's Kafka consumer.
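A Kafka source would look roughly like this (assuming the Kafka 0.10 connector; topic name, group id, and broker address are placeholders):

import java.util.Properties;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.streaming.connectors.kafka.FlinkKafkaConsumer010;
import org.apache.flink.streaming.util.serialization.SimpleStringSchema;

public class KafkaSourceExample {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.enableCheckpointing(10000); // offsets are tracked as part of checkpoints

        Properties props = new Properties();
        props.setProperty("bootstrap.servers", "localhost:9092"); // placeholder broker
        props.setProperty("group.id", "sensor-readers");          // placeholder group id

        DataStream<String> readings = env.addSource(
                new FlinkKafkaConsumer010<>("sensor-topic", new SimpleStringSchema(), props));

        readings.print();
        env.execute("kafka-replay-example");
    }
}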

6. I don't know about the internals of the iteration feature, but you might be right. Cyclic dataflows are not fully supported yet. E.g., they do not participate in Flink's checkpointing mechanism.
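If you want to experiment with it, this is roughly how a cyclic dataflow is declared via iterate()/closeWith() (the decrement logic is just an arbitrary example):

import org.apache.flink.api.common.functions.FilterFunction;
import org.apache.flink.api.common.functions.MapFunction;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.datastream.IterativeStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class IterationExample {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        DataStream<Long> input = env.generateSequence(1, 5);

        // declares the loop head; elements fed back via closeWith() re-enter here
        IterativeStream<Long> loop = input.iterate();

        DataStream<Long> minusOne = loop.map(new MapFunction<Long, Long>() {
            @Override
            public Long map(Long value) {
                return value - 1;
            }
        });

        // values still greater than zero are fed back into the loop
        loop.closeWith(minusOne.filter(new FilterFunction<Long>() {
            @Override
            public boolean filter(Long value) {
                return value > 0;
            }
        }));

        // values that reached zero leave the loop
        minusOne.filter(new FilterFunction<Long>() {
            @Override
            public boolean filter(Long value) {
                return value <= 0;
            }
        }).print();

        env.execute("iteration-example");
    }
}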

In general, I would recommend importing Flink into your IDE, setting a breakpoint in an example (e.g. within a mapper before a keyBy), and running it in debug mode. You can step through the layers to see more about the internals. This should answer most of your questions; otherwise, feel free to ask again.
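For instance, something like the following (a made-up pipeline, any example will do) gives you a mapper before a keyBy to set the breakpoint in:

import org.apache.flink.api.common.functions.MapFunction;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class DebugExample {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        env.fromElements(new Tuple2<>("a", 1), new Tuple2<>("b", 2), new Tuple2<>("a", 3))
           .map(new MapFunction<Tuple2<String, Integer>, Tuple2<String, Integer>>() {
               @Override
               public Tuple2<String, Integer> map(Tuple2<String, Integer> value) {
                   // set a breakpoint here and step out to see how Flink
                   // serializes the record and hands it to the network stack
                   return value;
               }
           })
           .keyBy(0)
           .sum(1)
           .print();

        env.execute("debug-example");
    }
}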

Regards,
Timo

[1] https://ci.apache.org/projects/flink/flink-docs-release-1.3/internals/stream_checkpointing.html
[2] https://data-artisans.com/blog/drivetribe-cqrs-apache-flink

On 06.09.17 at 21:54, Junguk Cho wrote:
Hi, All.

I am new to Flink.
I just installed Flink on a cluster and started reading documents to understand Flink internals.
After reading some documents, I have some questions.
I have some experience with Storm and Heron, so I am relating their mechanisms to my questions to better understand Flink.

1. Can I specify worker parallelism explicitly like Storm?

2. Record in Flink
Can I think of a "record" in Flink as almost the same as a Tuple in Storm?
A Tuple in Storm carries "real data" + "metadata" (e.g., stream type, source id, and so on).

3. How does partitioning (e.g., shuffling, map) work internally?
In Storm, there are (worker id) : (tcp info to next workers) tables.
So, after executing the partition function, a Tuple is forwarded to the next hops based on these tables.
Is it the same?

4. How does Flink detect faults in case of worker death or machine failure?
Based on the documents, the JobManager checks the liveness of TaskManagers with heartbeat messages. In Storm, the supervisor (I think it is similar to the TaskManager) first detects a dead worker based on heartbeats and locally re-runs it. For machine failures, Nimbus (I think it is similar to the JobManager) detects the failure based on the supervisor's heartbeat and re-schedules all assigned workers to other machines.
How does Flink work?

5. For exactly-once delivery, Flink uses checkpointing and a record replay mechanism.
It needs message queues (e.g., Kafka) for record replay.
Kafka uses TCP to send and receive data. So I wonder, if a data source does not use TCP (e.g., IoT sensors), what are the general solutions for record replay? For example, source workers could be directly connected to several inputs (e.g., IoT sensors), although I think that is not a normal deployment.

6. Flink supports cycles.
However, based on the documents, the tasks of a cycle act as a regular dataflow source and sink respectively, yet they are collocated on the same physical instance to share an in-memory buffer and thus implement the loopback stream transparently. So, what if the number of workers that form a cycle is high? It would be hard to put them all on the same physical machine.

Thanks,
Junguk

