nfarah86 commented on code in PR #8093: URL: https://github.com/apache/hudi/pull/8093#discussion_r1130566785
########## website/docs/flink_configuration.md: ########## @@ -3,115 +3,179 @@ title: Flink Setup toc: true --- -## Global Configurations -When using Flink, you can set some global configurations in `$FLINK_HOME/conf/flink-conf.yaml` +[Apache Flink](https://flink.apache.org/what-is-flink/flink-architecture/) is a powerful streaming-batch integrated engine that provides a stream processing framework. Flink can process events at an incredible speed with low latency. Along with Hudi, users can use streaming ingestion like with Kafka; streaming consumption like with Kafka; and also perform batch workloads like bulk ingest, snapshot queries and incremental queries. -### Parallelism - -| Option Name | Default | Type | Description | -| ----------- | ------- | ------- | ------- | -| `taskmanager.numberOfTaskSlots` | `1` | `Integer` | The number of parallel operator or user function instances that a single TaskManager can run. We recommend setting this value > 4, and the actual value needs to be set according to the amount of data | -| `parallelism.default` | `1` | `Integer` | The default parallelism used when no parallelism is specified anywhere (default: 1). For example, If the value of [`write.bucket_assign.tasks`](#parallelism-1) is not set, this value will be used | +There are three executions modes a user can configure for Flink: +- Streaming Review Comment: There are three execution modes a user can configure for Flink, and within each execution mode, users can use Flink SQL writing to configure their job options. The following section describes the necessary configs for different job conditions. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
