[druid] branch master updated: Fix broken anchor and heading levels in Kafka/Kinesis ingestion (#11748)

cwylie Tue, 05 Oct 2021 19:31:14 -0700

This is an automated email from the ASF dual-hosted git repository.

cwylie pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/druid.git



The following commit(s) were added to refs/heads/master by this push:
     new 104c9a0  Fix broken anchor and heading levels in Kafka/Kinesis 
ingestion (#11748)
104c9a0 is described below

commit 104c9a07f0116b781ec509baab603e9023a52af1
Author: Frank Chen <[email protected]>
AuthorDate: Wed Oct 6 10:30:50 2021 +0800

    Fix broken anchor and heading levels in Kafka/Kinesis ingestion (#11748)
    
    * Fix broken anchor and heading levels
    
    * Fix CI
---
 docs/development/extensions-core/kafka-ingestion.md   | 10 +++++-----
 docs/development/extensions-core/kinesis-ingestion.md | 10 +++++-----
 2 files changed, 10 insertions(+), 10 deletions(-)

diff --git a/docs/development/extensions-core/kafka-ingestion.md 
b/docs/development/extensions-core/kafka-ingestion.md
index dcebce0..d30bf6a 100644
--- a/docs/development/extensions-core/kafka-ingestion.md
+++ b/docs/development/extensions-core/kafka-ingestion.md
@@ -147,9 +147,9 @@ Where the file `supervisor-spec.json` contains a Kafka 
supervisor spec:
 |`lateMessageRejectionStartDateTime`|ISO8601 DateTime|Configure tasks to 
reject messages with timestamps earlier than this date time; for example if 
this is set to `2016-01-01T11:00Z` and the supervisor creates a task at 
*2016-01-01T12:00Z*, Druid drops messages with timestamps earlier than 
*2016-01-01T11:00Z*. This can prevent concurrency issues if your data stream 
has late messages and you have multiple pipelines that need to operate on the 
same segments (e.g. a realtime and a nightly  [...]
 |`lateMessageRejectionPeriod`|ISO8601 Period|Configure tasks to reject 
messages with timestamps earlier than this period before the task was created; 
for example if this is set to `PT1H` and the supervisor creates a task at 
*2016-01-01T12:00Z*, messages with timestamps earlier than *2016-01-01T11:00Z* 
will be dropped. This may help prevent concurrency issues if your data stream 
has late messages and you have multiple pipelines that need to operate on the 
same segments (e.g. a realtime an [...]
 |`earlyMessageRejectionPeriod`|ISO8601 Period|Configure tasks to reject 
messages with timestamps later than this period after the task reached its 
taskDuration; for example if this is set to `PT1H`, the taskDuration is set to 
`PT1H` and the supervisor creates a task at *2016-01-01T12:00Z*, messages with 
timestamps later than *2016-01-01T14:00Z* will be dropped. **Note:** Tasks 
sometimes run past their task duration, for example, in cases of supervisor 
failover. Setting earlyMessageReject [...]
-|`autoScalerConfig`|Object|Defines auto scaling behavior for Kafka ingest 
tasks. See [Tasks Autoscaler Properties](#Task Autoscaler Properties).|no 
(default == null)|
+|`autoScalerConfig`|Object|Defines auto scaling behavior for Kafka ingest 
tasks. See [Tasks Autoscaler Properties](#task-autoscaler-properties).|no 
(default == null)|
 
-### Task Autoscaler Properties
+#### Task Autoscaler Properties
 
 > Note that Task AutoScaler is currently designated as experimental.
 
@@ -159,9 +159,9 @@ Where the file `supervisor-spec.json` contains a Kafka 
supervisor spec:
 | `taskCountMax` | Maximum number of ingestion tasks. Set `taskCountMax >= 
taskCountMin`. If `taskCountMax > {numKafkaPartitions}`, Druid only scales 
reading tasks up to the `{numKafkaPartitions}`. In this case `taskCountMax` is 
ignored.  | yes |
 | `taskCountMin` | Minimum number of ingestion tasks. When you enable 
autoscaler, Druid ignores the value of taskCount in `IOConfig` and starts with 
the `taskCountMin` number of tasks.| yes |
 | `minTriggerScaleActionFrequencyMillis` | Minimum time interval between two 
scale actions. | no (default == 600000) |
-| `autoScalerStrategy` | The algorithm of `autoScaler`. Only supports 
`lagBased`. See [Lag Based AutoScaler Strategy Related Properties](#Lag Based 
AutoScaler Strategy Related Properties) for details.| no (default == 
`lagBased`) |
+| `autoScalerStrategy` | The algorithm of `autoScaler`. Only supports 
`lagBased`. See [Lag Based AutoScaler Strategy Related 
Properties](#lag-based-autoscaler-strategy-related-properties) for details.| no 
(default == `lagBased`) |
 
-### Lag Based AutoScaler Strategy Related Properties
+##### Lag Based AutoScaler Strategy Related Properties
 | Property | Description | Required |
 | ------------- | ------------- | ------------- |
 | `lagCollectionIntervalMillis` | Period of lag points collection.  | no 
(default == 30000) |
@@ -222,7 +222,7 @@ The following example demonstrates supervisor spec with 
`lagBased` autoScaler en
 #### More on consumerProperties
 
 This must contain a property `bootstrap.servers` with a list of Kafka brokers 
in the form: `<BROKER_1>:<PORT_1>,<BROKER_2>:<PORT_2>,...`.
-By default, `isolation.level` is set to `read_committed`. It should be set to 
`read_uncommitted` if you don't want Druid to consume only committed 
transactions or working with older versions of Kafka servers with no 
Transactions support.
+By default, `isolation.level` is set to `read_committed`. It should be set to 
`read_uncommitted` if you don't want Druid to consume only committed 
transactions or working with older versions of Kafka servers with no 
transactions support.
 
 There are few cases that require fetching few/all of consumer properties at 
runtime e.g. when `bootstrap.servers` is not known upfront or not static, to 
enable SSL connections users might have to provide passwords for `keystore`, 
`truststore` and `key` secretly.
 For such consumer properties, user can implement a 
[DynamicConfigProvider](../../operations/dynamic-config-provider.md) to supply 
them at runtime, by adding
diff --git a/docs/development/extensions-core/kinesis-ingestion.md 
b/docs/development/extensions-core/kinesis-ingestion.md
index 2a4e1af..2dc60ef 100644
--- a/docs/development/extensions-core/kinesis-ingestion.md
+++ b/docs/development/extensions-core/kinesis-ingestion.md
@@ -146,9 +146,9 @@ Where the file `supervisor-spec.json` contains a Kinesis 
supervisor spec:
 |`awsAssumedRoleArn`|String|The AWS assumed role to use for additional 
permissions.|no|
 |`awsExternalId`|String|The AWS external id to use for additional 
permissions.|no|
 |`deaggregate`|Boolean|Whether to use the de-aggregate function of the KCL. 
See below for details.|no|
-|`autoScalerConfig`|Object|Defines auto scaling behavior for Kinesis ingest 
tasks. See [Tasks Autoscaler Properties](#Task Autoscaler Properties).|no 
(default == null)|
+|`autoScalerConfig`|Object|Defines auto scaling behavior for Kinesis ingest 
tasks. See [Tasks Autoscaler Properties](#task-autoscaler-properties).|no 
(default == null)|
 
-### Task Autoscaler Properties
+#### Task Autoscaler Properties
 
 > Note that Task AutoScaler is currently designated as experimental.
 
@@ -158,9 +158,9 @@ Where the file `supervisor-spec.json` contains a Kinesis 
supervisor spec:
 | `taskCountMax` | Maximum number of Kinesis ingestion tasks. Must be greater 
than or equal to `taskCountMin`. If greater than `{numKinesisShards}`, the 
maximum number of reading tasks is `{numKinesisShards}` and `taskCountMax` is 
ignored.  | yes |
 | `taskCountMin` | Minimum number of Kinesis ingestion tasks. When you enable 
the auto scaler, Druid ignores the value of taskCount in `IOConfig` and 
uses`taskCountMin` for the initial number of tasks to launch.| yes |
 | `minTriggerScaleActionFrequencyMillis` | Minimum time interval between two 
scale actions | no (default == 600000) |
-| `autoScalerStrategy` | The algorithm of `autoScaler`. ONLY `lagBased` is 
supported for now. See [Lag Based AutoScaler Strategy Related Properties](#Lag 
Based AutoScaler Strategy Related Properties) for details.| no (default == 
`lagBased`) |
+| `autoScalerStrategy` | The algorithm of `autoScaler`. ONLY `lagBased` is 
supported for now. See [Lag Based AutoScaler Strategy Related 
Properties](#lag-based-autoscaler-strategy-related-properties) for details.| no 
(default == `lagBased`) |
 
-### Lag Based AutoScaler Strategy Related Properties
+##### Lag Based AutoScaler Strategy Related Properties
 
 The Kinesis indexing service reports lag metrics measured in time milliseconds 
rather than message count which is used by Kafka.
 
@@ -303,7 +303,7 @@ The tuningConfig is optional and default parameters will be 
used if no tuningCon
 | `intermediateHandoffPeriod`           | ISO8601 Period | How often the tasks 
should hand off segments. Handoff will happen either if `maxRowsPerSegment` or 
`maxTotalRows` is hit or every `intermediateHandoffPeriod`, whichever happens 
earlier.                                                                        
                                                                                
                                                                                
                 [...]
 | `logParseExceptions`                  | Boolean        | If true, log an 
error message when a parsing exception occurs, containing information about the 
row where the error occurred.                                                   
                                                                                
                                                                                
                                                                                
                  [...]
 | `maxParseExceptions`                  | Integer        | The maximum number 
of parse exceptions that can occur before the task halts ingestion and fails. 
Overridden if `reportParseExceptions` is set.                                   
                                                                                
                                                                                
                                                                                
                 [...]
-| `maxSavedParseExceptions`             | Integer        | When a parse 
exception occurs, Druid can keep track of the most recent parse exceptions. 
"maxSavedParseExceptions" limits how many exception instances will be saved. 
These saved exceptions will be made available after the task finishes in the 
[task completion report](../../ingestion/tasks.md#reports). Overridden if 
`reportParseExceptions` is set.                                                 
                                     [...]
+| `maxSavedParseExceptions`             | Integer        | When a parse 
exception occurs, Druid can keep track of the most recent parse exceptions. 
"maxSavedParseExceptions" limits how many exception instances will be saved. 
These saved exceptions will be made available after the task finishes in the 
[task completion report](../../ingestion/tasks.md#task-reports). Overridden if 
`reportParseExceptions` is set.                                                 
                                [...]
 | `maxRecordsPerPoll`                   | Integer        | The maximum number 
of records/events to be fetched from buffer per poll. The actual maximum will 
be `Max(maxRecordsPerPoll, Max(bufferSize, 1))`                                 
                                                                                
                                                                                
                                                                                
                 [...]
 | `repartitionTransitionDuration`       | ISO8601 Period | When shards are 
split or merged, the supervisor will recompute shard -> task group mappings, 
and signal any running tasks created under the old mappings to stop early at 
(current time + `repartitionTransitionDuration`). Stopping the tasks early 
allows Druid to begin reading from the new shards more quickly. The repartition 
transition wait time controlled by this property gives the stream additional 
time to write records to the ne [...]
 | `offsetFetchPeriod`                   | ISO8601 Period | How often the 
supervisor queries Kinesis and the indexing tasks to fetch current offsets and 
calculate lag. If the user-specified value is below the minimum value (`PT5S`), 
the supervisor ignores the value and uses the minimum value instead.            
                                                                                
                                                                                
                     [...]

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[druid] branch master updated: Fix broken anchor and heading levels in Kafka/Kinesis ingestion (#11748)

Reply via email to