This is an automated email from the ASF dual-hosted git repository.
wanghailin pushed a commit to branch dev
in repository https://gitbox.apache.org/repos/asf/seatunnel.git
The following commit(s) were added to refs/heads/dev by this push:
new 194472ba8f [Improve][Document] Fix Postgre to PostgreSQL in
Postgre-CDC.md file (#7112)
194472ba8f is described below
commit 194472ba8f9a760eb6f7ac4be95a3e2023fe9c06
Author: Feng Ruohang <[email protected]>
AuthorDate: Mon Jul 8 14:54:58 2024 +0800
[Improve][Document] Fix Postgre to PostgreSQL in Postgre-CDC.md file (#7112)
---
.../source/{Postgre-CDC.md => PostgreSQL-CDC.md} | 14 +++++++-------
1 file changed, 7 insertions(+), 7 deletions(-)
diff --git a/docs/en/connector-v2/source/Postgre-CDC.md
b/docs/en/connector-v2/source/PostgreSQL-CDC.md
similarity index 96%
rename from docs/en/connector-v2/source/Postgre-CDC.md
rename to docs/en/connector-v2/source/PostgreSQL-CDC.md
index 28631919dd..e0a5936df0 100644
--- a/docs/en/connector-v2/source/Postgre-CDC.md
+++ b/docs/en/connector-v2/source/PostgreSQL-CDC.md
@@ -1,6 +1,6 @@
-# Postgre CDC
+# PostgreSQL CDC
-> Postgre CDC source connector
+> PostgreSQL CDC source connector
## Support Those Engines
@@ -40,7 +40,7 @@ describes how to set up the Postgre CDC connector to run SQL
queries against Pos
> 1. You need to ensure that the [jdbc driver jar
> package](https://mvnrepository.com/artifact/org.postgresql/postgresql) has
> been placed in directory `${SEATUNNEL_HOME}/lib/`.
-Please download and put Postgre driver in `${SEATUNNEL_HOME}/lib/` dir. For
example: cp postgresql-xxx.jar `$SEATNUNNEL_HOME/lib/`
+Please download and put PostgreSQL driver in `${SEATUNNEL_HOME}/lib/` dir. For
example: cp postgresql-xxx.jar `$SEATNUNNEL_HOME/lib/`
> Here are the steps to enable CDC (Change Data Capture) in PostgreSQL:
@@ -94,7 +94,7 @@ ALTER TABLE your_table_name REPLICA IDENTITY FULL;
| database-names | List | No | -
| Database name of the database to monitor.
[...]
| table-names | List | Yes | -
| Table name of the database to monitor. The table name needs to include the
database name, for example: `database_name.table_name`
[...]
| table-names-config | List | No | -
| Table config list. for example: [{"table":
"db1.schema1.table1","primaryKeys":["key1"]}]
[...]
-| startup.mode | Enum | No |
INITIAL | Optional startup mode for Postgre CDC consumer, valid enumerations
are `initial`, `earliest`, `latest` and `specific`. <br/> `initial`:
Synchronize historical data at startup, and then synchronize incremental
data.<br/> `earliest`: Startup from the earliest offset possible.<br/>
`latest`: Startup from the latest offset.<br/> `specific`: Startup from
user-supplied specific offsets. [...]
+| startup.mode | Enum | No |
INITIAL | Optional startup mode for PostgreSQL CDC consumer, valid
enumerations are `initial`, `earliest`, `latest` and `specific`. <br/>
`initial`: Synchronize historical data at startup, and then synchronize
incremental data.<br/> `earliest`: Startup from the earliest offset
possible.<br/> `latest`: Startup from the latest offset.<br/> `specific`:
Startup from user-supplied specific offsets. [...]
| snapshot.split.size | Integer | No | 8096
| The split size (number of rows) of table snapshot, captured tables are
split into multiple splits when read the snapshot of table.
[...]
| snapshot.fetch.size | Integer | No | 1024
| The maximum fetch size for per poll when read table snapshot.
[...]
| slot.name | String | No | -
| The name of the PostgreSQL logical decoding slot that was created for
streaming changes from a particular plug-in for a particular database/schema.
The server uses this slot to stream events to the connector that you are
configuring. Default is seatunnel.
[...]
@@ -108,8 +108,8 @@ ALTER TABLE your_table_name REPLICA IDENTITY FULL;
| sample-sharding.threshold | Integer | No | 1000
| This configuration specifies the threshold of estimated shard count to
trigger the sample sharding strategy. When the distribution factor is outside
the bounds specified by `chunk-key.even-distribution.factor.upper-bound` and
`chunk-key.even-distribution.factor.lower-bound`, and the estimated shard count
(calculated as approximate row count / chunk size) exceeds this threshold, the
sample sharding strateg [...]
| inverse-sampling.rate | Integer | No | 1000
| The inverse of the sampling rate used in the sample sharding strategy. For
example, if this value is set to 1000, it means a 1/1000 sampling rate is
applied during the sampling process. This option provides flexibility in
controlling the granularity of the sampling, thus affecting the final number of
shards. It's especially useful when dealing with very large datasets where a
lower sampling rate is prefer [...]
| exactly_once | Boolean | No | false
| Enable exactly once semantic.
[...]
-| format | Enum | No |
DEFAULT | Optional output format for Postgre CDC, valid enumerations are
`DEFAULT`、`COMPATIBLE_DEBEZIUM_JSON`.
[...]
-| debezium | Config | No | -
| Pass-through [Debezium's
properties](https://github.com/debezium/debezium/blob/v1.9.8.Final/documentation/modules/ROOT/pages/connectors/postgresql.adoc#connector-configuration-properties)
to Debezium Embedded Engine which is used to capture data changes from Postgre
server.
[...]
+| format | Enum | No |
DEFAULT | Optional output format for PostgreSQL CDC, valid enumerations are
`DEFAULT`, `COMPATIBLE_DEBEZIUM_JSON`.
[...]
+| debezium | Config | No | -
| Pass-through [Debezium's
properties](https://github.com/debezium/debezium/blob/v1.9.8.Final/documentation/modules/ROOT/pages/connectors/postgresql.adoc#connector-configuration-properties)
to Debezium Embedded Engine which is used to capture data changes from
PostgreSQL server.
[...]
| common-options | | no | -
| Source plugin common parameters, please refer to [Source Common
Options](common-options.md) for details
[...]
## Task Example
@@ -190,7 +190,7 @@ source {
## Changelog
-- Add Postgre CDC Source Connector
+- Add PostgreSQL CDC Source Connector
### next version