(flink-cdc) branch master updated: [FLINK-38059][doc] Add fluss pipeline connector documentation (#4088)

kunni Mon, 11 Aug 2025 22:31:36 -0700

This is an automated email from the ASF dual-hosted git repository.

kunni pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/flink-cdc.git



The following commit(s) were added to refs/heads/master by this push:
     new 35504bdda [FLINK-38059][doc] Add fluss pipeline connector documentation (#4088)
35504bdda is described below

commit 35504bddaa531ea7d9893bb73b8f410be9f5ea54
Author: Junbo Wang <beryllw...@gmail.com>
AuthorDate: Tue Aug 12 13:31:26 2025 +0800

    [FLINK-38059][doc] Add fluss pipeline connector documentation (#4088)
    
    Co-authored-by: wangjunbo <wangju...@qiyi.com>
---
 .../docs/connectors/pipeline-connectors/fluss.md   | 243 ++++++++++++++++++++
 .../docs/connectors/pipeline-connectors/fluss.md   | 246 +++++++++++++++++++++
 2 files changed, 489 insertions(+)

diff --git a/docs/content.zh/docs/connectors/pipeline-connectors/fluss.md b/docs/content.zh/docs/connectors/pipeline-connectors/fluss.md
new file mode 100644
index 000000000..70086632d
--- /dev/null
+++ b/docs/content.zh/docs/connectors/pipeline-connectors/fluss.md
@@ -0,0 +1,243 @@
+---
+title: "Fluss"
+weight: 4
+type: docs
+aliases:
+- /connectors/pipeline-connectors/fluss
+---
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+# Fluss Pipeline Connector
+The Fluss Pipeline connector can be used as the *Data Sink* of a pipeline to write data to [Fluss](https://fluss.apache.org). This document describes how to configure the Fluss Pipeline connector.
+
+## What can the connector do?
+* Automatically creates tables that do not exist
+* Data synchronization
+
+How to create Pipeline
+----------------
+
+A pipeline that reads data from MySQL and writes it to Fluss can be defined as follows:
+
+```yaml
+source:
+   type: mysql
+   name: MySQL Source
+   hostname: 127.0.0.1
+   port: 3306
+   username: admin
+   password: pass
+   tables: adb.\.*, bdb.user_table_[0-9]+, [app|web].order_\.*
+   server-id: 5401-5404
+
+sink:
+  type: fluss
+  name: Fluss Sink
+  bootstrap.servers: localhost:9123
+  # Security-related properties for the Fluss client
+  properties.client.security.protocol: sasl
+  properties.client.security.sasl.mechanism: PLAIN
+  properties.client.security.sasl.username: developer
+  properties.client.security.sasl.password: developer-pass
+
+pipeline:
+  name: MySQL to Fluss Pipeline
+  parallelism: 2
+```
+
+Pipeline Connector Options
+----------------
+<div class="highlight">
+<table class="colwidths-auto docutils">
+   <thead>
+      <tr>
+        <th class="text-left" style="width: 25%">Option</th>
+        <th class="text-left" style="width: 8%">Required</th>
+        <th class="text-left" style="width: 7%">Default</th>
+        <th class="text-left" style="width: 10%">Type</th>
+        <th class="text-left" style="width: 50%">Description</th>
+      </tr>
+    </thead>
+    <tbody>
+    <tr>
+      <td>type</td>
+      <td>required</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Specifies the connector to use; this must be set to <code>'fluss'</code>.</td>
+    </tr>
+    <tr>
+      <td>name</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>The name of the sink.</td>
+    </tr>
+    <tr>
+      <td>bootstrap.servers</td>
+      <td>required</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>A list of host/port pairs used to establish the initial connection to the Fluss cluster.</td>
+    </tr>
+    <tr>
+      <td>bucket.key</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
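+      <td>Specifies the data distribution policy of each Fluss table. Tables are separated by ';', and bucket keys are separated by ','. Format: database1.table1:key1,key2;database1.table2:key3.
+          Data is distributed to buckets according to the hash value of the bucket key (the bucket key must be a subset of the primary key and must not include the partition keys of the primary key table).
+          If a table has a primary key and no bucket key is specified, the bucket key defaults to the primary key (excluding the partition key); if a table has no primary key and no bucket key is specified, data is distributed to buckets randomly.</td>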
+    </tr>
+    <tr>
+      <td>bucket.num</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>The number of buckets of each Fluss table. Tables are separated by ';'. Format: database1.table1:4;database1.table2:8.</td>
+    </tr>
+    <tr>
+      <td>properties.table.*</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Passes options supported by Fluss tables to the pipeline. See <a href="https://fluss.apache.org/docs/engine-flink/options/#storage-options">Fluss table options</a>.</td>
+    </tr>
+    <tr>
+      <td>properties.client.*</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Passes options supported by the Fluss client to the pipeline. See <a href="https://fluss.apache.org/docs/engine-flink/options/#write-options">Fluss client options</a>.</td>
+    </tr>
+    </tbody>
+</table>    
+</div>
+
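+## Usage Notes
+
+* Both Fluss primary key tables and log tables are supported.
+
+* For automatic table creation
+  * There is no partition key
+  * The number of buckets is controlled by the `bucket.num` option
+  * Data distribution is controlled by the `bucket.key` option. For a primary key table, if no bucket key is specified, the bucket key defaults to the primary key (excluding the partition key); for a log table without a primary key, if no bucket key is specified, data is distributed to buckets randomly.
+
+* Schema change synchronization is not supported. To ignore schema changes, use `schema.change.behavior: IGNORE`.
+
+* For data synchronization, the pipeline connector uses the [Fluss Java Client](https://fluss.apache.org/docs/apis/java-client/) to write data to Fluss.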
+
+Data Type Mapping
+----------------
+<div class="wy-table-responsive">
+<table class="colwidths-auto docutils">
+    <thead>
+      <tr>
+        <th class="text-left">Flink CDC type</th>
+        <th class="text-left">Fluss type</th>
+        <th class="text-left" style="width:60%;">Note</th>
+      </tr>
+    </thead>
+    <tbody>
+    <tr>
+      <td>TINYINT</td>
+      <td>TINYINT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>SMALLINT</td>
+      <td>SMALLINT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>INT</td>
+      <td>INT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BIGINT</td>
+      <td>BIGINT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>FLOAT</td>
+      <td>FLOAT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DOUBLE</td>
+      <td>DOUBLE</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DECIMAL(p, s)</td>
+      <td>DECIMAL(p, s)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BOOLEAN</td>
+      <td>BOOLEAN</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DATE</td>
+      <td>DATE</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>TIME</td>
+      <td>TIME</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>TIMESTAMP</td>
+      <td>TIMESTAMP</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>TIMESTAMP_LTZ</td>
+      <td>TIMESTAMP_LTZ</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>CHAR(n)</td>
+      <td>CHAR(n)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>VARCHAR(n)</td>
+      <td>VARCHAR(n)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BINARY(n)</td>
+      <td>BINARY(n)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>VARBINARY(n)</td>
+      <td>BYTES</td>
+      <td></td>
+    </tr>
+    </tbody>
+</table>
+</div>
+
+{{< top >}}
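
As a hedged illustration of the `bucket.key` and `bucket.num` formats described in the options table above, a sink block might look like the sketch below. The database, table, and column names (`app_db.orders`, `order_id`, and so on) are hypothetical and not part of this commit, and the bucket counts are arbitrary.

```yaml
sink:
  type: fluss
  name: Fluss Sink
  bootstrap.servers: localhost:9123
  # Hypothetical tables: app_db.orders is bucketed by order_id,
  # app_db.order_items by the pair (order_id, item_id).
  # Each bucket key must be a subset of the table's primary key.
  bucket.key: app_db.orders:order_id;app_db.order_items:order_id,item_id
  # Hypothetical bucket counts: 4 for orders, 8 for order_items.
  bucket.num: app_db.orders:4;app_db.order_items:8
```

Per the option description, a primary key table with no `bucket.key` entry is bucketed by its primary key (excluding the partition key), and a table without a primary key is distributed randomly.
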
diff --git a/docs/content/docs/connectors/pipeline-connectors/fluss.md b/docs/content/docs/connectors/pipeline-connectors/fluss.md
new file mode 100644
index 000000000..9e3994ea4
--- /dev/null
+++ b/docs/content/docs/connectors/pipeline-connectors/fluss.md
@@ -0,0 +1,246 @@
+---
+title: "Fluss"
+weight: 4
+type: docs
+aliases:
+- /connectors/pipeline-connectors/fluss
+---
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+# Fluss Pipeline Connector
+
+The Fluss Pipeline connector can be used as the *Data Sink* of a pipeline to write data to [Fluss](https://fluss.apache.org). This document describes how to set up the Fluss Pipeline connector.
+
+## What can the connector do?
+* Automatically creates tables that do not exist
+* Data synchronization
+
+How to create Pipeline
+----------------
+
+A pipeline that reads data from MySQL and writes it to Fluss can be defined as follows:
+
+```yaml
+source:
+   type: mysql
+   name: MySQL Source
+   hostname: 127.0.0.1
+   port: 3306
+   username: admin
+   password: pass
+   tables: adb.\.*, bdb.user_table_[0-9]+, [app|web].order_\.*
+   server-id: 5401-5404
+
+sink:
+  type: fluss
+  name: Fluss Sink
+  bootstrap.servers: localhost:9123
+  # Security-related properties for the Fluss client
+  properties.client.security.protocol: sasl
+  properties.client.security.sasl.mechanism: PLAIN
+  properties.client.security.sasl.username: developer
+  properties.client.security.sasl.password: developer-pass
+
+pipeline:
+  name: MySQL to Fluss Pipeline
+  parallelism: 2
+```
+
+Pipeline Connector Options
+----------------
+<div class="highlight">
+<table class="colwidths-auto docutils">
+   <thead>
+      <tr>
+        <th class="text-left" style="width: 25%">Option</th>
+        <th class="text-left" style="width: 8%">Required</th>
+        <th class="text-left" style="width: 7%">Default</th>
+        <th class="text-left" style="width: 10%">Type</th>
+        <th class="text-left" style="width: 50%">Description</th>
+      </tr>
+    </thead>
+    <tbody>
+    <tr>
+      <td>type</td>
+      <td>required</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Specifies the connector to use; this must be set to <code>'fluss'</code>.</td>
+    </tr>
+    <tr>
+      <td>name</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>The name of the sink.</td>
+    </tr>
+    <tr>
+      <td>bootstrap.servers</td>
+      <td>required</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>The bootstrap servers for the Fluss sink connection. </td>
+    </tr>
+    <tr>
+      <td>bucket.key</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Specifies the distribution policy of each Fluss table. Tables are separated by ';', and bucket keys are separated by ','.
+          Format: database1.table1:key1,key2;database1.table2:key3. Data is distributed to buckets according to the hash value of the bucket key (it must be a subset of the primary key, excluding the partition keys of the primary key table).
+          If the table has a primary key and no bucket key is specified, the primary key (excluding the partition key) is used as the bucket key.
+          If the table has no primary key and no bucket key is specified, data is distributed to buckets randomly.</td>
+    </tr>
+    <tr>
+      <td>bucket.num</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>The number of buckets of each Fluss table. Tables are separated by ';'. Format: database1.table1:4;database1.table2:8.</td>
+    </tr>
+    <tr>
+      <td>properties.table.*</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Passes Fluss table options to the pipeline. See <a href="https://fluss.apache.org/docs/engine-flink/options/#storage-options">Fluss table options</a>.</td>
+    </tr>
+    <tr>
+      <td>properties.client.*</td>
+      <td>optional</td>
+      <td style="word-wrap: break-word;">(none)</td>
+      <td>String</td>
+      <td>Passes Fluss client options to the pipeline. See <a href="https://fluss.apache.org/docs/engine-flink/options/#write-options">Fluss client options</a>.</td>
+    </tr>
+    </tbody>
+</table>    
+</div>
+
+## Usage Notes
+
+* Both Fluss primary key tables and log tables are supported.
+
+* For automatic table creation
+  * There is no partition key
+  * The number of buckets is controlled by the `bucket.num` option
+  * The distribution keys are controlled by the `bucket.key` option. For a primary key table with no bucket key specified, the primary key (excluding the partition key) is used as the bucket key. For a log table, which has no primary key, data is distributed to buckets randomly if no bucket key is specified.
+
+* Schema change synchronization is not supported. To ignore schema changes, use `schema.change.behavior: IGNORE`.
+
+* For data synchronization, the pipeline connector uses the [Fluss Java Client](https://fluss.apache.org/docs/apis/java-client/)
+  to write data to Fluss.
+
+Data Type Mapping
+----------------
+<div class="wy-table-responsive">
+<table class="colwidths-auto docutils">
+    <thead>
+      <tr>
+        <th class="text-left">Flink CDC type</th>
+        <th class="text-left">Fluss type</th>
+        <th class="text-left" style="width:60%;">Note</th>
+      </tr>
+    </thead>
+    <tbody>
+    <tr>
+      <td>TINYINT</td>
+      <td>TINYINT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>SMALLINT</td>
+      <td>SMALLINT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>INT</td>
+      <td>INT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BIGINT</td>
+      <td>BIGINT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>FLOAT</td>
+      <td>FLOAT</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DOUBLE</td>
+      <td>DOUBLE</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DECIMAL(p, s)</td>
+      <td>DECIMAL(p, s)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BOOLEAN</td>
+      <td>BOOLEAN</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>DATE</td>
+      <td>DATE</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>TIME</td>
+      <td>TIME</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>TIMESTAMP</td>
+      <td>TIMESTAMP</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>TIMESTAMP_LTZ</td>
+      <td>TIMESTAMP_LTZ</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>CHAR(n)</td>
+      <td>CHAR(n)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>VARCHAR(n)</td>
+      <td>VARCHAR(n)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>BINARY(n)</td>
+      <td>BINARY(n)</td>
+      <td></td>
+    </tr>
+    <tr>
+      <td>VARBINARY(n)</td>
+      <td>BYTES</td>
+      <td></td>
+    </tr>
+    </tbody>
+</table>
+</div>
+
+{{< top >}}
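
As a minimal sketch of the schema-change note in the usage notes above, the `schema.change.behavior` setting could be added to the `pipeline` block of the definition. Placing it under `pipeline` follows general Flink CDC pipeline configuration and is an assumption here, since the documentation in this commit only names the option.

```yaml
pipeline:
  name: MySQL to Fluss Pipeline
  parallelism: 2
  # The Fluss sink does not synchronize schema changes,
  # so upstream schema change events are ignored here.
  schema.change.behavior: IGNORE
```

With this setting the tables keep the schema they were created with, so columns added upstream later would not appear in Fluss.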
