This is an automated email from the ASF dual-hosted git repository.
dockerzhang pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/inlong-website.git
The following commit(s) were added to refs/heads/master by this push:
new 24e5cad76c [INLONG-645][Doc] Starrocks load node documents (#646)
24e5cad76c is described below
commit 24e5cad76cceee4b6f3352d4b5a827f110c3e4cc
Author: Liao Rui <[email protected]>
AuthorDate: Mon Dec 12 14:21:46 2022 +0800
[INLONG-645][Doc] Starrocks load node documents (#646)
Co-authored-by: ryanrliao <[email protected]>
---
docs/data_node/load_node/starrocks.md | 322 +++++++++++++++++++++
docs/introduction.md | 2 +
.../current/data_node/load_node/starrocks.md | 319 ++++++++++++++++++++
.../current/introduction.md | 2 +
4 files changed, 645 insertions(+)
diff --git a/docs/data_node/load_node/starrocks.md
b/docs/data_node/load_node/starrocks.md
new file mode 100644
index 0000000000..193d8bf5e5
--- /dev/null
+++ b/docs/data_node/load_node/starrocks.md
@@ -0,0 +1,322 @@
+---
+title: StarRocks
+sidebar_position: 17
+---
+
+import {siteVariables} from '../../version';
+
+## Overview
+ - The `StarRocks Load` node supports writing data to the StarRocks database.
+ - Two sink modes are supported: Single-sink writes to a fixed database name and table name; Multi-sink derives the database and table names from the source data format, which suits scenarios such as multi-table writing or whole-database synchronization.
+ - This document describes how to set up a StarRocks Load node to sink data to StarRocks.
+## Supported Version
+
+| Load Node | StarRocks version |
+|---------------------|----------------|
+| [StarRocks](./starrocks.md) | 2.0+ |
+
+## Dependencies
+
+In order to set up the StarRocks Load node, the dependency information needed
to use a build automation tool
+such as Maven or SBT is provided below.
+
+### Maven dependency
+
+<pre><code parentName="pre">
+{`<dependency>
+ <groupId>org.apache.inlong</groupId>
+ <artifactId>sort-connector-starrocks</artifactId>
+ <version>${siteVariables.inLongVersion}</version>
+</dependency>
+`}
+</code></pre>
+
+## Prepare
+### Create MySql Extract table
+- For Single-sink: Create a table `cdc.cdc_mysql_source` in the MySQL
database. The command is as follows:
+```sql
+[root@fe001 ~]# mysql -u root -h localhost -P 3306 -p123456
+mysql> use cdc;
+Database changed
+mysql> CREATE TABLE `cdc_mysql_source` (
+ `id` int(11) NOT NULL AUTO_INCREMENT,
+ `name` varchar(64) DEFAULT NULL,
+ `dr` tinyint(3) DEFAULT 0,
+ PRIMARY KEY (`id`)
+ );
+Query OK, 0 rows affected (0.02 sec)
+
+mysql> insert into cdc_mysql_source values(1, 'zhangsan', 0),(2, 'lisi',
0),(3, 'wangwu', 0);
+Query OK, 3 rows affected (0.01 sec)
+Records: 3 Duplicates: 0 Warnings: 0
+
+mysql> select * from cdc_mysql_source;
++----+----------+----+
+| id | name | dr |
++----+----------+----+
+| 1 | zhangsan | 0 |
+| 2 | lisi | 0 |
+| 3 | wangwu | 0 |
++----+----------+----+
+3 rows in set (0.07 sec)
+```
+- For Multi-sink: Create tables `user_db.user_id_name` and `user_db.user_id_score`
in the MySQL database. The command is as follows:
+```sql
+[root@fe001 ~]# mysql -u root -h localhost -P 3306 -p123456
+mysql> use user_db;
+Database changed
+mysql> CREATE TABLE `user_id_name` (
+ `id` int(11) NOT NULL AUTO_INCREMENT,
+ `name` varchar(64) DEFAULT NULL,
+ PRIMARY KEY (`id`)
+ );
+Query OK, 0 rows affected (0.02 sec)
+
+mysql> CREATE TABLE `user_id_score` (
+ `id` int(11) NOT NULL AUTO_INCREMENT,
+ `score` double default 0,
+ PRIMARY KEY (`id`)
+ );
+Query OK, 0 rows affected (0.02 sec)
+
+mysql> insert into user_id_name values(1001, 'lily'),(1002, 'tom'),(1003,
'alan');
+Query OK, 3 rows affected (0.01 sec)
+Records: 3 Duplicates: 0 Warnings: 0
+
+mysql> insert into user_id_score values(1001, 99),(1002, 96),(1003, 98);
+Query OK, 3 rows affected (0.01 sec)
+Records: 3 Duplicates: 0 Warnings: 0
+
+mysql> select * from user_id_name;
++------+--------+
+| id | name |
++------+--------+
+| 1001 | lily |
+| 1002 | tom |
+| 1003 | alan |
++------+--------+
+3 rows in set (0.07 sec)
+
+mysql> select * from user_id_score;
++------+-------+
+| id   | score |
++------+-------+
+| 1001 |    99 |
+| 1002 |    96 |
+| 1003 |    98 |
++------+-------+
+3 rows in set (0.07 sec)
+```
+
+### Create StarRocks Load table
+- For Single-sink: Create a table `cdc.cdc_starrocks_sink` in the StarRocks
database. The command is as follows:
+```sql
+[root@fe001 ~]# mysql -u username -h localhost -P 9030 -p password
+mysql> use cdc;
+Reading table information for completion of table and column names
+You can turn off this feature to get a quicker startup with -A
+Database changed
+
+mysql> CREATE TABLE `cdc_starrocks_sink` (
+ `id` int(11) NOT NULL COMMENT "user id",
+ `name` varchar(50) NOT NULL COMMENT "user name",
+ `dr` tinyint(4) NULL COMMENT "delete tag"
+ ) ENGINE=OLAP
+ PRIMARY KEY(`id`)
+ COMMENT "OLAP"
+ DISTRIBUTED BY HASH(`id`) BUCKETS 1
+ PROPERTIES (
+ "replication_allocation" = "tag.location.default: 1"
+ );
+Query OK, 0 rows affected (0.06 sec)
+```
+- For Multi-sink: Create tables
`user_db.starrocks_user_id_name` and `user_db.starrocks_user_id_score` in the
StarRocks database. The command is as follows:
+```sql
+[root@fe001 ~]# mysql -u username -h localhost -P 9030 -p password
+mysql> use user_db;
+Reading table information for completion of table and column names
+You can turn off this feature to get a quicker startup with -A
+Database changed
+
+mysql> CREATE TABLE `starrocks_user_id_name` (
+ `id` int(11) NOT NULL COMMENT "user id",
+ `name` varchar(50) NOT NULL COMMENT "nickname"
+ ) ENGINE=OLAP
+ PRIMARY KEY(`id`)
+ COMMENT "OLAP"
+ DISTRIBUTED BY HASH(`id`) BUCKETS 1
+ PROPERTIES (
+ "replication_allocation" = "tag.location.default: 1"
+ );
+Query OK, 0 rows affected (0.06 sec)
+
+mysql> CREATE TABLE `starrocks_user_id_score` (
+ `id` int(11) NOT NULL COMMENT "user id",
+ `score` double default 0
+ ) ENGINE=OLAP
+ PRIMARY KEY(`id`)
+ COMMENT "OLAP"
+ DISTRIBUTED BY HASH(`id`) BUCKETS 1
+ PROPERTIES (
+ "replication_allocation" = "tag.location.default: 1"
+ );
+Query OK, 0 rows affected (0.06 sec)
+```
+
+## How to create a StarRocks Load Node
+
+### Usage for SQL API
+- For Single-sink: StarRocks load
+```sql
+[root@tasknode001 flink-1.13.5]# ./bin/sql-client.sh -l
./opt/connectors/mysql-cdc-inlong/ -l ./opt/connectors/starrocks/
+Flink SQL> SET 'execution.checkpointing.interval' = '3s';
+[INFO] Session property has been set.
+
+Flink SQL> SET 'table.dynamic-table-options.enabled' = 'true';
+[INFO] Session property has been set.
+
+Flink SQL> CREATE TABLE cdc_mysql_source (
+ > id int
+ > ,name VARCHAR
+ > ,dr TINYINT
+ > ,PRIMARY KEY (id) NOT ENFORCED
+ > ) WITH (
+ > 'connector' = 'mysql-cdc-inlong',
+ > 'hostname' = 'localhost',
+ > 'port' = '3306',
+ > 'username' = 'root',
+ > 'password' = '123456',
+ > 'database-name' = 'cdc',
+ > 'table-name' = 'cdc_mysql_source'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> CREATE TABLE cdc_starrocks_sink (
+ > id INT,
+ > name STRING,
+ > dr TINYINT
+ > ) WITH (
+ > 'connector' = 'starrocks-inlong',
+ > 'fenodes' = 'localhost:8030',
+ > 'table.identifier' = 'cdc.cdc_starrocks_sink',
+ > 'username' = 'username',
+ > 'password' = 'password',
+ > 'sink.properties.format' = 'json',
+ > 'sink.properties.strip_outer_array' = 'true'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> insert into cdc_starrocks_sink select * from cdc_mysql_source /*+
OPTIONS('server-id'='5402') */;
+[INFO] Submitting SQL update statement to the cluster...
+[INFO] SQL update statement has been successfully submitted to the cluster:
+Job ID: 5f89691571d7b3f3ca446589e3d0c3d3
+```
+- For Multi-sink: StarRocks load
+```sql
+./bin/sql-client.sh -l ./opt/connectors/mysql-cdc-inlong/ -l
./opt/connectors/starrocks/
+Flink SQL> SET 'execution.checkpointing.interval' = '3s';
+[INFO] Session property has been set.
+
+Flink SQL> SET 'table.dynamic-table-options.enabled' = 'true';
+[INFO] Session property has been set.
+
+Flink SQL> CREATE TABLE cdc_mysql_source (
+ > id int
+ > ,name VARCHAR
+ > ,dr TINYINT
+ > ,PRIMARY KEY (id) NOT ENFORCED
+ > ) WITH (
+ > 'connector' = 'mysql-cdc-inlong',
+ > 'hostname' = 'localhost',
+ > 'port' = '3306',
+ > 'username' = 'root',
+ > 'password' = '123456',
+ > 'database-name' = 'test',
+ > 'table-name' = 'cdc_mysql_source'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> CREATE TABLE cdc_starrocks_sink (
+ > id INT,
+ > name STRING,
+ > dr TINYINT
+ > ) WITH (
+ > 'connector' = 'starrocks-inlong',
+ > 'fenodes' = 'localhost:8030',
+ > 'username' = 'username',
+ > 'password' = 'password',
+ > 'sink.multiple.enable' = 'true',
+ > 'sink.multiple.format' = 'canal-json',
+ > 'sink.multiple.database-pattern' = '${database}',
+ > 'sink.multiple.table-pattern' = 'starrocks_${table}'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> insert into cdc_starrocks_sink select * from cdc_mysql_source /*+
OPTIONS('server-id'='5402') */;
+[INFO] Submitting SQL update statement to the cluster...
+[INFO] SQL update statement has been successfully submitted to the cluster:
+Job ID: 30feaa0ede92h6b6e25ea0cfda26df5e
+```
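In Multi-sink mode, `sink.multiple.database-pattern` and `sink.multiple.table-pattern` substitute metadata from each raw record into the target identifiers. The following Python sketch is illustrative only (the `resolve_pattern` helper is hypothetical, not connector code); it shows how a canal-json event would be routed under the patterns used above:

```python
import json
import re

def resolve_pattern(pattern: str, meta: dict) -> str:
    """Replace ${key} placeholders in a sink pattern with record metadata."""
    return re.sub(r"\$\{(\w+)\}", lambda m: meta[m.group(1)], pattern)

# A minimal canal-json change event, carrying its source database and table.
event = json.loads(
    '{"data": [{"id": "1001", "name": "lily"}], '
    '"database": "user_db", "table": "user_id_name", "type": "INSERT"}'
)

meta = {"database": event["database"], "table": event["table"]}
database = resolve_pattern("${database}", meta)
table = resolve_pattern("starrocks_${table}", meta)
print(f"{database}.{table}")  # user_db.starrocks_user_id_name
```

With these patterns, an insert into `user_db.user_id_name` on the MySQL side lands in `user_db.starrocks_user_id_name` on the StarRocks side.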
+
+### Usage for InLong Dashboard
+
+TODO: It will be supported in the future.
+
+### Usage for InLong Manager Client
+
+TODO: It will be supported in the future.
+
+## StarRocks Load Node Options
+
+| Option | Required | Default | Type | Description |
+|--------|----------|---------|------|-------------|
+| connector | required | (none) | string | Specify which connector to use, valid values are: `starrocks-inlong` |
+| jdbc-url | required | (none) | string | The JDBC URL used to execute queries against StarRocks. |
+| load-url | required | (none) | string | The FE HTTP addresses, `fe_ip:http_port;fe_ip:http_port` separated by `;`, used for batch sinking. |
+| database-name | required | (none) | string | StarRocks database name |
+| table-name | required | (none) | string | StarRocks table name |
+| username | required | (none) | string | StarRocks connection username |
+| password | required | (none) | string | StarRocks connection password |
+| sink.semantic | optional | at-least-once | string | `at-least-once` or `exactly-once` (data is flushed at checkpoint only, and options like `sink.buffer-flush.*` won't work). |
+| sink.version | optional | AUTO | string | The version of the exactly-once sink implementation. Only available for connector 1.2.4+. If `V2`, use StarRocks' Stream Load transaction interface, which requires StarRocks 2.4+. If `V1`, use the non-transaction Stream Load interface. If `AUTO`, the connector chooses the transaction interface automatically if StarRocks supports the feature, otherwise the non-transaction interface. |
+| sink.buffer-flush.max-bytes | optional | 94371840(90M) | string | The max batching size of the serialized data, range: [64MB, 10GB]. |
+| sink.buffer-flush.max-rows | optional | 500000 | string | The max batching rows, range: [64,000, 5,000,000]. |
+| sink.buffer-flush.interval-ms | optional | 300000 | string | The flushing time interval, range: [1000ms, 3600000ms]. |
+| sink.max-retries | optional | 3 | string | Max retry times of the Stream Load request, range: [0, 10]. |
+| sink.connect.timeout-ms | optional | 1000 | string | Timeout in milliseconds for connecting to the load-url, range: [100, 60000]. |
+| sink.properties.format | optional | CSV | string | The file format of data loaded into StarRocks. Valid values: CSV and JSON. Default value: CSV. |
+| sink.properties.* | optional | (none) | string | Stream Load properties, e.g. 'sink.properties.columns' = 'k1, k2, k3'; see STREAM LOAD for details. Since StarRocks 2.4, flink-connector-starrocks supports partial updates for the Primary Key model. |
+| sink.properties.ignore_json_size | optional | false | string | Ignore the batching size limit (100MB) of JSON data. |
+| sink.multiple.enable | optional | false | boolean | Whether to enable multiple sink writing, default is `false`. When `sink.multiple.enable` is `true`, `sink.multiple.format`, `sink.multiple.database-pattern`, and `sink.multiple.table-pattern` must also be set correctly. |
+| sink.multiple.format | optional | (none) | string | The format of multiple sink; it represents the real format of the raw binary data, currently `canal-json` or `debezium-json`. See [Kafka -- Dynamic Topic Extraction](https://github.com/apache/inlong-website/blob/master/docs/data_node/load_node/kafka.md) for more details. |
+| sink.multiple.database-pattern | optional | (none) | string | Extract the database name from the raw binary data; only used in the multiple sink writing scenario. |
+| sink.multiple.table-pattern | optional | (none) | string | Extract the table name from the raw binary data; only used in the multiple sink writing scenario. |
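The numeric options above have documented validity ranges, which can be checked before submitting a job. The following Python sketch is a hypothetical helper (not part of the connector) that validates a value against those ranges:

```python
# Documented ranges from the option table above; the validator itself is
# a hypothetical illustration, not connector code.
OPTION_RANGES = {
    "sink.buffer-flush.max-bytes":   (64 * 1024 ** 2, 10 * 1024 ** 3),  # [64MB, 10GB]
    "sink.buffer-flush.max-rows":    (64_000, 5_000_000),
    "sink.buffer-flush.interval-ms": (1_000, 3_600_000),                # milliseconds
    "sink.max-retries":              (0, 10),
    "sink.connect.timeout-ms":       (100, 60_000),                     # milliseconds
}

def check_option(key: str, value: int) -> int:
    """Raise if a numeric sink option falls outside its documented range."""
    low, high = OPTION_RANGES[key]
    if not low <= value <= high:
        raise ValueError(f"{key}={value} is outside the documented range [{low}, {high}]")
    return value

check_option("sink.buffer-flush.max-bytes", 94_371_840)  # the 90MB default is valid
```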
+
+## Data Type Mapping
+
+| Flink type | StarRocks type |
+| ---------- | -------------- |
+| BOOLEAN | BOOLEAN |
+| TINYINT | TINYINT |
+| SMALLINT | SMALLINT |
+| INTEGER | INTEGER |
+| BIGINT | BIGINT |
+| FLOAT | FLOAT |
+| DOUBLE | DOUBLE |
+| DECIMAL | DECIMAL |
+| BINARY | INT |
+| CHAR | JSON / STRING |
+| VARCHAR | JSON / STRING |
+| STRING | JSON / STRING |
+| DATE | DATE |
+| TIMESTAMP_WITHOUT_TIME_ZONE(N) | DATETIME |
+| TIMESTAMP_WITH_LOCAL_TIME_ZONE(N) | DATETIME |
+| ARRAY<T> | ARRAY<T> |
+| MAP<KT,VT> | JSON / JSON STRING |
+| ROW<arg T...> | JSON / JSON STRING |
+
+See
[flink-connector-starrocks](https://github.com/StarRocks/starrocks-connector-for-apache-flink/blob/main/README.md)
for more details.
\ No newline at end of file
diff --git a/docs/introduction.md b/docs/introduction.md
index e0d238de04..5fdcff9235 100644
--- a/docs/introduction.md
+++ b/docs/introduction.md
@@ -91,4 +91,6 @@ Apache InLong serves the entire life cycle from data
collection to landing, and
| | Greenplum | 4.x, 5.x, 6.x |
Lightweight, Standard |
| | Elasticsearch | 6.x, 7.x |
Lightweight, Standard |
| | SQLServer | 2012, 2014, 2016, 2017, 2019 |
Lightweight, Standard |
+| | Doris | >= 0.13 |
Lightweight, Standard |
+| | StarRocks | >= 2.0 |
Lightweight, Standard |
| | HDFS | 2.x, 3.x |
Lightweight, Standard |
diff --git
a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data_node/load_node/starrocks.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data_node/load_node/starrocks.md
new file mode 100644
index 0000000000..84de4b7573
--- /dev/null
+++
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data_node/load_node/starrocks.md
@@ -0,0 +1,319 @@
+---
+title: StarRocks
+sidebar_position: 17
+---
+
+import {siteVariables} from '../../version';
+
+## 概览
+
+`StarRocks Load` 节点支持将数据写入 StarRocks 数据库。
+支持单表写入和多表写入两种模式:单表写入为指定固定库名表名写入;多表写入支持根据源端数据格式自定义库名表名写入,适用于源端多表写入或者整库同步等场景。
+本文档介绍如何设置 StarRocks Load 节点实现写入 StarRocks 数据库表。
+
+## 支持的版本
+
+| Load 节点 | StarRocks 版本 |
+|---------------------|----------|
+| [StarRocks](./starrocks.md) | 2.0+ |
+
+## 依赖
+
+为了设置 StarRocks Load 节点, 下面提供了使用构建自动化工具(例如 Maven 或 SBT)所需要的依赖信息。
+
+### Maven 依赖
+
+<pre><code parentName="pre">
+{`<dependency>
+ <groupId>org.apache.inlong</groupId>
+ <artifactId>sort-connector-starrocks</artifactId>
+ <version>${siteVariables.inLongVersion}</version>
+</dependency>
+`}
+</code></pre>
+
+## 准备
+### 创建 MySQL Extract 表
+- 单表写入:在 MySQL `cdc` 数据库中创建表 `cdc_mysql_source`。 命令如下:
+```sql
+[root@fe001 ~]# mysql -u root -h localhost -P 3306 -p123456
+mysql> use cdc;
+Database changed
+mysql> CREATE TABLE `cdc_mysql_source` (
+ `id` int(11) NOT NULL AUTO_INCREMENT,
+ `name` varchar(64) DEFAULT NULL,
+ `dr` tinyint(3) DEFAULT 0,
+ PRIMARY KEY (`id`)
+ );
+Query OK, 0 rows affected (0.02 sec)
+
+mysql> insert into cdc_mysql_source values(1, 'zhangsan', 0),(2, 'lisi',
0),(3, 'wangwu', 0);
+Query OK, 3 rows affected (0.01 sec)
+Records: 3 Duplicates: 0 Warnings: 0
+
+mysql> select * from cdc_mysql_source;
++----+----------+----+
+| id | name | dr |
++----+----------+----+
+| 1 | zhangsan | 0 |
+| 2 | lisi | 0 |
+| 3 | wangwu | 0 |
++----+----------+----+
+3 rows in set (0.07 sec)
+```
+- 多表写入:在 MySQL `user_db` 数据库中创建表 `user_id_name`、`user_id_score`。 命令如下:
+```sql
+[root@fe001 ~]# mysql -u root -h localhost -P 3306 -p123456
+mysql> use user_db;
+Database changed
+mysql> CREATE TABLE `user_id_name` (
+ `id` int(11) NOT NULL AUTO_INCREMENT,
+ `name` varchar(64) DEFAULT NULL,
+ PRIMARY KEY (`id`)
+ );
+Query OK, 0 rows affected (0.02 sec)
+
+mysql> CREATE TABLE `user_id_score` (
+ `id` int(11) NOT NULL AUTO_INCREMENT,
+ `score` double default 0,
+ PRIMARY KEY (`id`)
+ );
+Query OK, 0 rows affected (0.02 sec)
+
+mysql> insert into user_id_name values(1001, 'lily'),(1002, 'tom'),(1003,
'alan');
+Query OK, 3 rows affected (0.01 sec)
+Records: 3 Duplicates: 0 Warnings: 0
+
+mysql> insert into user_id_score values(1001, 99),(1002, 96),(1003, 98);
+Query OK, 3 rows affected (0.01 sec)
+Records: 3 Duplicates: 0 Warnings: 0
+
+mysql> select * from user_id_name;
++------+--------+
+| id | name |
++------+--------+
+| 1001 | lily |
+| 1002 | tom |
+| 1003 | alan |
++------+--------+
+3 rows in set (0.07 sec)
+
+mysql> select * from user_id_score;
++------+-------+
+| id   | score |
++------+-------+
+| 1001 |    99 |
+| 1002 |    96 |
+| 1003 |    98 |
++------+-------+
+3 rows in set (0.07 sec)
+```
+
+### 创建 StarRocks Load 表
+- 单表写入:在 StarRocks `cdc`数据库中创建表`cdc_starrocks_sink`。命令如下:
+```sql
+[root@fe001 ~]# mysql -u username -h localhost -P 9030 -p password
+mysql> use cdc;
+Reading table information for completion of table and column names
+You can turn off this feature to get a quicker startup with -A
+Database changed
+
+mysql> CREATE TABLE `cdc_starrocks_sink` (
+ `id` int(11) NOT NULL COMMENT "用户id",
+ `name` varchar(50) NOT NULL COMMENT "昵称",
+ `dr` tinyint(4) NULL COMMENT "逻辑删除"
+ ) ENGINE=OLAP
+ PRIMARY KEY(`id`)
+ COMMENT "OLAP"
+ DISTRIBUTED BY HASH(`id`) BUCKETS 1
+ PROPERTIES (
+ "replication_allocation" = "tag.location.default: 1"
+ );
+Query OK, 0 rows affected (0.06 sec)
+```
+- 多表写入:在 StarRocks
`user_db`数据库中创建表`starrocks_user_id_name`、`starrocks_user_id_score`。命令如下:
+```sql
+[root@fe001 ~]# mysql -u username -h localhost -P 9030 -p password
+mysql> use user_db;
+Reading table information for completion of table and column names
+You can turn off this feature to get a quicker startup with -A
+Database changed
+
+mysql> CREATE TABLE `starrocks_user_id_name` (
+ `id` int(11) NOT NULL COMMENT "用户id",
+ `name` varchar(50) NOT NULL COMMENT "昵称"
+ ) ENGINE=OLAP
+ PRIMARY KEY(`id`)
+ COMMENT "OLAP"
+ DISTRIBUTED BY HASH(`id`) BUCKETS 1
+ PROPERTIES (
+ "replication_allocation" = "tag.location.default: 1"
+ );
+Query OK, 0 rows affected (0.06 sec)
+
+mysql> CREATE TABLE `starrocks_user_id_score` (
+ `id` int(11) NOT NULL COMMENT "用户id",
+ `score` double default 0
+ ) ENGINE=OLAP
+ PRIMARY KEY(`id`)
+ COMMENT "OLAP"
+ DISTRIBUTED BY HASH(`id`) BUCKETS 1
+ PROPERTIES (
+ "replication_allocation" = "tag.location.default: 1"
+ );
+Query OK, 0 rows affected (0.06 sec)
+```
+
+## 如何创建 StarRocks Load 节点
+
+### SQL API 用法
+- 单表写入: StarRocks 单表写入
+```sql
+[root@tasknode001 flink-1.13.5]# ./bin/sql-client.sh -l
./opt/connectors/mysql-cdc-inlong/ -l ./opt/connectors/starrocks/
+Flink SQL> SET 'execution.checkpointing.interval' = '3s';
+[INFO] Session property has been set.
+
+Flink SQL> SET 'table.dynamic-table-options.enabled' = 'true';
+[INFO] Session property has been set.
+
+Flink SQL> CREATE TABLE cdc_mysql_source (
+ > id int
+ > ,name VARCHAR
+ > ,dr TINYINT
+ > ,PRIMARY KEY (id) NOT ENFORCED
+ > ) WITH (
+ > 'connector' = 'mysql-cdc-inlong',
+ > 'hostname' = 'localhost',
+ > 'port' = '3306',
+ > 'username' = 'root',
+ > 'password' = '123456',
+ > 'database-name' = 'cdc',
+ > 'table-name' = 'cdc_mysql_source'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> CREATE TABLE cdc_starrocks_sink (
+ > id INT,
+ > name STRING,
+ > dr TINYINT
+ > ) WITH (
+ > 'connector' = 'starrocks-inlong',
+ > 'fenodes' = 'localhost:8030',
+ > 'table.identifier' = 'cdc.cdc_starrocks_sink',
+ > 'username' = 'username',
+ > 'password' = 'password',
+ > 'sink.properties.format' = 'json'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> insert into cdc_starrocks_sink select * from cdc_mysql_source /*+
OPTIONS('server-id'='5402') */;
+[INFO] Submitting SQL update statement to the cluster...
+[INFO] SQL update statement has been successfully submitted to the cluster:
+Job ID: 5f89691571d7b3f3ca446589e3d0c3d3
+```
+
+- 多表写入: StarRocks 多表写入
+```sql
+./bin/sql-client.sh -l ./opt/connectors/mysql-cdc-inlong/ -l
./opt/connectors/starrocks/
+Flink SQL> SET 'execution.checkpointing.interval' = '3s';
+[INFO] Session property has been set.
+
+Flink SQL> SET 'table.dynamic-table-options.enabled' = 'true';
+[INFO] Session property has been set.
+
+Flink SQL> CREATE TABLE cdc_mysql_source (
+ > id int
+ > ,name VARCHAR
+ > ,dr TINYINT
+ > ,PRIMARY KEY (id) NOT ENFORCED
+ > ) WITH (
+ > 'connector' = 'mysql-cdc-inlong',
+ > 'hostname' = 'localhost',
+ > 'port' = '3306',
+ > 'username' = 'root',
+ > 'password' = '123456',
+ > 'database-name' = 'test',
+ > 'table-name' = 'cdc_mysql_source'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> CREATE TABLE cdc_starrocks_sink (
+ > id INT,
+ > name STRING,
+ > dr TINYINT
+ > ) WITH (
+ > 'connector' = 'starrocks-inlong',
+ > 'fenodes' = 'localhost:8030',
+ > 'username' = 'username',
+ > 'password' = 'password',
+ > 'sink.multiple.enable' = 'true',
+ > 'sink.multiple.format' = 'canal-json',
+ > 'sink.multiple.database-pattern' = '${database}',
+ > 'sink.multiple.table-pattern' = 'starrocks_${table}'
+ > );
+[INFO] Execute statement succeed.
+
+Flink SQL> insert into cdc_starrocks_sink select * from cdc_mysql_source /*+
OPTIONS('server-id'='5402') */;
+[INFO] Submitting SQL update statement to the cluster...
+[INFO] SQL update statement has been successfully submitted to the cluster:
+Job ID: 30feaa0ede92h6b6e25ea0cfda26df5e
+```
+
+### InLong Dashboard 用法
+
+TODO: 将在未来支持此功能。
+
+### InLong Manager Client 用法
+
+TODO: 将在未来支持此功能。
+
+## StarRocks Load 节点参数
+
+| 参数 | 是否必选 | 默认值 | 数据类型 | 描述 |
+| ---- | ------- | ------ | ------- | ---- |
+| connector | 必选 | 无 | string | 指定使用哪个 connector,合法值为 `starrocks-inlong` |
+| jdbc-url | 必选 | 无 | string | 用于在 StarRocks 中执行查询 |
+| load-url | 必选 | 无 | string | 格式为 fe_ip:http_port;fe_ip:http_port,用分号(;)隔开。用于向 StarRocks 批量写入数据。 |
+| database-name | 必选 | 无 | string | StarRocks 的数据库名 |
+| table-name | 必选 | 无 | string | StarRocks 的表名 |
+| username | 必选 | 无 | string | StarRocks 连接的用户名 |
+| password | 必选 | 无 | string | StarRocks 连接的口令 |
+| sink.semantic | 可选 | at-least-once | string | 可选值为 at-least-once 或 exactly-once(仅在 checkpoint 时刷新数据,`sink.buffer-flush.*` 等参数将不再生效) |
+| sink.version | 可选 | AUTO | string | exactly-once 语义的实现版本,仅 connector 1.2.4 及以上版本可用。如果填 V2,则使用 StarRocks 的 Stream Load 事务接口,需要 2.4 及以上的 StarRocks 版本。如果填 V1,则使用 Stream Load 非事务接口。如果填 AUTO,则 connector 根据 StarRocks 是否支持事务特性自动选择 Stream Load 的事务接口。 |
+| sink.buffer-flush.max-bytes | 可选 | 94371840(90M) | string | 批量刷新缓存数据的大小阈值,范围:[64MB, 10GB] |
+| sink.buffer-flush.max-rows | 可选 | 500000 | string | 批量刷新缓存数据的行数阈值,范围:[64,000, 5,000,000] |
+| sink.buffer-flush.interval-ms | 可选 | 300000 | string | 批量刷新缓存数据的时间间隔,范围:[1000ms, 3600000ms] |
+| sink.max-retries | 可选 | 3 | string | Stream Load 请求的最大重试次数,范围:[0, 10] |
+| sink.connect.timeout-ms | 可选 | 1000 | string | 连接到指定 load-url 的超时时间,单位:毫秒,范围:[100, 60000] |
+| sink.properties.format | 可选 | CSV | string | 导入到 StarRocks 的数据文件格式,可选值为 CSV 和 JSON,默认为 CSV |
+| sink.properties.* | 可选 | 无 | string | Stream Load 的属性,例如:'sink.properties.columns' = 'k1, k2, k3'。从 StarRocks 2.4 开始,flink-connector-starrocks 支持 Primary Key 模式下的数据部分更新。 |
+| sink.properties.ignore_json_size | 可选 | false | string | 忽略 JSON 数据的批量大小限制(100MB) |
+| sink.multiple.enable | 可选 | false | boolean | 是否开启多表(整库)写入特性,默认为 `false`。当 `sink.multiple.enable` 为 `true` 时,还需要正确设置 `sink.multiple.format`、`sink.multiple.database-pattern` 和 `sink.multiple.table-pattern` |
+| sink.multiple.format | 可选 | 无 | string | 多表(整库)写入的数据格式,它表示 connector 之间流转的原始二进制数据的实际格式,目前支持 `canal-json` 和 `debezium-json`。可以查看 [kafka -- Dynamic Topic Extraction](https://github.com/apache/inlong-website/blob/master/docs/data_node/load_node/kafka.md) 获取更多信息。 |
+| sink.multiple.database-pattern | 可选 | 无 | string | 从原始二进制数据中提取数据库名,仅在多表(整库)同步场景中使用。 |
+| sink.multiple.table-pattern | 可选 | 无 | string | 从原始二进制数据中提取表名,仅在多表(整库)同步场景中使用。 |
+
+## 数据类型映射
+
+| Flink类型 | StarRocks类型 |
+| ---------- | -------------- |
+| BOOLEAN | BOOLEAN |
+| TINYINT | TINYINT |
+| SMALLINT | SMALLINT |
+| INTEGER | INTEGER |
+| BIGINT | BIGINT |
+| FLOAT | FLOAT |
+| DOUBLE | DOUBLE |
+| DECIMAL | DECIMAL |
+| BINARY | INT |
+| CHAR | JSON / STRING |
+| VARCHAR | JSON / STRING |
+| STRING | JSON / STRING |
+| DATE | DATE |
+| TIMESTAMP_WITHOUT_TIME_ZONE(N) | DATETIME |
+| TIMESTAMP_WITH_LOCAL_TIME_ZONE(N) | DATETIME |
+| ARRAY<T> | ARRAY<T> |
+| MAP<KT,VT> | JSON / JSON STRING |
+| ROW<arg T...> | JSON / JSON STRING |
+
+查看
[flink-connector-starrocks](https://github.com/StarRocks/starrocks-connector-for-apache-flink/blob/main/README.md)
获取更多信息。
\ No newline at end of file
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
index fef9824181..d503fcdf94 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/introduction.md
@@ -88,6 +88,8 @@ Apache InLong 服务于数据采集到落地的整个生命周期,按数据的
| | Greenplum | 4.x, 5.x, 6.x |
Lightweight, Standard |
| | Elasticsearch | 6.x, 7.x |
Lightweight, Standard |
| | SQLServer | 2012, 2014, 2016, 2017, 2019 |
Lightweight, Standard |
+| | Doris | >= 0.13 |
Lightweight, Standard |
+| | StarRocks | >= 2.0 |
Lightweight, Standard |
| | HDFS | 2.x, 3.x |
Lightweight, Standard |