This is an automated email from the ASF dual-hosted git repository.
luzhijing pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/doris-website.git
The following commit(s) were added to refs/heads/master by this push:
new 5a08e34ed9 [docs](3.0) Update typo and sidebar of 3.0 version docs (#908)
5a08e34ed9 is described below
commit 5a08e34ed91942b0908063d05fa879e83f85144c
Author: KassieZ <[email protected]>
AuthorDate: Wed Jul 24 18:48:16 2024 +0800
[docs](3.0) Update typo and sidebar of 3.0 version docs (#908)
---
blog/migrate-lakehouse-from-bigquery-to-doris.md | 2 +-
docs/data-operate/export/export-overview.md | 41 +++++++++++-----------
docs/practical-guide/log-storage-analysis.md | 2 +-
.../current/data-operate/export/export-overview.md | 5 +--
.../practical-guide/log-storage-analysis.md | 2 +-
.../practical-guide/log-storage-analysis.md | 2 +-
.../data-operate/export/export-overview.md | 5 +--
.../practical-guide/log-storage-analysis.md | 2 +-
.../data-operate/export/export-overview.md | 5 +--
.../practical-guide/log-storage-analysis.md | 2 +-
.../practical-guide/log-storage-analysis.md | 2 +-
.../data-operate/export/export-overview.md | 40 +++++++++++----------
.../practical-guide/log-storage-analysis.md | 2 +-
.../data-operate/export/export-overview.md | 41 +++++++++++-----------
.../practical-guide/log-storage-analysis.md | 2 +-
versioned_sidebars/version-3.0-sidebars.json | 7 ----
16 files changed, 81 insertions(+), 81 deletions(-)
diff --git a/blog/migrate-lakehouse-from-bigquery-to-doris.md b/blog/migrate-lakehouse-from-bigquery-to-doris.md
index 671f85c2eb..5189770f95 100644
--- a/blog/migrate-lakehouse-from-bigquery-to-doris.md
+++ b/blog/migrate-lakehouse-from-bigquery-to-doris.md
@@ -170,7 +170,7 @@ The implementation was carried out by 1 Data Engineer, 1 Software Engineer, and
 - It supports seamless data import from Apache Iceberg. The Machine Learning and data mining team can directly import data without needing to create a separate pipeline like with BigQuery.
-- It supports vector data storage for AI chatbots. Data can be directly imported from the File Store Service (S3) instead of having to push it to Redis as before.
+- It supports [vector data storage](https://python.langchain.com/v0.2/docs/integrations/vectorstores/apache_doris/) for AI chatbots. Data can be directly imported from the File Store Service (S3) instead of having to push it to Redis as before.
 - It provides efficient data aggregation through the Rollup mechanism.
diff --git a/docs/data-operate/export/export-overview.md b/docs/data-operate/export/export-overview.md
index fc90d2b6e7..29a45cf5c1 100644
--- a/docs/data-operate/export/export-overview.md
+++ b/docs/data-operate/export/export-overview.md
@@ -86,26 +86,27 @@ Parquet and ORC file formats have their own data types. Doris's export function
The following table shows the mapping between Doris data types and Parquet, ORC file format data types:
1. Doris export to ORC file format data type mapping table:
- | Doris Type | Orc Type |
- | ----- | ----- |
- | boolean | boolean |
- | tinyint | tinyint |
- | smallint | smallint |
- | int | int |
- | bigint | bigint |
- | largeInt | string |
- | date | string |
- | datev2 | string |
- | datetime | string |
- | datetimev2 | timestamp |
- | float | float |
- | double | double |
- | char / varchar / string | string |
- | decimal | decimal |
- | struct | struct |
- | map | map |
- | array | array |
- |json| Not support|
+
+ |Doris Type|Orc Type|
+ | -------- | ------- |
+ |boolean|boolean|
+ |tinyint|tinyint|
+ |smallint|smallint|
+ |int|int|
+ |bigint|bigint|
+ |largeInt|string|
+ |date|string|
+ |datev2|string|
+ |datetime|string|
+ |datetimev2|timestamp|
+ |float|float|
+ |double|double|
+ |char / varchar / string|string|
+ |decimal|decimal|
+ |struct|struct|
+ |map|map|
+ |array|array|
+ |json| Not supported|
2. When Doris exports to Parquet file format, it first converts Doris in-memory data to Arrow in-memory data format, then writes out to Parquet file format. The mapping relationship between Doris data types and Arrow data types is:
diff --git a/docs/practical-guide/log-storage-analysis.md b/docs/practical-guide/log-storage-analysis.md
index d1c15772e8..99d251de70 100644
--- a/docs/practical-guide/log-storage-analysis.md
+++ b/docs/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
| Configuration fields to be optimized | Description |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
| `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
| `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
| `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md
index 5fa29031fe..4a5441a640 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md
@@ -86,8 +86,9 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
以下是 Apache Doris 数据类型和 Parquet、ORC 文件格式的数据类型映射关系表:
1. Doris 导出到 Orc 文件格式的数据类型映射表:
+
|Doris Type|Orc Type|
- | ----- | ----- |
+ | -------- | ------- |
|boolean|boolean|
|tinyint|tinyint|
|smallint|smallint|
@@ -107,7 +108,7 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
|array|array|
|json|不支持|
- <br/>
+
2. Apache Doris 导出到 Parquet 文件格式时,会先将 Apache Doris 内存数据转换为 Arrow 内存数据格式,然后由 Arrow 写出到 Parquet 文件格式。Apache Doris 数据类型到 Arrow 数据类的映射关系为:
|Doris Type|Arrow Type|
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md
index c079a16380..a6c5dd1a1c 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
| 需调整参数 | 说明 |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
| `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
| `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
| `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md
index 0430c5eae8..1a358d8a3b 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
| 需调整参数 | 说明 |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
| `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
| `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
| `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md
index 76552b3edd..3e5e736f9b 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/data-operate/export/export-overview.md
@@ -86,8 +86,9 @@ Parquet、ORC 文件格式拥有自己的数据类型。Doris 的导出功能能
以下是 Doris 数据类型和 Parquet、ORC 文件格式的数据类型映射关系表:
1. Doris 导出到 Orc 文件格式的数据类型映射表:
+
|Doris Type|Orc Type|
- | ----- | ----- |
+ | -------- | ------- |
|boolean|boolean|
|tinyint|tinyint|
|smallint|smallint|
@@ -105,8 +106,8 @@ Parquet、ORC 文件格式拥有自己的数据类型。Doris 的导出功能能
|struct|struct|
|map|map|
|array|array|
+ |json|不支持|
- <br/>
2. Doris 导出到 Parquet 文件格式时,会先将 Doris 内存数据转换为 arrow 内存数据格式,然后由 arrow 写出到 parquet 文件格式。Doris 数据类型到 arrow 数据类的映射关系为:
|Doris Type|Arrow Type|
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md
index 424fccbb89..50f3a02571 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
| 需调整参数 | 说明 |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
| `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
| `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
| `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md
index 5fa29031fe..4a5441a640 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/data-operate/export/export-overview.md
@@ -86,8 +86,9 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
以下是 Apache Doris 数据类型和 Parquet、ORC 文件格式的数据类型映射关系表:
1. Doris 导出到 Orc 文件格式的数据类型映射表:
+
|Doris Type|Orc Type|
- | ----- | ----- |
+ | -------- | ------- |
|boolean|boolean|
|tinyint|tinyint|
|smallint|smallint|
@@ -107,7 +108,7 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出
|array|array|
|json|不支持|
- <br/>
+
2. Apache Doris 导出到 Parquet 文件格式时,会先将 Apache Doris 内存数据转换为 Arrow 内存数据格式,然后由 Arrow 写出到 Parquet 文件格式。Apache Doris 数据类型到 Arrow 数据类的映射关系为:
|Doris Type|Arrow Type|
diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md
index c079a16380..a6c5dd1a1c 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.0/practical-guide/log-storage-analysis.md
@@ -170,7 +170,7 @@ Apache Doris 对 Flexible Schema 的日志数据提供了几个方面的支持
| 需调整参数 | 说明 |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | 高并发导入运行事务数较多,需调高参数。 |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | 高频导入事务标签内存占用多,保留时间调短。 |
| `enable_round_robin_create_tablet = true` | 创建 Tablet 时,采用 Round Robin 策略,尽量均匀。 |
| `tablet_rebalancer_type = partition` | 均衡 Tablet 时,采用每个分区内尽量均匀的策略。 |
| `enable_single_replica_load = true` | 开启单副本导入,多个副本只需构建一次索引,减少 CPU 消耗。 |
diff --git a/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md b/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md
index 8d096b974c..23b13a1006 100644
--- a/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md
+++ b/versioned_docs/version-2.0/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
| Configuration fields to be optimized | Description |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
| `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
| `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
| `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/versioned_docs/version-2.1/data-operate/export/export-overview.md b/versioned_docs/version-2.1/data-operate/export/export-overview.md
index 3a3a2abf41..5a9897acc2 100644
--- a/versioned_docs/version-2.1/data-operate/export/export-overview.md
+++ b/versioned_docs/version-2.1/data-operate/export/export-overview.md
@@ -86,25 +86,27 @@ Parquet and ORC file formats have their own data types. Doris's export function
The following table shows the mapping between Doris data types and Parquet, ORC file format data types:
1. Doris export to ORC file format data type mapping table:
- | Doris Type | Orc Type |
- | ----- | ----- |
- | boolean | boolean |
- | tinyint | tinyint |
- | smallint | smallint |
- | int | int |
- | bigint | bigint |
- | largeInt | string |
- | date | string |
- | datev2 | string |
- | datetime | string |
- | datetimev2 | timestamp |
- | float | float |
- | double | double |
- | char / varchar / string | string |
- | decimal | decimal |
- | struct | struct |
- | map | map |
- | array | array |
+
+ |Doris Type|Orc Type|
+ | -------- | ------- |
+ |boolean|boolean|
+ |tinyint|tinyint|
+ |smallint|smallint|
+ |int|int|
+ |bigint|bigint|
+ |largeInt|string|
+ |date|string|
+ |datev2|string|
+ |datetime|string|
+ |datetimev2|timestamp|
+ |float|float|
+ |double|double|
+ |char / varchar / string|string|
+ |decimal|decimal|
+ |struct|struct|
+ |map|map|
+ |array|array|
+ |json| Not supported|
2. When Doris exports to Parquet file format, it first converts Doris in-memory data to Arrow in-memory data format, then writes out to Parquet file format. The mapping relationship between Doris data types and Arrow data types is:
diff --git a/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md b/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md
index a834ee462f..61657d0398 100644
--- a/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md
+++ b/versioned_docs/version-2.1/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
| Configuration fields to be optimized | Description |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
| `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
| `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
| `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/versioned_docs/version-3.0/data-operate/export/export-overview.md b/versioned_docs/version-3.0/data-operate/export/export-overview.md
index fc90d2b6e7..75b7247ed9 100644
--- a/versioned_docs/version-3.0/data-operate/export/export-overview.md
+++ b/versioned_docs/version-3.0/data-operate/export/export-overview.md
@@ -86,26 +86,27 @@ Parquet and ORC file formats have their own data types. Doris's export function
The following table shows the mapping between Doris data types and Parquet, ORC file format data types:
1. Doris export to ORC file format data type mapping table:
- | Doris Type | Orc Type |
- | ----- | ----- |
- | boolean | boolean |
- | tinyint | tinyint |
- | smallint | smallint |
- | int | int |
- | bigint | bigint |
- | largeInt | string |
- | date | string |
- | datev2 | string |
- | datetime | string |
- | datetimev2 | timestamp |
- | float | float |
- | double | double |
- | char / varchar / string | string |
- | decimal | decimal |
- | struct | struct |
- | map | map |
- | array | array |
- |json| Not support|
+
+ |Doris Type|Orc Type|
+ | -------- | ------- |
+ |boolean|boolean|
+ |tinyint|tinyint|
+ |smallint|smallint|
+ |int|int|
+ |bigint|bigint|
+ |largeInt|string|
+ |date|string|
+ |datev2|string|
+ |datetime|string|
+ |datetimev2|timestamp|
+ |float|float|
+ |double|double|
+ |char / varchar / string|string|
+ |decimal|decimal|
+ |struct|struct|
+ |map|map|
+ |array|array|
+ |json| Not supported|
2. When Doris exports to Parquet file format, it first converts Doris in-memory data to Arrow in-memory data format, then writes out to Parquet file format. The mapping relationship between Doris data types and Arrow data types is:
diff --git a/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md b/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md
index d1c15772e8..99d251de70 100644
--- a/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md
+++ b/versioned_docs/version-3.0/practical-guide/log-storage-analysis.md
@@ -194,7 +194,7 @@ You can find FE configuration fields in `fe/conf/fe.conf`. Refer to the followin
| Configuration fields to be optimized | Description |
| :----------------------------------------------------------- | :----------------------------------------------------------- |
| `max_running_txn_num_per_db = 10000` | Increase the parameter value to adapt to high-concurrency import transactions. |
-| `streaming_lable_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
+| `streaming_label_keep_max_second = 3600``label_keep_max_second = 7200` | Increase the retention time to handle high-frequency import transactions with high memory usage. |
| `enable_round_robin_create_tablet = true` | When creating Tablets, use a Round Robin strategy to distribute evenly. |
| `tablet_rebalancer_type = partition` | When balancing Tablets, use a strategy to evenly distribute within each partition. |
| `enable_single_replica_load = true` | Enable single-replica import, where multiple replicas only need to build an index once to reduce CPU consumption. |
diff --git a/versioned_sidebars/version-3.0-sidebars.json b/versioned_sidebars/version-3.0-sidebars.json
index d5fa282479..625bc35d8d 100644
--- a/versioned_sidebars/version-3.0-sidebars.json
+++ b/versioned_sidebars/version-3.0-sidebars.json
@@ -1552,13 +1552,6 @@
"faq/sql-faq",
"faq/lakehouse-faq"
]
- },
- {
- "type": "category",
- "label": "Release notes",
- "items": [
- "releasenotes/release-3.0.0"
- ]
}
]
}
\ No newline at end of file