(doris-website) branch master updated: [faq](lakehouse) add jdbc be java oom faq (#3014)

zykkk Wed, 29 Oct 2025 20:28:10 -0700

This is an automated email from the ASF dual-hosted git repository.

zykkk pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/doris-website.git



The following commit(s) were added to refs/heads/master by this push:
     new 85c57ec380a [faq](lakehouse) add jdbc be java oom faq (#3014)
85c57ec380a is described below

commit 85c57ec380ab33a63f9505e93502d09726382f83
Author: zy-kkk <[email protected]>
AuthorDate: Thu Oct 30 11:26:18 2025 +0800

    [faq](lakehouse) add jdbc be java oom faq (#3014)
---
 docs/faq/lakehouse-faq.md                                        | 9 +++++++++
 .../docusaurus-plugin-content-docs/current/faq/lakehouse-faq.md  | 9 +++++++++
 .../version-1.2/faq/lakehouse-faq.md                             | 9 +++++++++
 .../version-2.0/faq/lakehouse-faq.md                             | 9 +++++++++
 .../version-2.1/faq/lakehouse-faq.md                             | 9 +++++++++
 .../version-3.x/faq/lakehouse-faq.md                             | 9 +++++++++
 versioned_docs/version-1.2/faq/lakehouse-faq.md                  | 9 +++++++++
 versioned_docs/version-2.0/faq/lakehouse-faq.md                  | 9 +++++++++
 versioned_docs/version-2.1/faq/lakehouse-faq.md                  | 9 +++++++++
 versioned_docs/version-3.x/faq/lakehouse-faq.md                  | 9 +++++++++
 10 files changed, 90 insertions(+)

diff --git a/docs/faq/lakehouse-faq.md b/docs/faq/lakehouse-faq.md
index 36da4c65204..0f23b873347 100644
--- a/docs/faq/lakehouse-faq.md
+++ b/docs/faq/lakehouse-faq.md
@@ -105,6 +105,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. When synchronizing MySQL data to Doris using JDBC Catalog, date data 
synchronization error occurs. Verify if the MySQL version matches the MySQL 
driver package, for example, MySQL 8 and above require the driver 
com.mysql.cj.jdbc.Driver.
 
+5. When a single field is too large, a Java memory OOM occurs on the BE side 
during a query.
+
+   When Jdbc Scanner reads data through JDBC, the session variable 
`batch_size` determines the number of rows processed in the JVM per batch. If a 
single field is too large, it may cause `field_size * batch_size` (approximate 
value, considering JVM static memory and data copy overhead) to exceed the JVM 
memory limit, resulting in OOM.
+
+   Solutions:
+
+   - Reduce the `batch_size` value by executing `set batch_size = 512;`. The 
default value is 4064.
+   - Increase the BE JVM memory by modifying the `-Xmx` parameter in 
`JAVA_OPTS`. For example: `-Xmx8g`.
+
 ## Hive Catalog
 
 1. Accessing Iceberg or Hive table through Hive Catalog reports an error: 
`failed to get schema` or `Storage schema reading not supported`
diff --git 
a/i18n/zh-CN/docusaurus-plugin-content-docs/current/faq/lakehouse-faq.md 
b/i18n/zh-CN/docusaurus-plugin-content-docs/current/faq/lakehouse-faq.md
index 4147a3f93a4..95f15f41892 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/faq/lakehouse-faq.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/faq/lakehouse-faq.md
@@ -106,6 +106,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
    请在 `jdbc_url` 中添加 `useSSL=true`
 
 4. 使用 JDBC Catalog 将 MySQL 数据同步到 Doris 中，日期数据同步错误。需要校验下 MySQL 的版本是否与 MySQL 
的驱动包是否对应，比如 MySQL8 以上需要使用驱动 com.mysql.cj.jdbc.Driver。
+   
+5. 单个字段过大，查询时 BE 侧 Java 内存 OOM
+
+   Jdbc Scanner 在通过 jdbc 读取时，由 session variable `batch_size` 决定每批次数据在 JVM 
中处理的数量，如果单个字段过大，导致 `字段大小 * batch_size`(近似值，由于 JVM 中 static 以及数据 copy 占用)超过 JVM 
内存限制，就会出现 OOM。
+
+   解决方法：
+
+   - 减小 `batch_size` 的值，可以通过 `set batch_size = 512;` 来调整，默认值为 4064。
+   - 增大 BE 的 JVM 内存，通过修改 `JAVA_OPTS` 参数中的 `-Xmx` 来调整 JVM 最大堆内存大小。例如：`"-Xmx8g`。
 
 ## Hive Catalog
 
diff --git 
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-1.2/faq/lakehouse-faq.md 
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-1.2/faq/lakehouse-faq.md
index d81b04aafea..034b7b54001 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-1.2/faq/lakehouse-faq.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-1.2/faq/lakehouse-faq.md
@@ -107,6 +107,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. 使用 JDBC Catalog 将 MySQL 数据同步到 Doris 中，日期数据同步错误。需要校验下 MySQL 的版本是否与 MySQL 
的驱动包是否对应，比如 MySQL8 以上需要使用驱动 com.mysql.cj.jdbc.Driver。
 
+5. 单个字段过大，查询时 BE 侧 Java 内存 OOM
+
+   Jdbc Scanner 在通过 jdbc 读取时，由 session variable `batch_size` 决定每批次数据在 JVM 
中处理的数量，如果单个字段过大，导致 `字段大小 * batch_size`(近似值，由于 JVM 中 static 以及数据 copy 占用)超过 JVM 
内存限制，就会出现 OOM。
+
+   解决方法：
+
+   - 减小 `batch_size` 的值，可以通过 `set batch_size = 512;` 来调整，默认值为 4064。
+   - 增大 BE 的 JVM 内存，通过修改 `JAVA_OPTS` 参数中的 `-Xmx` 来调整 JVM 最大堆内存大小。例如：`"-Xmx8g`。
+
 ## Hive Catalog
 
 1. 通过 Hive Metastore 访问 Iceberg 表报错：`failed to get schema` 或 `Storage schema 
reading not supported`
diff --git 
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/faq/lakehouse-faq.md 
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/faq/lakehouse-faq.md
index d81b04aafea..034b7b54001 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/faq/lakehouse-faq.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.0/faq/lakehouse-faq.md
@@ -107,6 +107,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. 使用 JDBC Catalog 将 MySQL 数据同步到 Doris 中，日期数据同步错误。需要校验下 MySQL 的版本是否与 MySQL 
的驱动包是否对应，比如 MySQL8 以上需要使用驱动 com.mysql.cj.jdbc.Driver。
 
+5. 单个字段过大，查询时 BE 侧 Java 内存 OOM
+
+   Jdbc Scanner 在通过 jdbc 读取时，由 session variable `batch_size` 决定每批次数据在 JVM 
中处理的数量，如果单个字段过大，导致 `字段大小 * batch_size`(近似值，由于 JVM 中 static 以及数据 copy 占用)超过 JVM 
内存限制，就会出现 OOM。
+
+   解决方法：
+
+   - 减小 `batch_size` 的值，可以通过 `set batch_size = 512;` 来调整，默认值为 4064。
+   - 增大 BE 的 JVM 内存，通过修改 `JAVA_OPTS` 参数中的 `-Xmx` 来调整 JVM 最大堆内存大小。例如：`"-Xmx8g`。
+
 ## Hive Catalog
 
 1. 通过 Hive Metastore 访问 Iceberg 表报错：`failed to get schema` 或 `Storage schema 
reading not supported`
diff --git 
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/faq/lakehouse-faq.md 
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/faq/lakehouse-faq.md
index ac1fa0ad9dd..9890a381979 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/faq/lakehouse-faq.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-2.1/faq/lakehouse-faq.md
@@ -107,6 +107,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. 使用 JDBC Catalog 将 MySQL 数据同步到 Doris 中，日期数据同步错误。需要校验下 MySQL 的版本是否与 MySQL 
的驱动包是否对应，比如 MySQL8 以上需要使用驱动 com.mysql.cj.jdbc.Driver。
 
+5. 单个字段过大，查询时 BE 侧 Java 内存 OOM
+
+   Jdbc Scanner 在通过 jdbc 读取时，由 session variable `batch_size` 决定每批次数据在 JVM 
中处理的数量，如果单个字段过大，导致 `字段大小 * batch_size`(近似值，由于 JVM 中 static 以及数据 copy 占用)超过 JVM 
内存限制，就会出现 OOM。
+
+   解决方法：
+
+   - 减小 `batch_size` 的值，可以通过 `set batch_size = 512;` 来调整，默认值为 4064。
+   - 增大 BE 的 JVM 内存，通过修改 `JAVA_OPTS` 参数中的 `-Xmx` 来调整 JVM 最大堆内存大小。例如：`"-Xmx8g`。
+
 ## Hive Catalog
 
 1. 通过 Hive Catalog 访问 Iceberg 或 Hive 表报错：`failed to get schema` 或 `Storage 
schema reading not supported`
diff --git 
a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/faq/lakehouse-faq.md 
b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/faq/lakehouse-faq.md
index ac1fa0ad9dd..9890a381979 100644
--- a/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/faq/lakehouse-faq.md
+++ b/i18n/zh-CN/docusaurus-plugin-content-docs/version-3.x/faq/lakehouse-faq.md
@@ -107,6 +107,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. 使用 JDBC Catalog 将 MySQL 数据同步到 Doris 中，日期数据同步错误。需要校验下 MySQL 的版本是否与 MySQL 
的驱动包是否对应，比如 MySQL8 以上需要使用驱动 com.mysql.cj.jdbc.Driver。
 
+5. 单个字段过大，查询时 BE 侧 Java 内存 OOM
+
+   Jdbc Scanner 在通过 jdbc 读取时，由 session variable `batch_size` 决定每批次数据在 JVM 
中处理的数量，如果单个字段过大，导致 `字段大小 * batch_size`(近似值，由于 JVM 中 static 以及数据 copy 占用)超过 JVM 
内存限制，就会出现 OOM。
+
+   解决方法：
+
+   - 减小 `batch_size` 的值，可以通过 `set batch_size = 512;` 来调整，默认值为 4064。
+   - 增大 BE 的 JVM 内存，通过修改 `JAVA_OPTS` 参数中的 `-Xmx` 来调整 JVM 最大堆内存大小。例如：`"-Xmx8g`。
+
 ## Hive Catalog
 
 1. 通过 Hive Catalog 访问 Iceberg 或 Hive 表报错：`failed to get schema` 或 `Storage 
schema reading not supported`
diff --git a/versioned_docs/version-1.2/faq/lakehouse-faq.md 
b/versioned_docs/version-1.2/faq/lakehouse-faq.md
index b52e4a31154..96e8e0c5304 100644
--- a/versioned_docs/version-1.2/faq/lakehouse-faq.md
+++ b/versioned_docs/version-1.2/faq/lakehouse-faq.md
@@ -105,6 +105,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. When synchronizing MySQL data to Doris using JDBC Catalog, date data 
synchronization error occurs. Verify if the MySQL version matches the MySQL 
driver package, for example, MySQL 8 and above require the driver 
com.mysql.cj.jdbc.Driver.
 
+5. When a single field is too large, a Java memory OOM occurs on the BE side 
during a query.
+
+   When Jdbc Scanner reads data through JDBC, the session variable 
`batch_size` determines the number of rows processed in the JVM per batch. If a 
single field is too large, it may cause `field_size * batch_size` (approximate 
value, considering JVM static memory and data copy overhead) to exceed the JVM 
memory limit, resulting in OOM.
+
+   Solutions:
+
+   - Reduce the `batch_size` value by executing `set batch_size = 512;`. The 
default value is 4064.
+   - Increase the BE JVM memory by modifying the `-Xmx` parameter in 
`JAVA_OPTS`. For example: `-Xmx8g`.
+
 ## Hive Catalog
 
 1. Error accessing Iceberg table via Hive Metastore: `failed to get schema` or 
`Storage schema reading not supported`
diff --git a/versioned_docs/version-2.0/faq/lakehouse-faq.md 
b/versioned_docs/version-2.0/faq/lakehouse-faq.md
index b52e4a31154..96e8e0c5304 100644
--- a/versioned_docs/version-2.0/faq/lakehouse-faq.md
+++ b/versioned_docs/version-2.0/faq/lakehouse-faq.md
@@ -105,6 +105,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. When synchronizing MySQL data to Doris using JDBC Catalog, date data 
synchronization error occurs. Verify if the MySQL version matches the MySQL 
driver package, for example, MySQL 8 and above require the driver 
com.mysql.cj.jdbc.Driver.
 
+5. When a single field is too large, a Java memory OOM occurs on the BE side 
during a query.
+
+   When Jdbc Scanner reads data through JDBC, the session variable 
`batch_size` determines the number of rows processed in the JVM per batch. If a 
single field is too large, it may cause `field_size * batch_size` (approximate 
value, considering JVM static memory and data copy overhead) to exceed the JVM 
memory limit, resulting in OOM.
+
+   Solutions:
+
+   - Reduce the `batch_size` value by executing `set batch_size = 512;`. The 
default value is 4064.
+   - Increase the BE JVM memory by modifying the `-Xmx` parameter in 
`JAVA_OPTS`. For example: `-Xmx8g`.
+
 ## Hive Catalog
 
 1. Error accessing Iceberg table via Hive Metastore: `failed to get schema` or 
`Storage schema reading not supported`
diff --git a/versioned_docs/version-2.1/faq/lakehouse-faq.md 
b/versioned_docs/version-2.1/faq/lakehouse-faq.md
index ce44cb67e6a..2885fab5ba7 100644
--- a/versioned_docs/version-2.1/faq/lakehouse-faq.md
+++ b/versioned_docs/version-2.1/faq/lakehouse-faq.md
@@ -105,6 +105,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. When synchronizing MySQL data to Doris using JDBC Catalog, date data 
synchronization error occurs. Verify if the MySQL version matches the MySQL 
driver package, for example, MySQL 8 and above require the driver 
com.mysql.cj.jdbc.Driver.
 
+5. When a single field is too large, a Java memory OOM occurs on the BE side 
during a query.
+
+   When Jdbc Scanner reads data through JDBC, the session variable 
`batch_size` determines the number of rows processed in the JVM per batch. If a 
single field is too large, it may cause `field_size * batch_size` (approximate 
value, considering JVM static memory and data copy overhead) to exceed the JVM 
memory limit, resulting in OOM.
+
+   Solutions:
+
+   - Reduce the `batch_size` value by executing `set batch_size = 512;`. The 
default value is 4064.
+   - Increase the BE JVM memory by modifying the `-Xmx` parameter in 
`JAVA_OPTS`. For example: `-Xmx8g`.
+
 ## Hive Catalog
 
 1. Accessing Iceberg or Hive table through Hive Catalog reports an error: 
`failed to get schema` or `Storage schema reading not supported`
diff --git a/versioned_docs/version-3.x/faq/lakehouse-faq.md 
b/versioned_docs/version-3.x/faq/lakehouse-faq.md
index ce44cb67e6a..2885fab5ba7 100644
--- a/versioned_docs/version-3.x/faq/lakehouse-faq.md
+++ b/versioned_docs/version-3.x/faq/lakehouse-faq.md
@@ -105,6 +105,15 @@ ln -s 
/etc/pki/ca-trust/extracted/openssl/ca-bundle.trust.crt /etc/ssl/certs/ca-
 
 4. When synchronizing MySQL data to Doris using JDBC Catalog, date data 
synchronization error occurs. Verify if the MySQL version matches the MySQL 
driver package, for example, MySQL 8 and above require the driver 
com.mysql.cj.jdbc.Driver.
 
+5. When a single field is too large, a Java memory OOM occurs on the BE side 
during a query.
+
+   When Jdbc Scanner reads data through JDBC, the session variable 
`batch_size` determines the number of rows processed in the JVM per batch. If a 
single field is too large, it may cause `field_size * batch_size` (approximate 
value, considering JVM static memory and data copy overhead) to exceed the JVM 
memory limit, resulting in OOM.
+
+   Solutions:
+
+   - Reduce the `batch_size` value by executing `set batch_size = 512;`. The 
default value is 4064.
+   - Increase the BE JVM memory by modifying the `-Xmx` parameter in 
`JAVA_OPTS`. For example: `-Xmx8g`.
+
 ## Hive Catalog
 
 1. Accessing Iceberg or Hive table through Hive Catalog reports an error: 
`failed to get schema` or `Storage schema reading not supported`


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

(doris-website) branch master updated: [faq](lakehouse) add jdbc be java oom faq (#3014)

Reply via email to