This is an automated email from the ASF dual-hosted git repository.
RussellSpitzer pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/iceberg.git
The following commit(s) were added to refs/heads/main by this push:
new ebd0100b39 Docs: Document adaptive split sizing configurations
ebd0100b39 is described below
commit ebd0100b398dc9b8f60350866f9f064b4750cfc1
Author: Pratham Manja <[email protected]>
AuthorDate: Thu May 28 00:53:02 2026 +0530
Docs: Document adaptive split sizing configurations
---
docs/docs/spark-configuration.md | 2 ++
1 file changed, 2 insertions(+)
diff --git a/docs/docs/spark-configuration.md b/docs/docs/spark-configuration.md
index 5972aafc3d..8d9dd6dd27 100644
--- a/docs/docs/spark-configuration.md
+++ b/docs/docs/spark-configuration.md
@@ -207,6 +207,8 @@ val spark = SparkSession.builder()
| spark.sql.iceberg.executor-cache.locality.enabled | false | Enables locality-aware executor cache usage |
| spark.sql.iceberg.merge-schema | false | Enables modifying the table schema to match the write schema. Only adds columns missing columns |
| spark.sql.iceberg.report-column-stats | true | Report Puffin Table Statistics if available to Spark's Cost Based Optimizer. CBO must be enabled for this to be effective |
+| spark.sql.iceberg.read.adaptive-split-size.enabled | Table default | Enables adaptive split sizing for read operations. When enabled, split size is automatically adjusted based on scan size and parallelism |
+| spark.sql.iceberg.read.adaptive-split-size.parallelism | max(spark.default.parallelism, spark.sql.shuffle.partitions) | Overrides the parallelism used for adaptive split sizing. Must be greater than 0 |
| spark.sql.iceberg.async-micro-batch-planning-enabled | false | Enables asynchronous microbatch planning to reduce planning latency by pre-fetching file scan tasks |
### Read options
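
For illustration only, the two options added by this commit could be set on a session roughly as follows. This is a minimal sketch, not part of the commit. The application name, the `local[*]` master, and the parallelism value of 64 are assumptions chosen for the example, and the settings only take effect with an Iceberg Spark runtime that includes the adaptive split sizing feature.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical session setup; appName, master, and the value 64 are
// placeholders for this sketch, not recommended defaults.
val spark = SparkSession.builder()
  .appName("adaptive-split-size-example")
  .master("local[*]")
  // Turn adaptive split sizing on for reads instead of relying on the table default.
  .config("spark.sql.iceberg.read.adaptive-split-size.enabled", "true")
  // Override the parallelism used for adaptive split sizing (must be greater than 0).
  // When unset, the documented default is
  // max(spark.default.parallelism, spark.sql.shuffle.partitions).
  .config("spark.sql.iceberg.read.adaptive-split-size.parallelism", "64")
  .getOrCreate()
```

Leaving the parallelism option unset keeps the documented default, so an explicit override is probably only worth adding when the cluster's effective parallelism differs from those two Spark settings.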