This is an automated email from the ASF dual-hosted git repository.
ruifengz pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 4d3b45db9195 [SPARK-54849][PYTHON][FOLLOW-UP] Update minimum version
of pyarrow in `setup.py` and `pyspark_upgrade.rst`
4d3b45db9195 is described below
commit 4d3b45db9195b20f312aaccf8f5fbac9b6ee2f5c
Author: Ruifeng Zheng <[email protected]>
AuthorDate: Tue Jan 20 15:54:46 2026 +0800
[SPARK-54849][PYTHON][FOLLOW-UP] Update minimum version of pyarrow in
`setup.py` and `pyspark_upgrade.rst`
### What changes were proposed in this pull request?
Update minimum version of pyarrow in `setup.py` and `pyspark_upgrade.rst`
### Why are the changes needed?
the two parts were missing in previous pr
### Does this PR introduce _any_ user-facing change?
no
### How was this patch tested?
ci
### Was this patch authored or co-authored using generative AI tooling?
no
Closes #53863 from zhengruifeng/py_setup_doc.
Authored-by: Ruifeng Zheng <[email protected]>
Signed-off-by: Ruifeng Zheng <[email protected]>
---
python/docs/source/migration_guide/pyspark_upgrade.rst | 1 +
python/packaging/classic/setup.py | 2 +-
python/packaging/client/setup.py | 2 +-
python/packaging/connect/setup.py | 2 +-
4 files changed, 4 insertions(+), 3 deletions(-)
diff --git a/python/docs/source/migration_guide/pyspark_upgrade.rst
b/python/docs/source/migration_guide/pyspark_upgrade.rst
index fbd63539b380..2a6d9c55d2ff 100644
--- a/python/docs/source/migration_guide/pyspark_upgrade.rst
+++ b/python/docs/source/migration_guide/pyspark_upgrade.rst
@@ -21,6 +21,7 @@ Upgrading PySpark
Upgrading from PySpark 4.1 to 4.2
---------------------------------
+* In Spark 4.2, the minimum supported version for PyArrow has been raised from
15.0.0 to 18.0.0 in PySpark.
* In Spark 4.2, columnar data exchange between PySpark and the JVM uses Apache
Arrow by default. The configuration
``spark.sql.execution.arrow.pyspark.enabled`` now defaults to true. To restore
the legacy (non-Arrow) row-based data exchange, set
``spark.sql.execution.arrow.pyspark.enabled`` to ``false``.
* In Spark 4.2, regular Python UDFs are Arrow-optimized by default. The
configuration ``spark.sql.execution.pythonUDF.arrow.enabled`` now defaults to
true. To restore the legacy behavior for Python UDF execution, set
``spark.sql.execution.pythonUDF.arrow.enabled`` to ``false``.
* In Spark 4.2, regular Python UDTFs are Arrow-optimized by default. The
configuration ``spark.sql.execution.pythonUDTF.arrow.enabled`` now defaults to
true. To restore the legacy behavior for Python UDTF execution, set
``spark.sql.execution.pythonUDTF.arrow.enabled`` to ``false``.
diff --git a/python/packaging/classic/setup.py
b/python/packaging/classic/setup.py
index 54ec4abe3be9..a8be5da8ec35 100755
--- a/python/packaging/classic/setup.py
+++ b/python/packaging/classic/setup.py
@@ -152,7 +152,7 @@ if in_spark:
# python/packaging/client/setup.py, and python/packaging/connect/setup.py
_minimum_pandas_version = "2.2.0"
_minimum_numpy_version = "1.21"
-_minimum_pyarrow_version = "15.0.0"
+_minimum_pyarrow_version = "18.0.0"
_minimum_grpc_version = "1.76.0"
_minimum_googleapis_common_protos_version = "1.71.0"
_minimum_pyyaml_version = "3.11"
diff --git a/python/packaging/client/setup.py b/python/packaging/client/setup.py
index ee404210f293..31936c9487a0 100755
--- a/python/packaging/client/setup.py
+++ b/python/packaging/client/setup.py
@@ -135,7 +135,7 @@ try:
# python/packaging/classic/setup.py, and python/packaging/connect/setup.py
_minimum_pandas_version = "2.2.0"
_minimum_numpy_version = "1.21"
- _minimum_pyarrow_version = "15.0.0"
+ _minimum_pyarrow_version = "18.0.0"
_minimum_grpc_version = "1.76.0"
_minimum_googleapis_common_protos_version = "1.71.0"
_minimum_pyyaml_version = "3.11"
diff --git a/python/packaging/connect/setup.py
b/python/packaging/connect/setup.py
index 9a1a4ea81255..21ba8e0c2b50 100755
--- a/python/packaging/connect/setup.py
+++ b/python/packaging/connect/setup.py
@@ -88,7 +88,7 @@ try:
# python/packaging/classic/setup.py, and python/packaging/client/setup.py
_minimum_pandas_version = "2.0.0"
_minimum_numpy_version = "1.21"
- _minimum_pyarrow_version = "11.0.0"
+ _minimum_pyarrow_version = "18.0.0"
_minimum_grpc_version = "1.76.0"
_minimum_googleapis_common_protos_version = "1.71.0"
_minimum_pyyaml_version = "3.11"
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]