Apache Spark 3.5.0 Expectations (?)

Dongjoon Hyun Sun, 28 May 2023 17:42:27 -0700

Hi, All.

Apache Spark 3.5.0 is scheduled for August (1st Release Candidate) and
currently a few notable things are under discussions in the mailing list.


I believe it's a good time to share a short summary list (containing both
completed and in-progress items) to give a highlight in advance and to
collect your targets too.

Please share your expectations or working items if you want to prioritize
them more in the community in Apache Spark 3.5.0 timeframe.

(Sorted by ID)
SPARK-40497 Upgrade Scala 2.13.11
SPARK-42452 Remove hadoop-2 profile from Apache Spark 3.5.0
SPARK-42913 Upgrade to Hadoop 3.3.5 (aws-java-sdk-bundle: 1.12.262 ->
1.12.316)
SPARK-43024 Upgrade Pandas to 2.0.0
SPARK-43200 Remove Hadoop 2 reference in docs
SPARK-43347 Remove Python 3.7 Support
SPARK-43348 Support Python 3.8 in PyPy3
SPARK-43351 Add Spark Connect Go prototype code and example
SPARK-43379 Deprecate old Java 8 versions prior to 8u371
SPARK-43394 Upgrade to Maven 3.8.8
SPARK-43436 Upgrade to RocksDbjni 8.1.1.1
SPARK-43446 Upgrade to Apache Arrow 12.0.0
SPARK-43447 Support R 4.3.0
SPARK-43489 Remove protobuf 2.5.0
SPARK-43519 Bump Parquet to 1.13.1
SPARK-43581 Upgrade kubernetes-client to 6.6.2
SPARK-43588 Upgrade to ASM 9.5
SPARK-43600 Update K8s doc to recommend K8s 1.24+
SPARK-43738 Upgrade to DropWizard Metrics 4.2.18
SPARK-43831 Build and Run Spark on Java 21
SPARK-43832 Upgrade to Scala 2.12.18
SPARK-43836 Make Scala 2.13 as default in Spark 3.5
SPARK-43842 Upgrade gcs-connector to 2.2.14
SPARK-43844 Update to ORC 1.9.0
UMBRELLA: Add SQL functions into Scala, Python and R API

Thanks,
Dongjoon.

PS. The above is not a list of release blockers. Instead, it could be a
nice-to-have from someone's perspective.

Apache Spark 3.5.0 Expectations (?)

Reply via email to