[
https://issues.apache.org/jira/browse/SPARK-42696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yuming Wang reassigned SPARK-42696:
-----------------------------------
Assignee: jiangjiguang0719 (was: Yuming Wang)
> Speed up parquet reading with Java Vector API
> ---------------------------------------------
>
> Key: SPARK-42696
> URL: https://issues.apache.org/jira/browse/SPARK-42696
> Project: Spark
> Issue Type: New Feature
> Components: Input/Output
> Affects Versions: 3.5.0
> Reporter: jiangjiguang0719
> Assignee: jiangjiguang0719
> Priority: Major
>
> Parquet has supported use Java 17 Vector API to perform bit-unpacking to
> enjoy 4x ~ 8x performance gain in microbenchmark.
> I have finished the TPC-H(SF100) benchmark with spark integrated parquet
> optimization, each SQL has a different performance gain, Q6 can reach up 11%
>
> Please assign it to me, I will summit a PR, thanks!
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]