xiaoli created SPARK-37034: ------------------------------ Summary: What's the progress of vectorized execution for spark? Key: SPARK-37034 URL: https://issues.apache.org/jira/browse/SPARK-37034 Project: Spark Issue Type: New Feature Components: Spark Core Affects Versions: 3.2.0 Reporter: xiaoli
Spark has support vectorized read for ORC and parquet. What's the progress of other vectorized execution, e.g. vectorized write, join, aggr, simple operator (string function, math function)? Hive support vectorized execution in [early version|[https://cwiki.apache.org/confluence/display/hive/vectorized+query+execution]|https://cwiki.apache.org/confluence/display/hive/vectorized+query+execution].] As we know, Spark is replacement of Hive. I guess the reason why Spark does not support vectorized execution maybe the difficulty of design or implementation in Spark is larger. What's the main issue for Spark to support vectorized execution? -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org