xiaoli created SPARK-37034:
------------------------------

             Summary: What's the progress of vectorized execution for spark?
                 Key: SPARK-37034
                 URL: https://issues.apache.org/jira/browse/SPARK-37034
             Project: Spark
          Issue Type: New Feature
          Components: Spark Core
    Affects Versions: 3.2.0
            Reporter: xiaoli


Spark has support vectorized read for ORC and parquet. What's the progress of 
other vectorized execution, e.g. vectorized write,  join, aggr, simple operator 
(string function, math function)? 

Hive support vectorized execution in [early 
version|[https://cwiki.apache.org/confluence/display/hive/vectorized+query+execution]|https://cwiki.apache.org/confluence/display/hive/vectorized+query+execution].]
 As we know, Spark is replacement of Hive. I guess the reason why Spark does 
not support vectorized execution maybe the difficulty of design or 
implementation in Spark is larger. What's the main issue for Spark to support 
vectorized execution?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to