[ https://issues.apache.org/jira/browse/HIVE-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14209327#comment-14209327 ]
Reynold Xin commented on HIVE-7333:
-----------------------------------
I don't think any changes are necessary in Spark. At the end of the day, you
can run arbitrary code on arbitrary records for each partition; that alone
should be sufficient to implement vectorization.
You can even put an entire partition of records into one iterator output ...
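
To illustrate the per-partition idea, here is a minimal sketch in plain Python, not code from Hive or Spark. It assumes a partition is handed to user code as an iterator of records, which is how a function passed to PySpark's `rdd.mapPartitions` receives it. The names `batch_partition`, `sum_batches`, `whole_partition` and the default batch size of 1024 are hypothetical, chosen only for the example.

```python
from itertools import islice
from typing import Iterable, Iterator, List


def batch_partition(records: Iterable[int], batch_size: int = 1024) -> Iterator[List[int]]:
    """Group one partition's records into batches of at most batch_size."""
    it = iter(records)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch


def sum_batches(records: Iterable[int], batch_size: int = 1024) -> Iterator[int]:
    """Hypothetical batch-at-a-time operator: one result per batch, not per row."""
    for batch in batch_partition(records, batch_size):
        yield sum(batch)


def whole_partition(records: Iterable[int]) -> Iterator[List[int]]:
    """Emit the entire partition as a single iterator element."""
    yield list(records)


# With PySpark (assumed, not required to run this file), a function like these
# could be applied per partition, e.g. rdd.mapPartitions(sum_batches).
if __name__ == "__main__":
    partition = range(10)
    print(list(sum_batches(partition, batch_size=4)))
    print(list(whole_partition(partition)))
```

Because the function sees the whole partition iterator, the batching happens entirely in user code, which is the sense in which no engine change would be needed. Materializing a full partition as one element, as `whole_partition` does, holds all of its records in memory at once.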
> Create RDD translator, translating Hive Tables into Spark RDDs [Spark Branch]
> -----------------------------------------------------------------------------
>
> Key: HIVE-7333
> URL: https://issues.apache.org/jira/browse/HIVE-7333
> Project: Hive
> Issue Type: Sub-task
> Components: Spark
> Reporter: Xuefu Zhang
> Assignee: Rui Li
> Labels: Spark-M1
>
> Please refer to the design specification.