[
https://issues.apache.org/jira/browse/HUDI-1371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17243574#comment-17243574
]
Vinoth Chandar edited comment on HUDI-1371 at 12/3/20, 11:18 PM:
-----------------------------------------------------------------
||Engine||Table Type||Listing Mechanism||
|Spark SQL on Hive |COW|a) with path filter: parallel listing (we good here)
Open: how do we avoid Spark from listing again? i.e the Parquet datasource
being wrapped.
b) |
|Spark SQL on Hive |MOR| |
|Spark Datasource|COW| |
|Spark Datasource|MOR| |
|Presto|COW| |
|Presto|MOR| |
|Hive|COW| |
|Hive |MOR| |
was (Author: vc):
||Engine||Table Type||Listing Mechanism||
|Spark SQL on Hive |COW|a) with path filter: parallel listing (we good here)
|
|Spark SQL on Hive |MOR| |
|Spark Datasource|COW| |
|Spark Datasource|MOR| |
|Presto|COW| |
|Presto|MOR| |
|Hive|COW| |
|Hive |MOR| |
> Implement Spark datasource by fetching file listing from metadata table
> -----------------------------------------------------------------------
>
> Key: HUDI-1371
> URL: https://issues.apache.org/jira/browse/HUDI-1371
> Project: Apache Hudi
> Issue Type: Sub-task
> Components: Spark Integration
> Reporter: Vinoth Chandar
> Assignee: Udit Mehrotra
> Priority: Blocker
> Fix For: 0.7.0
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)