[
https://issues.apache.org/jira/browse/HUDI-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
pengzhiwei updated HUDI-1591:
-----------------------------
Summary: Implement Spark's FileIndex for Hudi to support queries via Hudi
DataSource using non-globbed table path and partition pruning (was: Improve
Hoodie Table Query Performance And Ease Of Use For Spark)
> Implement Spark's FileIndex for Hudi to support queries via Hudi DataSource
> using non-globbed table path and partition pruning
> ------------------------------------------------------------------------------------------------------------------------------
>
> Key: HUDI-1591
> URL: https://issues.apache.org/jira/browse/HUDI-1591
> Project: Apache Hudi
> Issue Type: Improvement
> Components: Spark Integration
> Affects Versions: 0.9.0
> Reporter: pengzhiwei
> Assignee: pengzhiwei
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.9.0
>
>
> We have found same problems on query hoodie table on spark:
> 1、Users must specify "*" to tell the partition level to spark for the query.
> 2、Cannot support partition prune for COW table.
> This issue wants to achieve the following goals:
> 1、Support No Stars query for hoodie table.
> 2、Support partition prune for COW table.
> Refer to the documentation for more details about this: [Optimization For
> Hudi COW
> Query|https://docs.google.com/document/d/1qG014M3VZg3lMswsZv7cYB9Tb0vz8yXgqvlI_Jlnnsc/edit#heading=h.k6ro6dhgwh8y]
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)