[ 
https://issues.apache.org/jira/browse/HUDI-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

pengzhiwei updated HUDI-1591:
-----------------------------
    Summary: Implement Spark's FileIndex for Hudi to support queries via Hudi 
DataSource using non-globbed table path and partition pruning  (was: Improve 
Hoodie Table Query Performance And Ease Of Use For Spark)

> Implement Spark's FileIndex for Hudi to support queries via Hudi DataSource 
> using non-globbed table path and partition pruning
> ------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HUDI-1591
>                 URL: https://issues.apache.org/jira/browse/HUDI-1591
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: Spark Integration
>    Affects Versions: 0.9.0
>            Reporter: pengzhiwei
>            Assignee: pengzhiwei
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.9.0
>
>
> We have found same  problems on query hoodie table on spark:
> 1、Users must  specify "*" to tell the partition level to spark for the query.
> 2、Cannot support partition prune for COW table.
> This issue wants to achieve the following goals:
> 1、Support No Stars query for hoodie table.
> 2、Support partition prune for COW table.
> Refer to the documentation for more details about this: [Optimization For 
> Hudi COW 
> Query|https://docs.google.com/document/d/1qG014M3VZg3lMswsZv7cYB9Tb0vz8yXgqvlI_Jlnnsc/edit#heading=h.k6ro6dhgwh8y]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to