[
https://issues.apache.org/jira/browse/HUDI-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
pengzhiwei updated HUDI-1591:
-----------------------------
Description:
We have found same problems on query hoodie table on spark:
1、Users must specify "*" to tell the partition level to spark for the query.
2、Cannot support partition prune for COW table.
This issue wants to achieve the following goals:
1、Support No Stars query for hoodie table.
2、Support partition prune for COW table.
Refer to the documentation for more details about this: [Optimization For Hudi
COW
Query|https://docs.google.com/document/d/1qG014M3VZg3lMswsZv7cYB9Tb0vz8yXgqvlI_Jlnnsc/edit#heading=h.k6ro6dhgwh8y]
was:
We have found same problems on query hoodie table on spark:
1、Users must specify "*" to tell the partition level to spark for the query.
2、Cannot support partition prune for COW table.
3、Performance issue in the *HoodieROTablePathFilter* for COW table.
This issue wants to achieve the following goals:
1、Support No Stars query for hoodie table.
2、Support partition prune for COW table.
3、Solve the performance issue with *HoodieROTablePathFilter.*
Refer to the documentation for more details about this: [Optimization For Hudi
COW
Query|https://docs.google.com/document/d/1qG014M3VZg3lMswsZv7cYB9Tb0vz8yXgqvlI_Jlnnsc/edit#heading=h.k6ro6dhgwh8y]
> Improve Hoodie Table Query Performance And Ease Of Use For Spark
> ----------------------------------------------------------------
>
> Key: HUDI-1591
> URL: https://issues.apache.org/jira/browse/HUDI-1591
> Project: Apache Hudi
> Issue Type: Improvement
> Components: Spark Integration
> Reporter: pengzhiwei
> Assignee: pengzhiwei
> Priority: Major
> Fix For: 0.8.0
>
>
> We have found same problems on query hoodie table on spark:
> 1、Users must specify "*" to tell the partition level to spark for the query.
> 2、Cannot support partition prune for COW table.
> This issue wants to achieve the following goals:
> 1、Support No Stars query for hoodie table.
> 2、Support partition prune for COW table.
> Refer to the documentation for more details about this: [Optimization For
> Hudi COW
> Query|https://docs.google.com/document/d/1qG014M3VZg3lMswsZv7cYB9Tb0vz8yXgqvlI_Jlnnsc/edit#heading=h.k6ro6dhgwh8y]
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)