[ 
https://issues.apache.org/jira/browse/HUDI-8164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-8164:
----------------------------
    Priority: Critical  (was: Major)

> Chain multiple indices support in HoodieFileIndex
> -------------------------------------------------
>
>                 Key: HUDI-8164
>                 URL: https://issues.apache.org/jira/browse/HUDI-8164
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: reader-core
>            Reporter: Danny Chen
>            Assignee: Danny Chen
>            Priority: Critical
>             Fix For: 1.0.0
>
>
> In HoodieFileIndex, we have 5 indices support: 
>  
>  * RecordLevelIndexSupport
>  * BucketIndexSupport
>  * SecondaryIndexSupport
>  * FunctionalIndexSupport
>  * BloomFiltersIndexSupport
>  * ColumnStatsIndexSupport
>  
> But the logic would return early when the first index support applied and got 
> pruned files.
> A better way is we can chain the multiple indices support all-together: one 
> index support can still be applied based on the pruned files returned by 
> another index support.
> Key points in contribution:
> 1. think through the shared contract/param among these indices support;
> 2. add a orthoganal test for random index supoprt composition, valid the 
> result set and the indices support are applied effectively.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to