[ 
https://issues.apache.org/jira/browse/HUDI-7569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinaykumar Bhat updated HUDI-7569:
----------------------------------
    Epic Link: HUDI-512

> Fix wrong result while using RLI for pruning files
> --------------------------------------------------
>
>                 Key: HUDI-7569
>                 URL: https://issues.apache.org/jira/browse/HUDI-7569
>             Project: Apache Hudi
>          Issue Type: Bug
>            Reporter: Vinaykumar Bhat
>            Priority: Major
>              Labels: hudi-1.0.0-beta2
>             Fix For: 1.0.0
>
>
> Data skipping (pruning files) for RLI is supported only when the query 
> predicate has `EqualTo` or `In` expressions/filters on the record-key column. 
> However, the logic for detecting valid `In` expression/filter on record-key 
> has bugs. It tries to prune files assuming that `In` expression/filter can 
> reference only record-key column even when the `In` query is based on other 
> columns.
>  
> For example, a query of the foem `select * from trips_table where driver in 
> ('abc', 'xyz')` has the potential to return wrong results if the record-key 
> for this table also has values 'abc' or 'xyz' for some rows of the table.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to