[
https://issues.apache.org/jira/browse/HUDI-7569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vinaykumar Bhat reassigned HUDI-7569:
-------------------------------------
Story Points: 4
Assignee: Vinaykumar Bhat
> Fix wrong result while using RLI for pruning files
> --------------------------------------------------
>
> Key: HUDI-7569
> URL: https://issues.apache.org/jira/browse/HUDI-7569
> Project: Apache Hudi
> Issue Type: Bug
> Reporter: Vinaykumar Bhat
> Assignee: Vinaykumar Bhat
> Priority: Major
> Labels: hudi-1.0.0-beta2, pull-request-available
> Fix For: 1.0.0
>
>
> Data skipping (pruning files) for RLI is supported only when the query
> predicate has `EqualTo` or `In` expressions/filters on the record-key column.
> However, the logic for detecting valid `In` expression/filter on record-key
> has bugs. It tries to prune files assuming that `In` expression/filter can
> reference only record-key column even when the `In` query is based on other
> columns.
>
> For example, a query of the foem `select * from trips_table where driver in
> ('abc', 'xyz')` has the potential to return wrong results if the record-key
> for this table also has values 'abc' or 'xyz' for some rows of the table.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)