Vinaykumar Bhat created HUDI-7569:
-------------------------------------
Summary: Fix wrong result while using RLI for pruning files
Key: HUDI-7569
URL: https://issues.apache.org/jira/browse/HUDI-7569
Project: Apache Hudi
Issue Type: Bug
Reporter: Vinaykumar Bhat
Data skipping (file pruning) with the record-level index (RLI) is supported
only when the query predicate has `EqualTo` or `In` expressions/filters on the
record-key column.
However, the logic for detecting a valid `In` expression/filter on the
record key is buggy. It prunes files on the assumption that an `In`
expression/filter can only reference the record-key column, even when the `In`
predicate is on a different column.
For example, a query of the form `select * from trips_table where driver in
('abc', 'xyz')` can return wrong results if the record-key column of this table
also has the values 'abc' or 'xyz' in some rows, because files would then be
pruned based on record-key matches rather than on `driver`.
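The missing check can be illustrated with a minimal sketch. This is not Hudi's
actual code: the `In` and `EqualTo` classes, the `record_keys_for_pruning`
helper, and the record-key column name `uuid` are all hypothetical stand-ins
chosen for illustration. The point is only that the literals of a filter should
be treated as record keys when, and only when, the filter references the
record-key column.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class EqualTo:
    """Hypothetical stand-in for an equality filter: column = value."""
    column: str
    value: str


@dataclass(frozen=True)
class In:
    """Hypothetical stand-in for an IN filter: column IN (values...)."""
    column: str
    values: Tuple[str, ...]


def record_keys_for_pruning(expr, record_key_column: str) -> Optional[Tuple[str, ...]]:
    """Return the literals usable as record keys for an index lookup.

    Returns None when the filter cannot be used for pruning, in which case
    the caller is expected to scan all candidate files.
    """
    if not isinstance(expr, (EqualTo, In)):
        return None
    # The check this issue is about: a filter on any other column must not
    # have its literals interpreted as record keys.
    if expr.column != record_key_column:
        return None
    if isinstance(expr, EqualTo):
        return (expr.value,)
    return expr.values


# Assumed record-key column for the example table (not stated in the issue).
RECORD_KEY = "uuid"

on_driver = record_keys_for_pruning(In("driver", ("abc", "xyz")), RECORD_KEY)
on_key = record_keys_for_pruning(In("uuid", ("abc", "xyz")), RECORD_KEY)
```

With this check, the `driver in ('abc', 'xyz')` filter yields no keys and
pruning is skipped, while the same filter on the record-key column still yields
`('abc', 'xyz')` for the index lookup.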
--
This message was sent by Atlassian Jira
(v8.20.10#820010)