Vinaykumar Bhat created HUDI-7569:
-------------------------------------

             Summary: Fix wrong result while using RLI for pruning files
                 Key: HUDI-7569
                 URL: https://issues.apache.org/jira/browse/HUDI-7569
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Vinaykumar Bhat


Data skipping (file pruning) using the record-level index (RLI) is supported
only when the query predicate has `EqualTo` or `In` expressions/filters on the
record-key column. However, the logic for detecting a valid `In`
expression/filter on the record key is buggy: it assumes that an `In`
expression/filter can only reference the record-key column, so it tries to
prune files even when the `In` predicate is on some other column.
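
The check described above can be sketched as a toy model. This is not Hudi's
actual code: the `EqualTo` and `In` classes below are simplified stand-ins for
the real filter types, and `record_keys_for_pruning` is a hypothetical helper
name. The point is only that the referenced column must be compared with the
record-key column for both filter kinds before any literal is used as a lookup
key.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple, Union


@dataclass(frozen=True)
class EqualTo:
    """Simplified stand-in for an equality filter: column = value."""
    column: str
    value: str


@dataclass(frozen=True)
class In:
    """Simplified stand-in for an IN filter: column in (values...)."""
    column: str
    values: Tuple[str, ...]


Filter = Union[EqualTo, In]


def record_keys_for_pruning(flt: Filter, record_key_column: str) -> Optional[List[str]]:
    """Return the literals usable as index lookup keys, or None if pruning is unsafe.

    Hypothetical helper: None means "do not prune, scan every file".
    """
    if not isinstance(flt, (EqualTo, In)):
        return None  # unsupported filter kind
    # The column check applies to In as well as EqualTo.
    if flt.column != record_key_column:
        return None
    if isinstance(flt, EqualTo):
        return [flt.value]
    return list(flt.values)
```

With a record-key column named `uuid` (an assumed name), an `In` filter on
`driver` yields `None`, so no files are pruned, while the same literals on
`uuid` are returned as lookup keys.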

 

For example, a query of the form `select * from trips_table where driver in
('abc', 'xyz')` has the potential to return wrong results if the record-key
column of this table also contains the values 'abc' or 'xyz' in some rows.
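
The failure mode in the example above can be reproduced with a small
simulation. Everything here is invented for illustration (the file names, the
rows, and the dictionary standing in for the index); it models the described
behavior rather than Hudi's real read path.

```python
# Toy table: file name -> rows of (record_key, driver). All data is invented.
files = {
    "file_1": [("abc", "bob"), ("k2", "xyz")],
    "file_2": [("k3", "abc"), ("k4", "carol")],
}

# Toy record-level index: record key -> name of the file that holds it.
rli = {key: name for name, rows in files.items() for key, _ in rows}


def scan(file_names, drivers):
    """Apply the `driver in (...)` predicate to the rows of the given files."""
    return sorted(
        row for name in file_names for row in files[name] if row[1] in drivers
    )


wanted = {"abc", "xyz"}

# Buggy pruning: the IN literals are looked up as if they were record keys,
# although the predicate is on the `driver` column.
buggy_files = {rli[value] for value in wanted if value in rli}
buggy_result = scan(buggy_files, wanted)

# Correct behavior: the predicate is not on the record key, so nothing is pruned.
correct_result = scan(files.keys(), wanted)

print(buggy_result)    # [('k2', 'xyz')]
print(correct_result)  # [('k2', 'xyz'), ('k3', 'abc')]
```

Because a row with record key 'abc' happens to live in `file_1`, the buggy
path scans only that file and misses the matching driver row in `file_2`.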



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
