satish created HUDI-1539:
----------------------------
Summary: Bug in HoodieCombineRealtimeRecordReader returns wrong
results
Key: HUDI-1539
URL: https://issues.apache.org/jira/browse/HUDI-1539
Project: Apache Hudi
Issue Type: Bug
Reporter: satish
https://github.com/apache/hudi/issues/2346#issuecomment-758591316
in a rt table
the hive query has predicate push down
there are no less than 3 splits (thus no less than 3 recordReaders in
HoodieCombineRealtimeRecordReader), and the records satisfy the predicate are
in the split which is in a relatively back position of the List
2 recordReaders in succession with this.currentRecordReader.next(key, value)
returns false, as the predicate push down has filtered the baseFile.
In step 4, it leads to HoodieCombineRealtimeRecordReader::next(NullWritable
key, ArrayWritable value) return false and the reader will stop read next. So,
records which satisfy the predicate are in the remanined recordReaders but can
not be read.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)