voonhous opened a new issue, #13923:
URL: https://github.com/apache/hudi/issues/13923

   ### Bug Description
   
   **What happened:**
   Tracking this as an issue:
   https://github.com/apache/hudi/pull/13724#issuecomment-3296438818
   
   1. `HoodieAvroReaderContext#getFileRecordIterator` does not take in the
reader config, so the HFile cache might not take effect in this read path. The
reader is created with a fresh `new HoodieConfig()` instead:
`reader = (HoodieAvroFileReader) HoodieIOFactory.getIOFactory(storage).getReaderFactory(HoodieRecord.HoodieRecordType.AVRO).getFileReader(new HoodieConfig(), filePath, fileFormat, Option.empty());`
   2. The HFile log block does not seem to pick up the cache configs, so they
do not take effect there either.
   
   These issues do not affect the reuse read path, which uses the cache configs
and reads log files through `ReusableFileGroupRecordBufferLoader`, which caches
the records.
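
A minimal sketch of the failure mode in item 1, using stand-in types rather than the real Hudi classes. `SketchReader`, `openWithFreshConfig`, `openWithCallerConfig`, and the key `example.hfile.cache.enabled` are all hypothetical names chosen for illustration; only the pattern (constructing a fresh, empty config instead of forwarding the caller's) is taken from the report above.

```java
import java.util.Properties;

public class ConfigPropagationSketch {
    // Hypothetical key for illustration; not a real Hudi config name.
    static final String CACHE_ENABLED_KEY = "example.hfile.cache.enabled";

    // Stand-in for a file reader that decides on caching from the config it receives.
    static final class SketchReader {
        final boolean cacheEnabled;

        SketchReader(Properties config) {
            // Falls back to the default ("false") when the key is absent.
            this.cacheEnabled =
                Boolean.parseBoolean(config.getProperty(CACHE_ENABLED_KEY, "false"));
        }
    }

    // Mirrors the reported pattern: the caller's config is dropped and an
    // empty one is passed instead, so only defaults are visible to the reader.
    static SketchReader openWithFreshConfig(Properties callerConfig) {
        return new SketchReader(new Properties());
    }

    // Forwards the caller's config, so non-default values reach the reader.
    static SketchReader openWithCallerConfig(Properties callerConfig) {
        return new SketchReader(callerConfig);
    }

    public static void main(String[] args) {
        Properties callerConfig = new Properties();
        callerConfig.setProperty(CACHE_ENABLED_KEY, "true");

        System.out.println(openWithFreshConfig(callerConfig).cacheEnabled);  // false
        System.out.println(openWithCallerConfig(callerConfig).cacheEnabled); // true
    }
}
```

In this sketch the fix is simply to thread the caller's config through to the reader; whether the real read path can do the same depends on what the reader context has access to, which this example does not model.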
   
   **What you expected:**
   
   When HFile reader configs are passed in, they should be honored rather than
silently replaced by the default values.
   
   **Steps to reproduce:**
   1. Add a breakpoint at `new HoodieConfig()` in the code quoted above
   2. Set non-default HFile reader configs
   3. Observe that the non-default configurations are ignored
   
   ### Environment
   
   **Hudi version:**
   **Query engine:** (Spark/Flink/Trino etc)
   **Relevant configs:**
   
   
   ### Logs and Stack Trace
   
   _No response_

