suryaprasanna opened a new pull request, #18417:
URL: https://github.com/apache/hudi/pull/18417

   ### Describe the issue this Pull Request addresses
   
   File index operations currently do not provide enough visibility into how 
much time is spent listing partitions, fetching files from metadata, and 
filtering files into file slices. This makes it harder to debug slow queries 
and identify where file-index time is being spent.
   
   ### Summary and Changelog
   
   Adds timing and context logs in `BaseHoodieTableFileIndex` for key 
file-index stages.
   
   - Log time taken to list partition paths with and without partition 
predicates
   - Log cache miss counts before fetching uncached partition files
   - Log time taken by `getAllFilesInPartitions`
   - Log time taken by `filterFiles` while building file slices
   - Include table name in partition listing failure messages
   
   ### Impact
   
   No public API or user-facing behavior change. This improves observability 
for file-index execution and helps diagnose performance issues in partition and 
file listing paths.
   
   ### Risk Level
   
   low
   
   This change only adds logging and slightly updates an internal exception 
message with table context. No functional behavior is intended to change.
   
   ### Documentation Update
   
   none
   
   ### Contributor's checklist
   
   - [x] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [x] Enough context is provided in the sections above
   - [x] Adequate tests were added if applicable
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to