prashantwason opened a new pull request, #8484:
URL: https://github.com/apache/hudi/pull/8484

   [HUDI-6092] Reuse schema objects while deserializing log blocks.
   
   ### Change Logs
   
   1. Added a ConcurrentHashMap in HoodieDataBlock to hold schema string to 
schema object mapping
   2. In HoodieHFileDataBlock and HoodieAvroDataBlock, use the above map to 
retrive the schema object rather than parsing the schema every time.
   
   Also introduced some try { } blocks in code to auto close resources which 
were being leaked.
   
   ### Impact
   
   When reading log files with a very large number of log blocks, there is 
reduced memory consumption.
   
   ### Risk level (write none, low medium or high below)
   
   None
   
   ### Documentation Update
   
   None
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to