kkrugler opened a new pull request, #7658:
URL: https://github.com/apache/hudi/pull/7658

   ### Change Logs
   
   Currently the log message "Creating new file for partition path" is 
generated every time the current partition changes, even when no new file is 
being created, which is confusing.
   
   Also, if data flows in from multiple topics with even a slight lag 
between them, the current partition can bounce back and forth 
when the event time is close to a partition boundary. In our case this generates 
upwards of 1M log messages, given that our data rate is between 5M and 20M 
records/second.
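
   The behavior described above can be illustrated with a minimal sketch. This is not the actual patch or Hudi's code: `PartitionWriterCache`, `writerFor`, and `creationLogCount` are hypothetical names, and a `StringBuilder` stands in for a real file writer. The assumption is that the message should be tied to writer creation rather than to a change of the current partition.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; none of these names come from Hudi.
public class PartitionWriterCache {
    // One stand-in "writer" per partition path (assumption: writers are cached).
    private final Map<String, StringBuilder> writers = new HashMap<>();
    private int creationLogCount = 0;

    // Returns the writer for a partition, logging only when one is really created.
    public StringBuilder writerFor(String partitionPath) {
        StringBuilder writer = writers.get(partitionPath);
        if (writer == null) {
            writer = new StringBuilder();
            writers.put(partitionPath, writer);
            creationLogCount++;
            System.out.println("Creating new file for partition path " + partitionPath);
        }
        return writer;
    }

    public int creationLogCount() {
        return creationLogCount;
    }

    public static void main(String[] args) {
        PartitionWriterCache cache = new PartitionWriterCache();
        // Event times near a boundary make the current partition alternate.
        String[] partitions = {"2023/01/12", "2023/01/13", "2023/01/12", "2023/01/13"};
        for (String p : partitions) {
            cache.writerFor(p).append("record\n");
        }
        // Two distinct partitions, so only two creation messages are logged.
        System.out.println("creation messages: " + cache.creationLogCount());
    }
}
```

With the guard inside the creation branch, alternating between two partitions logs one message per partition instead of one per switch.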
   
   ### Impact
   
   N/A
   
   ### Risk level (write none, low, medium or high below)
   
   low
   
   ### Documentation Update
   
   N/A
   
   ### Contributor's checklist
   
   - [X] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [X] Change Logs and Impact were stated clearly
   - [X] Adequate tests were added if applicable
   - [ ] CI passed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
