lokesh-lingarajan-0310 opened a new pull request, #9538:
URL: https://github.com/apache/hudi/pull/9538

   ### Change Logs
   
   A recent refactoring that added batching within a commit for the GCS 
incremental job moved object filtering to after checkpoint batching. This is a 
problem in bootstrap scenarios where only the latest commits are of interest: 
the job has to go through the entire set of commits, bounded by the source 
limit, instead of skipping directly to the latest commit. 
   
   The fix is to apply filtering before checkpoint batching starts. This change 
brings the GCS job in line with the S3 job. 
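
   To illustrate why the order matters, here is a minimal sketch in plain 
Python. It is not the Hudi implementation (which operates on Spark datasets); 
`ObjectEvent`, `batch_by_source_limit`, the `.parquet` filter and the sizes are 
all hypothetical names and values chosen for the example. 

```python
from typing import Callable, List, NamedTuple


class ObjectEvent(NamedTuple):
    """Hypothetical stand-in for one cloud object notification."""
    commit_time: str
    path: str
    size: int


def batch_by_source_limit(events: List[ObjectEvent], source_limit: int) -> List[ObjectEvent]:
    """Take events in order until adding one more would exceed source_limit.

    At least one event is always taken so the job can make progress.
    """
    batch: List[ObjectEvent] = []
    total = 0
    for event in events:
        if batch and total + event.size > source_limit:
            break
        batch.append(event)
        total += event.size
    return batch


def batch_then_filter(events: List[ObjectEvent],
                      keep: Callable[[ObjectEvent], bool],
                      source_limit: int) -> List[ObjectEvent]:
    """Order after the refactoring: irrelevant objects use up the batch."""
    return [e for e in batch_by_source_limit(events, source_limit) if keep(e)]


def filter_then_batch(events: List[ObjectEvent],
                      keep: Callable[[ObjectEvent], bool],
                      source_limit: int) -> List[ObjectEvent]:
    """Order proposed by this fix: only relevant objects count toward the batch."""
    return batch_by_source_limit([e for e in events if keep(e)], source_limit)


# Assumed sample data: two irrelevant objects followed by one relevant object.
events = [
    ObjectEvent("001", "a/_tmp", 100),
    ObjectEvent("002", "b/_tmp", 100),
    ObjectEvent("003", "c/data.parquet", 100),
]


def keep(event: ObjectEvent) -> bool:
    # Assumed filter: only parquet files are of interest.
    return event.path.endswith(".parquet")


wasted_round = batch_then_filter(events, keep, source_limit=200)
useful_round = filter_then_batch(events, keep, source_limit=200)
```

   With a source limit of 200, batching first fills the batch with the two 
irrelevant objects and yields nothing after filtering, so another round is 
needed. Filtering first reaches the relevant object in the same round. 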
   
   ### Impact
   
   medium
   
   ### Risk level (write none, low, medium or high below)
   
   low
   
   ### Contributor's checklist
   
   - [x] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [x] Change Logs and Impact were stated clearly
   - [x] Adequate tests were added if applicable
   - [x] CI passed
   

