bvaradar opened a new pull request #968: [HUDI-169] Speed up rolling back of 
instants
URL: https://github.com/apache/incubator-hudi/pull/968
 
 
   The logic which rolls-back a commit performs parallel rollback  only at 
partition level.  Depending on the input batch, there could be skew in number 
of files per partition that needs to be rolled-back.
   
   For partition with too many files to be rolled-back, each rollback appends a 
log-block to the file-slice. This could potentially take a long time as they 
are done serially. 
   
   This PR addresses the problem by parallelizing at file-level

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to