nsivabalan commented on issue #4044: URL: https://github.com/apache/hudi/issues/4044#issuecomment-997295187
Hey @nikita-sheremet-clearscale : what do you mean by "What happens when some partitions were deleted and then indexed?" ? can you please clarify. In general, its not encouraged to delete data files manually, but leave it to rollback to take care of cleaning up partially failed commits. my guess is that, your large job failed due to OOM or some other reason. And so the next job you started, does a rollback of the partially failed data which is too huge and hence takes a lot of time. but even then, deletions should not take such a long time though. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
