mkk1490 commented on issue #3400: URL: https://github.com/apache/hudi/issues/3400#issuecomment-905417495
@nsivabalan Thanks for explaining how CoW works. I investigated the updates in my table and found that they are spread across all partitions, with a few million records updated in each partition. I have small file handling enabled, the average part file is about 800 MB, and I can see an increase in size with every commit. I also checked the other table, which showed no major increase in total size on disk. The reason is that its updates were concentrated in a few partitions rather than spread across many. Thanks for all the information. This can be closed.
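
For anyone who lands here with the same symptom, a rough back-of-the-envelope sketch of the difference between the two tables. This is a toy model, not Hudi code: it assumes that a copy-on-write commit rewrites in full every base file that receives at least one update, and the partition and file counts are made-up illustrative numbers (only the 800 MB file size comes from my table). The function name `rewritten_mb` is hypothetical.

```python
# Toy model of copy-on-write rewrite cost; this is NOT Hudi code.
# Assumption: any base file that receives at least one update is
# rewritten in full as a new file version during the commit.

def rewritten_mb(files_touched_per_partition, partitions_touched, file_size_mb=800):
    """Return the MB of new base-file data written by one commit."""
    return files_touched_per_partition * partitions_touched * file_size_mb

# Hypothetical layout: 100 partitions, 10 files each, 800 MB per file.
# Case 1: updates hit every file in every partition.
spread = rewritten_mb(files_touched_per_partition=10, partitions_touched=100)

# Case 2: the same kind of update, but confined to 3 partitions.
concentrated = rewritten_mb(files_touched_per_partition=10, partitions_touched=3)

print(f"updates spread across all partitions: {spread} MB rewritten per commit")
print(f"updates concentrated in 3 partitions: {concentrated} MB rewritten per commit")
```

Under these assumptions the spread-out case writes far more new file data per commit, and the older file versions stay on disk until cleaning removes them, which would be consistent with the growth I saw.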
