mkk1490 commented on issue #3400:
URL: https://github.com/apache/hudi/issues/3400#issuecomment-905417495


   @nsivabalan Thanks for the information on how CoW works. I was investigating 
the updates in my table. I found that updates are spread across all partitions 
and few million records are getting updated in each of the partition. 
   I have enabled small file handling and the average size of each part file is 
800 MB and I can see for every commit, there is an increase in size.
   I also checked the other table where there was no major increase in total 
size on disk. The reason being updates were not spread  across multiple 
partitions and were concentrated on a few partitions. 
   
   Thanks for all the information. This can be closed.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to