yanzhanc commented on issue #2252:
URL: https://github.com/apache/hudi/issues/2252#issuecomment-738211160


   there is no failure or cleaning kicking during the process. and we do not 
see any  .rollback or .clean files. But the http head request is very high from 
our observation. and when set partition key to be empty string, the number of 
request will be slightly smaller. 
   and we have one more question. From our observation, the same data write by 
hudi will be smaller in size than parquet. we have 76 GB data as parquet but 
38GB data as hudi when written to s3. To my understanding hudi should be bigger 
because it contains more meta data but yet it is not. Do you have an 
explanation for this? 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to