yanzhanc commented on issue #2252: URL: https://github.com/apache/hudi/issues/2252#issuecomment-738211160
there is no failure or cleaning kicking during the process. and we do not see any .rollback or .clean files. But the http head request is very high from our observation. and when set partition key to be empty string, the number of request will be slightly smaller. and we have one more question. From our observation, the same data write by hudi will be smaller in size than parquet. we have 76 GB data as parquet but 38GB data as hudi when written to s3. To my understanding hudi should be bigger because it contains more meta data but yet it is not. Do you have an explanation for this? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
