bvaradar commented on issue #2267:
URL: https://github.com/apache/hudi/issues/2267#issuecomment-731650631


   @bithw1 : The 3 Parquet files are different versions of the same file. 
   
   Your second and third batches both contain new records as well as updates. 
Hudi writes new records to Parquet files but can route updates to delta files. 
Because the initial Parquet file holds very few records, Hudi treats it as a 
small file and routes both the new records and the updates to that same Parquet 
file, creating a new version of the file with each batch. 
   
   If batch 2 contains only updates, you will see the delta files.
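
   As a rough illustration of the routing described above, here is a toy Python model for a single file group. It is only a sketch, not Hudi's actual code: `Batch`, `route_batch`, and the `SMALL_FILE_LIMIT_BYTES` value are hypothetical names and numbers made up for this example. In Hudi the threshold is controlled by the `hoodie.parquet.small.file.limit` write config.

```python
from dataclasses import dataclass

# Assumed threshold for this sketch only; in Hudi the real knob is the
# hoodie.parquet.small.file.limit write config (a size in bytes).
SMALL_FILE_LIMIT_BYTES = 100 * 1024 * 1024


@dataclass
class Batch:
    inserts: int  # new records in the batch
    updates: int  # records that update existing keys


def route_batch(batch: Batch, base_file_bytes: int) -> dict:
    """Toy model of where the records of one upsert batch land."""
    is_small = base_file_bytes < SMALL_FILE_LIMIT_BYTES
    if batch.inserts > 0 and is_small:
        # Inserts are packed into the small file, so the file is rewritten
        # as a new Parquet version that also absorbs the updates.
        return {"base_file_records": batch.inserts + batch.updates,
                "delta_file_records": 0}
    # Otherwise updates go to a delta file; any inserts go to a Parquet file.
    return {"base_file_records": batch.inserts,
            "delta_file_records": batch.updates}


# Mixed batch against a tiny base file: everything lands in a new Parquet version.
print(route_batch(Batch(inserts=5, updates=3), base_file_bytes=1024))
# Updates-only batch: the updates land in a delta file.
print(route_batch(Batch(inserts=0, updates=3), base_file_bytes=1024))
```

   The first call mirrors the mixed batches in the issue (no delta files appear); the second mirrors an updates-only batch, where delta files do show up.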

