bvaradar commented on issue #2267: URL: https://github.com/apache/hudi/issues/2267#issuecomment-731650631
@bithw1 : The 3 Parquet files are different versions of the same file. Your second and third batch has both new records and updates. Hudi will write new records to parquet files but can route updates to delta files. As you have very few records in the initial parquet file, Hudi will route both new records and updates to the same Parquet file and a new version of the file is created. If you do only have updates in batch 2, you will see the delta files. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
