bvaradar commented on issue #1825: URL: https://github.com/apache/hudi/issues/1825#issuecomment-659804284
@asheeshgarg : Just to be on consistent terminology, the 2 physical files you listed is 2 different versions of the same file. So, queries will be seeing one parquet file. This is fine and expected if we are ingesting only few records per batch which can all fit in a single file. Some questions : Are you ingesting each source dataset as a separate batch to Hudi ? If so, I only see 3 commits ? Can you paste the contents of 3 commit files : 20200716171413.commit, 20200716170252.commit and 20200716154733.commit and let me know how many records were you expecting to ingest. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
