kirkuz commented on issue #1828: URL: https://github.com/apache/hudi/issues/1828#issuecomment-658587504
Hi @bhasudha, thanks for this information. Now, it's clear how it's working. My use case is as following: I want to have only last change in parquet files, because when I try to read it with AWS Athena it sees doubled record with different _hoodie_commit. In ideal world I want to have one S3 bucket with only last change (to not force users to deduplicate it in AWS Athena query) and the second bucket with all commits to have whole history. Can you recommend me sth? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
