kirkuz commented on issue #1828:
URL: https://github.com/apache/hudi/issues/1828#issuecomment-658587504


   Hi @bhasudha, thanks for this information. Now, it's clear how it's working. 
My use case is as following: I want to have only last change in parquet files, 
because when I try to read it with AWS Athena it sees doubled record with 
different _hoodie_commit. In ideal world I want to have one S3 bucket with only 
last change (to not force users to deduplicate it in AWS Athena query) and the 
second bucket with all commits to have whole history. 
   
   Can you recommend me sth? 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to