kirkuz commented on issue #2323: URL: https://github.com/apache/hudi/issues/2323#issuecomment-768163638
I've found it very time and resource consuming. This is why I've decided to change my requirements to partition data by a column that should never change for a particular row (and I changed my index to SIMPLE only), therefore there shouldn't be a case that records moves from part_1 to part_2. In such approach I will pay more for AWS Athena queries (cause I will query more data due to bigger partitions), but less for AWS EMR to process such kind of data. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
