shenbachand commented on issue #9787: URL: https://github.com/apache/hudi/issues/9787#issuecomment-1738800272
Thanks Danny. The scenario is that we process customer data and store it in S3, partitioned by date. If a customer no longer wants the company to hold their data, we have to delete that customer's records in S3, including historical data. We cannot delete the whole partition, because it contains the details of every customer for that day. I will try the Flink state TTL options. One question: could this be solved through the concurrency option, where another writer (a separate job or tech stack) uses the same Hudi metadata files and deletes the required records?
