chenbodeng719 commented on issue #8279: URL: https://github.com/apache/hudi/issues/8279#issuecomment-1488618172
At first, I wanted to load all of the data quickly, so I ran a bulk insert into Hudi with "write.tasks=256". Then I stopped the bulk insert and switched to upsert, still with "write.tasks=256", to catch up with the messages from Kafka. Once I had caught up with the backlog, I wanted to shrink the consumer cluster, so I changed write.tasks to 50. It turns out that this causes data loss. Is this expected? Am I doing something wrong?
