ad1happy2go commented on issue #10559: URL: https://github.com/apache/hudi/issues/10559#issuecomment-1908435279
@rishabhreply Sorry, but I am a bit confused. Do you really want to use insert_overwrite in this case? If you just submit two parallel jobs with insert_overwrite, one is going to overwrite the others data in any case. Even if you sequentially then also you will miss the data ingested by first one. So you can only use insert_overwrite if you want to process all 10 files in one batch. Let me know in case I am not thinking in right direction -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
