bithw1 commented on issue #6378:
URL: https://github.com/apache/hudi/issues/6378#issuecomment-1212673797
> 1. If you are in streaming mode, each miniBatch creates a commit. For
example, a commit will be created during a checkpoint in the `Flink engine`.
> 2. If you write in batch mode, and write 1000 records at a time, only one
commit will be created.
>
> Happy to help.
Thanks for the helpful reply!
In my case I have a COW table and would like to write 5 records into it,
one record per minute. I am using Spark in batch mode, not streaming.
What I am doing is:
0. start spark application
1. kick off 1st spark write job and write one record
2. sleep 1 minute
3. kick off 2nd spark write job and write one record
....
n. stop spark application
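
To make the steps above concrete, here is a rough PySpark sketch of what I mean. The table name, base path, and column names (`id`, `ts`, `part`) are made up for illustration, and the Hudi options are only the ones I assume are needed for a simple COW upsert; the real job may differ. The loop is kept separate from the Spark write so it can be exercised without a cluster.

```python
import time
from typing import Callable, Dict, Iterable

# Hypothetical table name and location, for illustration only.
TABLE_NAME = "demo_cow_table"
BASE_PATH = "/tmp/hudi/demo_cow_table"

# Assumed minimal write options for a COPY_ON_WRITE upsert.
HUDI_OPTIONS = {
    "hoodie.table.name": TABLE_NAME,
    "hoodie.datasource.write.table.type": "COPY_ON_WRITE",
    "hoodie.datasource.write.operation": "upsert",
    "hoodie.datasource.write.recordkey.field": "id",
    "hoodie.datasource.write.precombine.field": "ts",
    "hoodie.datasource.write.partitionpath.field": "part",
}


def write_one_record(spark, record: Dict) -> None:
    """Run one batch write (one Spark write job) containing a single record."""
    from pyspark.sql import Row  # imported lazily so the loop below runs without Spark

    df = spark.createDataFrame([Row(**record)])
    df.write.format("hudi").options(**HUDI_OPTIONS).mode("append").save(BASE_PATH)


def run_writes(
    records: Iterable[Dict],
    write_fn: Callable[[Dict], None],
    interval_seconds: float = 60.0,
    sleep_fn: Callable[[float], None] = time.sleep,
) -> int:
    """Write records one at a time, sleeping between writes; return the number of writes."""
    jobs = 0
    for i, record in enumerate(records):
        if i > 0:
            sleep_fn(interval_seconds)  # step 2: wait before the next write
        write_fn(record)  # steps 1, 3, ...: one write per record
        jobs += 1
    return jobs


def main() -> None:
    """Steps 0..n: start Spark, do the writes, stop Spark."""
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("hudi-one-record-per-minute").getOrCreate()
    try:
        records = [{"id": i, "ts": int(time.time()) + i, "part": "p0"} for i in range(5)]
        run_writes(records, lambda r: write_one_record(spark, r))
    finally:
        spark.stop()
```

Calling `main()` (with the Hudi Spark bundle on the classpath) would run 5 separate batch writes inside one Spark application, about one minute apart.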
Will each Spark job create a commit? It looks to me as though one Spark job
always creates exactly one commit, no matter how many records it writes.
Is that right?
Thanks!