bithw1 commented on issue #6378:
URL: https://github.com/apache/hudi/issues/6378#issuecomment-1212673797

   > 1. If you are in streaming mode, each miniBatch creates a commit. For 
example, a commit will be created during a checkpoint in the `Flink engine`.
   > 2. If you write in batch mode, and write 1000 records at a time, only one 
commit will be created.
   > 
   > Happy to help.
   
   Thanks for the helpful reply!
   
   For my case of the  COW table and I would like to write 5 records into this 
table, one record per minute, I am using spark, and
   not using streaming,
   
   I am doing is:  
   0. start spark application
   1. kick off 1st spark write job and write one record
   2. sleep 1 minutes
   3. kick off 2nd spark write job and write one record
   ....
   n. stop spark application
   
   Will each spark job create a commit? It looks to me that no matter how many 
records are written by one spark job, it will always create one commit?
   
    Thanks!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to