BruceKellan commented on issue #6378:
URL: https://github.com/apache/hudi/issues/6378#issuecomment-1212674948

   > > 1. If you are in streaming mode, each mini-batch creates a commit. For example, with the Flink engine, a commit is created at each checkpoint.
   > > 2. If you write in batch mode and write 1000 records at a time, only one commit will be created.
   > > 
   > > Happy to help.
   > 
   > Thanks @BruceKellan for the helpful reply! I have a further question. In my case I have a COW table and would like to write 5 records into it, one record per minute. I am using Spark, not streaming.
   > 
   > What I am doing is:
   > 
   > 0. start the Spark application
   > 1. kick off the 1st Spark write job and write one record
   > 2. sleep 1 minute
   > 3. kick off the 2nd Spark write job and write one record
   >    ....
   >    n. stop the Spark application
   > 
   > Will each Spark job create a commit? It looks to me that no matter how many records are written by one Spark job, it will always create one commit?
   > 
   > Thanks!
   
   Yes. In this scenario you are effectively running 5 separate batch Spark writes, so each write job creates its own commit.
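
The scenario above can be sketched roughly as follows in PySpark. This is only an illustration, not code from the issue: the column names (`id`, `name`, `ts`, `region`), the table name, and the helper functions are all hypothetical. The commit counter assumes a local filesystem and the timeline layout in which each completed commit appears as a `<instant>.commit` file directly under `.hoodie`, which may differ between Hudi releases.

```python
import os
import time

# Hypothetical column names, used only for this sketch.
COLUMNS = ["id", "name", "ts", "region"]


def hudi_write_options(table_name):
    """Build Hudi datasource options for an upsert into a COW table."""
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.table.type": "COPY_ON_WRITE",
        "hoodie.datasource.write.operation": "upsert",
        "hoodie.datasource.write.recordkey.field": "id",
        "hoodie.datasource.write.precombine.field": "ts",
        "hoodie.datasource.write.partitionpath.field": "region",
    }


def write_one_record_per_job(spark, base_path, records, pause_seconds=60):
    """Issue one batch write per record, pausing between writes.

    Each save() call is a separate batch write, so each one is expected
    to add one commit to the table's timeline.
    """
    options = hudi_write_options("demo_cow_table")
    for i, record in enumerate(records):
        df = spark.createDataFrame([record], COLUMNS)
        df.write.format("hudi").options(**options).mode("append").save(base_path)
        if i < len(records) - 1:
            time.sleep(pause_seconds)


def count_completed_commits(base_path):
    """Count completed commits on a local path (assumed timeline layout)."""
    timeline_dir = os.path.join(base_path, ".hoodie")
    # Requested and inflight instants have other suffixes and are skipped.
    return sum(1 for name in os.listdir(timeline_dir) if name.endswith(".commit"))
```

With a `SparkSession` that has the Hudi bundle on its classpath, calling `write_one_record_per_job(spark, path, five_records)` against a fresh local path and then `count_completed_commits(path)` should report 5 if every write succeeds, regardless of how many records each write contains.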


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
