huliwuli commented on issue #10716:
URL: https://github.com/apache/hudi/issues/10716#issuecomment-1967593378

   > @huliwuli "insert" operation type should handle merging small files. I see 
you set up small file size limit as 10 MB. can you remove that config (default 
104857600) or increase that and see if that helps.
   
   Not working with EMR 6.15, it still generates 40 files -50 files for one-day 
data.  
   
   "hoodie.copyonwrite.record.size.estimate"  This setting works, however I am 
using MoR type table. Not sure whether it will occur any risks if I use this 
setting on the MoR-type table.
   
   I know bulk_insert works and controls the file size, but is it okay to use 
bulk_insert with append for daily delta data?
   
   My use case is only to insert data into a partition(date),  and sometimes 
need to delete previous data. That's why I use MoR
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to