huliwuli commented on issue #10716: URL: https://github.com/apache/hudi/issues/10716#issuecomment-1967593378
> @huliwuli "insert" operation type should handle merging small files. I see you set up small file size limit as 10 MB. can you remove that config (default 104857600) or increase that and see if that helps. Not working with EMR 6.15, it still generates 40 files -50 files for one-day data. "hoodie.copyonwrite.record.size.estimate" This setting works, however I am using MoR type table. Not sure whether it will occur any risks if I use this setting on the MoR-type table. I know bulk_insert works and controls the file size, but is it okay to use bulk_insert with append for daily delta data? My use case is only to insert data into a partition(date), and sometimes need to delete previous data. That's why I use MoR -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
