cocopc opened a new issue #1737:
URL: https://github.com/apache/hudi/issues/1737


   streamng Job Info:
   table : MOR table, upsert
   partition: Non-partitioned
   params: hoodie.upsert.shuffle.parallelism=1
   batch interval:  120s 
   
   wheh use spark streaming wirte data to hudi, each batch will create a 
parquet file . how to merge these small parquet files? 
   
   
![](http://pcmyp.oss-cn-beijing.aliyuncs.com/markdown/2020-06-15-095032.png?x-oss-process=style/wm_qf)
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to