zk19930911 opened a new issue #654: Many small files
URL: https://github.com/apache/incubator-hudi/issues/654
 
 
   spark-streamning batch is 60s,There are many small files in each batch.
   I can set which parameter to merge these parquet files。
   
   Official website said :: Compaction is also pluggable, which can be extended 
to stitch older, less frequently updated data files to further reduce the total 
number of files.
   
   please help me

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to