zk19930911 opened a new issue #654: Many small files URL: https://github.com/apache/incubator-hudi/issues/654 spark-streamning batch is 60s,There are many small files in each batch. I can set which parameter to merge these parquet files。 Official website said :: Compaction is also pluggable, which can be extended to stitch older, less frequently updated data files to further reduce the total number of files. please help me
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
