Jingsong Lee created FLINK-19345:
------------------------------------

             Summary: Introduce File streaming sink compaction
                 Key: FLINK-19345
                 URL: https://issues.apache.org/jira/browse/FLINK-19345
             Project: Flink
          Issue Type: New Feature
          Components: Table SQL / Runtime
            Reporter: Jingsong Lee
            Assignee: Jingsong Lee
             Fix For: 1.12.0


Users often complain that the file streaming sink writes out many small files. 
Small files slow down file reads, put extra load on the DFS, and can even 
affect its stability.

Target: 
 * Compact all files that the job generates within a single checkpoint.
 * With compaction, users can use a shorter checkpoint interval, down to a few seconds.
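
As a rough illustration of the first target, the sketch below groups the files written during one checkpoint into compaction units of roughly a target size. It is not taken from the design document: the function name plan_compaction, the (path, size) tuples, and the target_size parameter are all hypothetical and do not correspond to any Flink API.

```python
from typing import List, Tuple

# Hypothetical representation: (path, size_in_bytes) for each file
# written during one checkpoint. Not part of any Flink API.
FileInfo = Tuple[str, int]


def plan_compaction(files: List[FileInfo], target_size: int) -> List[List[FileInfo]]:
    """Group one checkpoint's files into units of roughly target_size bytes."""
    units: List[List[FileInfo]] = []
    current: List[FileInfo] = []
    current_size = 0
    # Sort by path so the plan is deterministic if it is recomputed.
    for path, size in sorted(files):
        # Close the current unit if adding this file would exceed the target.
        if current and current_size + size > target_size:
            units.append(current)
            current, current_size = [], 0
        current.append((path, size))
        current_size += size
    if current:
        units.append(current)
    return units
```

Under these assumptions, each returned unit would be merged into one output file, so a shorter checkpoint interval produces more input files per hour but not necessarily more files visible to readers. A file larger than the target ends up alone in its own unit.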

Document: 
https://docs.google.com/document/d/1cdlyoqgBq9yJEiHFBziimIoKHapQiEY2-0Tn8IF6G-c/edit?usp=sharing



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
