Github user zhangminglei commented on the issue:
https://github.com/apache/flink/pull/6075
Hi, Thanks @StephanEwen . For a short checkpoint time that whether lead to
poor compression or not, I will give an actual production test under
compression situation and as an attachment attached to
[FLINK-9407](https://issues.apache.org/jira/browse/FLINK-9407) in the next few
days.
Actually, We having been using this PR (but not polished yet) in our
production environment for a long time. And getting a very nice results ending
that compared to spark streaming. I will put the test results in the form of
attachments to [FLINK-9407](https://issues.apache.org/jira/browse/FLINK-9407)
as a reference. And what you are more concerned about the short checkpoint
interval will lead to poor compression, yea I would agree but we need a test
for it. But we have all known that the longer snapshots interval, the lower
performance impact of asynchronous or synchronous snapshotting we would get.
So, I think people do not inclined to adopt short checkpoint intervals for
getting a high throughput and low latency in production env. For example, in
our calculation of uv jobs, the checkpoint intervals is 30 seconds. Anyway, I
will still give a test results under the compression situation.
I suggest this PR can merge as a temporary solution to reduce the user's
learning curve since some users already had used ```BucketingSink``` in their
project and wants this functionality as we can see in the user mail list a few
days ago. And In a short time, they may not switch to a new sink
```StreamingFileSink ``` . This may be one of the good reasons.
By the way, I will watch
[FLINK-9749](https://issues.apache.org/jira/browse/FLINK-9749) and the subtask
[FLINK-9753](https://issues.apache.org/jira/browse/FLINK-9753) soon and give
more response in the next couple of days.
---