I've added the following fix to the 1.1 branch

 * [FLINK-4875] [metrics] Use correct operator name

It is a crucial fix for streaming topologies that involve multi-chains.
2 users already ran into this.

On 25.10.2016 14:43, Robert Metzger wrote:
+1 for a bugfix release soon.

On Tue, Oct 25, 2016 at 10:53 AM, Stephan Ewen <se...@apache.org> wrote:

Thanks fort starting this Ufuk.

I would like to add the following issues to 1.1.4:

Build errors due to Storm dependencies *(fix pending)*
     - [FLINK-4298] [storm compatibility] Add proper repository for Closure
dependencies.

Stability on S3 considering eventual consistency *(fix pending)*
     - [FLINK-4218] [checkpoints] Do not fail checkpoints when state size
cannot be determined

Avoiding Zombie TaskManagers *(still needs to be done)*
     - [FLINK-3347] [akka] TaskManager (or its ActorSystem) need to restart
in case they notice quarantine

Adding a limit to the amount of data spilled during checkpoint alignments
*(fix
is work in progress)*
     - [FLINK-4904] [checkpoints] Add a limit for how much data may be
spilled in checkpoint alignments


I can push the first two fixes to the 1.1.4 branch in a bit, the fourth one
later today.
The third one (akka) is still pending.

Best,
Stephan



On Mon, Oct 24, 2016 at 3:32 PM, Ufuk Celebi <u...@apache.org> wrote:

Hey all,

I would like to start the discussion for kicking off the next bug fix
release, Flink 1.1.4. What do you think about aiming for a RC by end
of this week?

Users reported some instabilities/inconveniences that would be good to
fix.
Personally, I would like to backport the following fixes:

(1) https://issues.apache.org/jira/browse/FLINK-4619: Answer client if
savepoint restore fails (Already merged for master, needs minimal
adjustment for 1.1)
(2) https://issues.apache.org/jira/browse/FLINK-4715: Safety net for
stuck task cancellation (Already reviewed for master, waiting for
tests to finish of backport)
(3) https://issues.apache.org/jira/browse/FLINK-4510: Always create
CheckpointCoordinator (Already merged for master, needs minimal
adjustments for 1.1)

Furthermore, I would like to address the following:

(4) https://issues.apache.org/jira/browse/FLINK-4445: Add option to
ignore unmatched state when restoring from savepoint
(5) https://issues.apache.org/jira/browse/FLINK-4894: Don't block on
buffer request after broadcast event

Strictly speaking, the (4) is not a bug fix. But given that it would
only add an optional flag to savepoint restoring and should have been
addressed for 1.1.0 already, I would like to get it in.


Reply via email to