@Clarkkkkk thanks for you reply! Sorry for late response. `triggerPartitionProducerStateCheck` called if there is a `PartitionNotFoundException`, that is, producer not found. Please note that former, we ask JM to check producer state, If it is a Timeout Exception, it will try again and assume it's still running; however, now we ALWAYS assume producer is still running and try again. So with the changes we use a loosely fail strategy.
For the single-thread pool, could we just reuse `JobMaster#scheduledExecutorService`? [ Full content available at: https://github.com/apache/flink/pull/6680 ] This message was relayed via gitbox.apache.org for [email protected]
