[ 
https://issues.apache.org/jira/browse/FLINK-11011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16704499#comment-16704499
 ] 

Andrey Zagrebin commented on FLINK-11011:
-----------------------------------------

I suspect that the job is being stopped and the checkpoint coordinator has 
already been shut down, but some task manager is still sending decline/ack 
messages.

A possible solution is to check whether the job is still running before 
logging the error:

 
{code:java}
JobMaster.declineCheckpoint() {
    // ...
    // Only report the missing coordinator while the job is still running.
    if (executionGraph.getState() == JobStatus.RUNNING) {
        log.error("Received DeclineCheckpoint message for job {} with no CheckpointCoordinator",
            jobGraph.getJobID());
    }
}
{code}
 

The same applies to `JobMaster.acknowledgeCheckpoint`.
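
To make the intent of the guard concrete, here is a minimal, self-contained sketch. It does not use the real Flink classes: `CheckpointMessageGuard` is a hypothetical helper, and its nested `JobStatus` enum is a stand-in that lists only a few of the job states. The only logic taken from the proposal above is "log the error only if the job state is RUNNING".

```java
/** Hypothetical helper illustrating the proposed guard; not part of Flink. */
final class CheckpointMessageGuard {

    /** Stand-in for the job status enum; only a subset of states is modelled. */
    enum JobStatus { RUNNING, CANCELLING, CANCELED, FINISHED }

    private CheckpointMessageGuard() {}

    /**
     * Decides whether a decline/ack message that arrives without a checkpoint
     * coordinator should be reported as an error. Late messages are expected
     * once the job is no longer running, so only RUNNING yields true.
     */
    static boolean shouldLogMissingCoordinator(JobStatus state) {
        return state == JobStatus.RUNNING;
    }

    /** Builds the message text in the same shape as the log line quoted in the issue. */
    static String describe(String messageType, String jobId) {
        return "Received " + messageType + " message for job " + jobId
                + " with no CheckpointCoordinator";
    }
}
```

Both the decline and the acknowledge path could share such a check, so that messages arriving after the job has left the RUNNING state no longer show up as errors in the log.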

> Elasticsearch 6 sink end-to-end test unstable
> ---------------------------------------------
>
>                 Key: FLINK-11011
>                 URL: https://issues.apache.org/jira/browse/FLINK-11011
>             Project: Flink
>          Issue Type: Bug
>          Components: E2E Tests
>    Affects Versions: 1.8.0, 1.7.1
>            Reporter: Timo Walther
>            Assignee: Timo Walther
>            Priority: Critical
>
> The log contains errors:
> {code}
> 2018-11-26 12:55:02,363 ERROR org.apache.flink.runtime.jobmaster.JobMaster - 
> Received DeclineCheckpoint message for job 1a7516a04fb0cc85bdb3aa21548bd9bb 
> with no CheckpointCoordinator
> {code}
>  
> See also: https://api.travis-ci.org/v3/job/459693461/log.txt



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
