[
https://issues.apache.org/jira/browse/FLINK-12038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16888482#comment-16888482
]
shuai.xu commented on FLINK-12038:
----------------------------------
This failure can be easily reproduced on my local machine. I enabled the YARN
logs and found the reason. You can find the unregisterAM log entry in
jobmanager.log. When the job finishes, it tries to unregister the AM from YARN.
In fact, it is not necessary to call killApplication, as the whole YARN mini
cluster is shut down in the tearDown of the test case.
The following is part of the job master log:
2019-07-16 18:20:34,376 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source
(1/2) (e13567c7f2d7a389c74f4583a67e34e8) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,376 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Source:
Custom Source (1/2) (attempt #0) to container_1563272405568_0001_01_000002 @
e011239174096.et15sqa (dataPort=42072)
2019-07-16 18:20:34,404 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source
(2/2) (fc3d9d65a75eabaf00d7d9372d2b9884) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,405 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Source:
Custom Source (2/2) (attempt #0) to container_1563272405568_0001_01_000003 @
e011239174096.et15sqa (dataPort=41793)
2019-07-16 18:20:34,405 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (1/2)
(65db57ac7166e0a96a3c5318bb262fb0) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,414 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Sink:
Unnamed (1/2) (attempt #0) to container_1563272405568_0001_01_000003 @
e011239174096.et15sqa (dataPort=41793)
2019-07-16 18:20:34,447 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (2/2)
(22c3e0c0fd37dd00e75fcf855e2a6ca4) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,447 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Sink:
Unnamed (2/2) (attempt #0) to container_1563272405568_0001_01_000002 @
e011239174096.et15sqa (dataPort=42072)
2019-07-16 18:20:34,897 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source
(1/2) (e13567c7f2d7a389c74f4583a67e34e8) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:34,949 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source
(2/2) (fc3d9d65a75eabaf00d7d9372d2b9884) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:35,056 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (1/2)
(65db57ac7166e0a96a3c5318bb262fb0) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:35,067 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (2/2)
(22c3e0c0fd37dd00e75fcf855e2a6ca4) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:35,450 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source
(2/2) (fc3d9d65a75eabaf00d7d9372d2b9884) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,480 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source
(1/2) (e13567c7f2d7a389c74f4583a67e34e8) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,494 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (2/2)
(22c3e0c0fd37dd00e75fcf855e2a6ca4) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,508 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (1/2)
(65db57ac7166e0a96a3c5318bb262fb0) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,513 INFO
org.apache.flink.runtime.executiongraph.ExecutionGraph - Job Flink Streaming
Job (2f9313ea4fd33bef68111ed380a2ae1b) switched from state RUNNING to FINISHED.
2019-07-16 18:20:35,513 INFO
org.apache.flink.runtime.checkpoint.CheckpointCoordinator - Stopping checkpoint
coordinator for job 2f9313ea4fd33bef68111ed380a2ae1b.
2019-07-16 18:20:35,513 INFO
org.apache.flink.runtime.checkpoint.StandaloneCompletedCheckpointStore -
Shutting down
2019-07-16 18:20:35,564 INFO org.apache.flink.runtime.dispatcher.MiniDispatcher
- Job 2f9313ea4fd33bef68111ed380a2ae1b reached globally terminal state FINISHED.
2019-07-16 18:20:35,573 INFO org.apache.flink.runtime.jobmaster.JobMaster -
Stopping the JobMaster for job Flink Streaming
Job(2f9313ea4fd33bef68111ed380a2ae1b).
2019-07-16 18:20:35,664 INFO
org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl - Suspending SlotPool.
2019-07-16 18:20:35,666 INFO org.apache.flink.runtime.jobmaster.JobMaster -
Close ResourceManager connection 165d22977dc31b3b410489789fdc1050: JobManager
is shutting down..
2019-07-16 18:20:35,668 INFO
org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl - Stopping SlotPool.
2019-07-16 18:20:35,668 INFO org.apache.flink.yarn.YarnResourceManager -
Disconnect job manager
[email protected]://[email protected]:50088/user/jobmanager_0
for job 2f9313ea4fd33bef68111ed380a2ae1b from the resource manager.
2019-07-16 18:20:35,681 INFO
org.apache.flink.runtime.jobmaster.JobManagerRunner - JobManagerRunner already
shutdown.
2019-07-16 18:20:36,844 INFO
org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Shutting
YarnJobClusterEntrypoint down with application status SUCCEEDED. Diagnostics
null.
2019-07-16 18:20:36,844 INFO
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint - Shutting down
rest endpoint.
2019-07-16 18:20:36,889 INFO
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint - Removing cache
directory /tmp/flink-web-0b120d31-7130-4fcb-bfa8-699abdd5b81e/flink-web-ui
2019-07-16 18:20:36,890 INFO
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint -
http://e011239174096.et15sqa:38342 lost leadership
2019-07-16 18:20:36,892 INFO
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint - Shut down
complete.
2019-07-16 18:20:36,895 INFO org.apache.flink.yarn.YarnResourceManager - Shut
down cluster because application is in SUCCEEDED, diagnostics null.
2019-07-16 18:20:36,896 INFO org.apache.flink.yarn.YarnResourceManager -
Unregister application from the YARN Resource Manager with final
status SUCCEEDED.
2019-07-16 18:20:36,981 INFO
org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl - Waiting for application
to be successfully unregistered.
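To locate the same entries in your own jobmanager.log, a small filter along these lines may help. This is only a sketch: the function name find_unregister_lines is made up for illustration, the two message fragments are copied from the excerpt above, and other Flink or Hadoop versions may word them differently.

```python
from typing import Iterable, List

# Message fragments copied from the log excerpt above (assumption: your
# Flink/Hadoop version logs the same wording).
UNREGISTER_PATTERNS = (
    "Unregister application from the YARN Resource Manager",
    "Waiting for application to be successfully unregistered",
)


def find_unregister_lines(lines: Iterable[str]) -> List[str]:
    """Return the log lines that mention the application master unregistration."""
    return [
        line.rstrip("\n")
        for line in lines
        if any(pattern in line for pattern in UNREGISTER_PATTERNS)
    ]
```

For example, open jobmanager.log, pass the file object to find_unregister_lines, and print the returned lines. If the "Waiting for application to be successfully unregistered" entry is the last match and nothing follows it, that would be consistent with the stall described here.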
> YARNITCase stalls on travis
> ---------------------------
>
> Key: FLINK-12038
> URL: https://issues.apache.org/jira/browse/FLINK-12038
> Project: Flink
> Issue Type: Bug
> Components: Deployment / YARN, Tests
> Affects Versions: 1.9.0
> Reporter: Chesnay Schepler
> Assignee: shuai.xu
> Priority: Critical
> Labels: test-stability
> Fix For: 1.9.0
>
>
> https://travis-ci.org/apache/flink/jobs/511932978
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)