[
https://issues.apache.org/jira/browse/FLINK-16907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072845#comment-17072845
]
Piotr Nowojski commented on FLINK-16907:
----------------------------------------
[~xinyitong] I think the stack trace that you posted is cut in half on
executing a letter in the mailbox
{code}
synchronized (getCheckpointLock()) {
letter.run(); // <== your stack trace is
blocked here
}
{code}
without this it would be hard to understand what is going on. If the thread is
really stack on trying to execute a letter, not inside the mailbox letter, it
means that something is utterly broken with the process. Please try to provide
the full stack trace. Those things could be also helpful:
* [thread dump|https://access.redhat.com/solutions/18178]
* full logs from task managers and job manager
or attach with a debugger to the blocked task, and figure out on what is this
task blocked.
> did not react to cancelling signal for 30 seconds, but is stuck in method:
> --------------------------------------------------------------------------
>
> Key: FLINK-16907
> URL: https://issues.apache.org/jira/browse/FLINK-16907
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Task
> Affects Versions: 1.9.1
> Environment: Hadoop version: 3.1.1.3.1.0.0-78
> flink-1.9.1
> hdp-3.1.0.0-78-kafka
> Reporter: 辛羿彤
> Priority: Major
> Labels: test
>
> 2020-04-01 16:44:02,516 WARN org.apache.flink.runtime.taskmanager.Task - Task
> 'Source: Custom Source -> Map -> Sink: Unnamed (1/1)' did not react to
> cancelling signal for 30 seconds, but is stuck in method:
> org.apache.flink.streaming.runtime.tasks.SourceStreamTask.runAlternativeMailboxLoop(SourceStreamTask.java:149)
> org.apache.flink.streaming.runtime.tasks.SourceStreamTask.processInput(SourceStreamTask.java:122)
> org.apache.flink.streaming.runtime.tasks.StreamTask.run(StreamTask.java:301)
> org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:406)
> org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:705)
> org.apache.flink.runtime.taskmanager.Task.run(Task.java:530)
> java.lang.Thread.run(Thread.java:748)
> 2020-04-01 16:44:32,517 WARN org.apache.flink.runtime.taskmanager.Task - Task
> 'Source: Custom Source -> Map -> Sink: Unnamed (1/1)' did not react to
> cancelling signal for 30 seconds, but is stuck in method:
> org.apache.flink.streaming.runtime.tasks.SourceStreamTask.runAlternativeMailboxLoop(SourceStreamTask.java:149)
> org.apache.flink.streaming.runtime.tasks.SourceStreamTask.processInput(SourceStreamTask.java:122)
> org.apache.flink.streaming.runtime.tasks.StreamTask.run(StreamTask.java:301)
> org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:406)
> org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:705)
> org.apache.flink.runtime.taskmanager.Task.run(Task.java:530)
> java.lang.Thread.run(Thread.java:748)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)