[
https://issues.apache.org/jira/browse/IGNITE-10482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16723002#comment-16723002
]
Andrey Gura edited comment on IGNITE-10482 at 12/17/18 2:00 PM:
----------------------------------------------------------------
{{FaiureProcessor}} dumps all stacks. This behaviour is switched off on TC in
order to avoid too large logs. See
{{FailureProcessor#IGNITE_DUMP_THREADS_ON_FAILURE}} for more details.
was (Author: agura):
{{FaiureProcessor}} dumps all stacks. This behaviour is switched off on TC in
order to avoid tool large logs. See
{{FailureProcessor#IGNITE_DUMP_THREADS_ON_FAILURE}} for more details.
> Print stacktrace of the blocked thread in failure handler.
> ----------------------------------------------------------
>
> Key: IGNITE-10482
> URL: https://issues.apache.org/jira/browse/IGNITE-10482
> Project: Ignite
> Issue Type: Bug
> Components: cache
> Reporter: Roman Kondakov
> Priority: Major
> Labels: iep-14
>
> If a blocked thread is detected by the error handler, only a detector's
> thread is printed to the log. But it is much more informative to print a
> stacktrace of the blocked thread.
> As shown below, {{disco-event-worker}} thread has detected blocked
> {{sys-stripe-0}} thread. But stacktrace of {{sys-stripe-0}} is not printed
> despite the fact it is of particular interest.
> {noformat}
> [2018-11-29
> 18:50:36,925][ERROR][disco-event-worker-#37%continuous.CacheContinuousQueryOrderingEventTest0%][G]
> Blocked system-critical thread has been detected. This can lead to
> cluster-wide undefined behaviour [threadName=sys-stripe-0, blockedFor=10s]
> [2018-11-29 18:50:36,926][WARN
> ][disco-event-worker-#37%continuous.CacheContinuousQueryOrderingEventTest0%][G]
> Thread
> [name="sys-stripe-0-#1%continuous.CacheContinuousQueryOrderingEventTest0%",
> id=13, state=WAITING, blockCnt=9, waitCnt=3704]
> [2018-11-29
> 18:50:36,927][ERROR][disco-event-worker-#37%continuous.CacheContinuousQueryOrderingEventTest0%][IgniteTestResources]
> Critical system error detected. Will be handled accordingly to configured
> handler [hnd=NoOpFailureHandler [super=AbstractFailureHandler
> [ignoredFailureTypes=SingletonSet [SYSTEM_WORKER_BLOCKED]]],
> failureCtx=FailureContext [type=SYSTEM_WORKER_BLOCKED, err=class
> o.a.i.IgniteException: GridWorker [name=sys-stripe-0,
> igniteInstanceName=continuous.CacheContinuousQueryOrderingEventTest0,
> finished=false, heartbeatTs=1543506626138]]]
> class org.apache.ignite.IgniteException: GridWorker [name=sys-stripe-0,
> igniteInstanceName=continuous.CacheContinuousQueryOrderingEventTest0,
> finished=false, heartbeatTs=1543506626138]
> at
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1833)
> at
> org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1828)
> at
> org.apache.ignite.internal.worker.WorkersRegistry.onIdle(WorkersRegistry.java:233)
> at
> org.apache.ignite.internal.util.worker.GridWorker.onIdle(GridWorker.java:297)
> at
> org.apache.ignite.internal.managers.discovery.GridDiscoveryManager$DiscoveryWorker.body(GridDiscoveryManager.java:2812)
> at
> org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:120)
> at java.lang.Thread.run(Thread.java:748)
> {noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)