[
https://issues.apache.org/jira/browse/IGNITE-11179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ARomantsov updated IGNITE-11179:
--------------------------------
Description:
Start two nodes; one of them fails after the "start caches in recovery mode" block.
{code:java}
[12:47:57,333][SEVERE][tcp-disco-msg-worker-#2][G] Blocked system-critical
thread has been detected. This can lead to cluster-wide undefined behaviour
[threadName=ttl-cleanup-worker, blockedFor=16s]
[12:47:57,334][WARNING][tcp-disco-msg-worker-#2][G] Thread
[name="ttl-cleanup-worker-#45", id=63, state=WAITING, blockCnt=0, waitCnt=1]
[12:47:57,336][SEVERE][tcp-disco-msg-worker-#2][] Critical system error
detected. Will be handled accordingly to configured handler
[hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0,
super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet
[SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]],
failureCtx=FailureContext [type=SYSTEM_WORKER_BLOCKED, err=class
o.a.i.IgniteException: GridWorker [name=ttl-cleanup-worker,
igniteInstanceName=null, finished=false, heartbeatTs=1549187260799]]]
class org.apache.ignite.IgniteException: GridWorker [name=ttl-cleanup-worker,
igniteInstanceName=null, finished=false, heartbeatTs=1549187260799]
at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1833)
at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1828)
at org.apache.ignite.internal.worker.WorkersRegistry.onIdle(WorkersRegistry.java:233)
at org.apache.ignite.internal.util.worker.GridWorker.onIdle(GridWorker.java:297)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.lambda$new$0(ServerImpl.java:2680)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorker.body(ServerImpl.java:7218)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.body(ServerImpl.java:2717)
at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:120)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorkerThread.body(ServerImpl.java:7156)
at org.apache.ignite.spi.IgniteSpiThread.run(IgniteSpiThread.java:62)
{code}
was:
Start two nodes, one of them drop after Start caches in recovery mode
{code:java}
[12:47:57,333][SEVERE][tcp-disco-msg-worker-#2][G] Blocked system-critical
thread has been detected. This can lead to cluster-wide undefined behaviour
[threadName=ttl-cleanup-worker, blockedFor=16s]
[12:47:57,334][WARNING][tcp-disco-msg-worker-#2][G] Thread
[name="ttl-cleanup-worker-#45", id=63, state=WAITING, blockCnt=0, waitCnt=1]
[12:47:57,336][SEVERE][tcp-disco-msg-worker-#2][] Critical system error
detected. Will be handled accordingly to configured handler
[hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0,
super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet
[SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]],
failureCtx=FailureContext [type=SYSTEM_WORKER_BLOCKED, err=class
o.a.i.IgniteException: GridWorker [name=ttl-cleanup-worker,
igniteInstanceName=null, finished=false, heartbeatTs=1549187260799]]]
class org.apache.ignite.IgniteException: GridWorker [name=ttl-cleanup-worker,
igniteInstanceName=null, finished=false, heartbeatTs=1549187260799]
at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1833)
at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1828)
at org.apache.ignite.internal.worker.WorkersRegistry.onIdle(WorkersRegistry.java:233)
at org.apache.ignite.internal.util.worker.GridWorker.onIdle(GridWorker.java:297)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.lambda$new$0(ServerImpl.java:2680)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorker.body(ServerImpl.java:7218)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.body(ServerImpl.java:2717)
at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:120)
at org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorkerThread.body(ServerImpl.java:7156)
at org.apache.ignite.spi.IgniteSpiThread.run(IgniteSpiThread.java:62)
{code}
> One of two nodes fail by handler with message -
> threadName=ttl-cleanup-worker, blockedFor=16s
> ---------------------------------------------------------------------------------------------
>
> Key: IGNITE-11179
> URL: https://issues.apache.org/jira/browse/IGNITE-11179
> Project: Ignite
> Issue Type: Bug
> Components: data structures
> Affects Versions: 2.8
> Reporter: ARomantsov
> Priority: Critical
> Fix For: 2.8
>
>
> Start two nodes; one of them fails after the "start caches in recovery mode" block.
> {code:java}
> [12:47:57,333][SEVERE][tcp-disco-msg-worker-#2][G] Blocked system-critical
> thread has been detected. This can lead to cluster-wide undefined behaviour
> [threadName=ttl-cleanup-worker, blockedFor=16s]
> [12:47:57,334][WARNING][tcp-disco-msg-worker-#2][G] Thread
> [name="ttl-cleanup-worker-#45", id=63, state=WAITING, blockCnt=0, waitCnt=1]
> [12:47:57,336][SEVERE][tcp-disco-msg-worker-#2][] Critical system error
> detected. Will be handled accordingly to configured handler
> [hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0,
> super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet
> [SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]],
> failureCtx=FailureContext [type=SYSTEM_WORKER_BLOCKED, err=class
> o.a.i.IgniteException: GridWorker [name=ttl-cleanup-worker,
> igniteInstanceName=null, finished=false, heartbeatTs=1549187260799]]]
> class org.apache.ignite.IgniteException: GridWorker [name=ttl-cleanup-worker,
> igniteInstanceName=null, finished=false, heartbeatTs=1549187260799]
> at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1833)
> at org.apache.ignite.internal.IgnitionEx$IgniteNamedInstance$2.apply(IgnitionEx.java:1828)
> at org.apache.ignite.internal.worker.WorkersRegistry.onIdle(WorkersRegistry.java:233)
> at org.apache.ignite.internal.util.worker.GridWorker.onIdle(GridWorker.java:297)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.lambda$new$0(ServerImpl.java:2680)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorker.body(ServerImpl.java:7218)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl$RingMessageWorker.body(ServerImpl.java:2717)
> at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:120)
> at org.apache.ignite.spi.discovery.tcp.ServerImpl$MessageWorkerThread.body(ServerImpl.java:7156)
> at org.apache.ignite.spi.IgniteSpiThread.run(IgniteSpiThread.java:62)
> {code}