Hi Andrey, Do see the exception only in logs or your code experiences the exception? It looks like the exception is treated as a failure but actually it is not (as it is normal Ignite node stop). I would like to understand how critical is it for users.
ср, 4 дек. 2019 г. в 19:25, Andrey Davydov <[email protected]>: > > Hello, > > > > Yesterday we got error on node shutdown while test our system: > > > > 2019-12-03 18:49:53,653 [pool-228-thread-1] INFO r.s.d.m.c.m.PoisonPill:54 > - Poison pill works on e45190a6-de52-4e43-b710-1ee8cd1f60a6 > > 2019-12-03 18:49:53,657 [pool-228-thread-1] INFO > r.s.d.m.c.m.ClusterLifecycleBean:61 - IgniteLifecycleBean 331582218 handle > event: BEFORE_NODE_STOP > > 2019-12-03 18:49:53,658 [pool-228-thread-1] INFO > r.s.d.m.c.w.j.JettyStarter:145 - Stop http interface. > > 2019-12-03 18:49:53,658 [pool-228-thread-1] INFO > r.s.d.m.c.w.j.JettyStarter:149 - Http interface stopped. > > 2019-12-03 18:49:53,658 [pool-228-thread-1] INFO > r.s.d.m.c.m.ClusterLifecycleBean:105 - IgniteLifecycleBean 331582218 finish > handle event: BEFORE_NODE_STOP > > [18:49:53] Topology snapshot [ver=4, locNode=a8ee74de, servers=2, clients=0, > state=INACTIVE, CPUs=4, offheap=7.2GB, heap=4.0GB] > > [18:49:53] Coordinator changed [prev=TcpDiscoveryNode > [id=e45190a6-de52-4e43-b710-1ee8cd1f60a6, addrs=[127.0.0.1], > sockAddrs=[/127.0.0.1:47500], discPort=47500, order=1, intOrder=1, > lastExchangeTime=1575398986885, loc=false, ver=2.7.5#20190603-sha1:be4f2a15, > isClient=false], cur=TcpDiscoveryNode > [id=a8ee74de-6d37-4413-9418-08ec903bb974, addrs=[127.0.0.1], > sockAddrs=[/127.0.0.1:47501], discPort=47501, order=2, intOrder=2, > lastExchangeTime=1575398993662, loc=true, ver=2.7.5#20190603-sha1:be4f2a15, > isClient=false]] > > [18:49:53] ^-- Baseline [id=0, size=3, online=2, offline=1] > > [18:49:53] Topology snapshot [ver=4, locNode=67aa9f86, servers=2, clients=0, > state=INACTIVE, CPUs=4, offheap=7.2GB, heap=4.0GB] > > [18:49:53] Coordinator changed [prev=TcpDiscoveryNode > [id=e45190a6-de52-4e43-b710-1ee8cd1f60a6, addrs=[127.0.0.1], > sockAddrs=[/127.0.0.1:47500], discPort=47500, order=1, intOrder=1, > lastExchangeTime=1575398989326, loc=false, ver=2.7.5#20190603-sha1:be4f2a15, > isClient=false], cur=TcpDiscoveryNode > [id=a8ee74de-6d37-4413-9418-08ec903bb974, addrs=[127.0.0.1], > sockAddrs=[/127.0.0.1:47501], discPort=47501, order=2, intOrder=2, > lastExchangeTime=1575398989326, loc=false, ver=2.7.5#20190603-sha1:be4f2a15, > isClient=false]] > > [18:49:53] ^-- Baseline [id=0, size=3, online=2, offline=1] > > 2019-12-03 18:49:53,689 [grid-nio-worker-tcp-comm-1-#1107%TestNode-0%] ERROR > :135 - Critical system error detected. Will be handled accordingly to > configured handler [hnd=StopNodeOrHaltFailureHandler [tryStop=false, > timeout=0, super=AbstractFailureHandler > [ignoredFailureTypes=[SYSTEM_WORKER_BLOCKED, > SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], failureCtx=FailureContext > [type=SYSTEM_WORKER_TERMINATION, err=java.lang.InterruptedException]] > > java.lang.InterruptedException: null > > at > org.apache.ignite.internal.util.nio.GridNioServer$AbstractNioClientWorker.bodyInternal(GridNioServer.java:2158) > ~[ignite-core-2.7.5.jar:2.7.5] > > at > org.apache.ignite.internal.util.nio.GridNioServer$AbstractNioClientWorker.body(GridNioServer.java:1794) > [ignite-core-2.7.5.jar:2.7.5] > > at > org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:120) > [ignite-core-2.7.5.jar:2.7.5] > > at java.lang.Thread.run(Thread.java:748) [?:1.8.0_232] > > 2019-12-03 18:49:54,033 [grid-nio-worker-tcp-comm-1-#1107%TestNode-0%] ERROR > :127 - JVM will be halted immediately due to the failure: > [failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, > err=java.lang.InterruptedException]] > > > > We can’t find any way for reproduce. > > > > Andrey. > > -- Best regards, Ivan Pavlukhin
