AFTER_NODE_STOP state was ignored, because JVM stop. In our case it is not a problem. I think that there are some scenarios when AFTER_NODE_STOP is important.
On Thu, Dec 5, 2019 at 11:05 AM Ivan Pavlukhin <[email protected]> wrote: > Hi Andrey, > > Do see the exception only in logs or your code experiences the > exception? It looks like the exception is treated as a failure but > actually it is not (as it is normal Ignite node stop). I would like to > understand how critical is it for users. > > ср, 4 дек. 2019 г. в 19:25, Andrey Davydov <[email protected]>: > > > > Hello, > > > > > > > > Yesterday we got error on node shutdown while test our system: > > > > > > > > 2019-12-03 18:49:53,653 [pool-228-thread-1] INFO > r.s.d.m.c.m.PoisonPill:54 - Poison pill works on > e45190a6-de52-4e43-b710-1ee8cd1f60a6 > > > > 2019-12-03 18:49:53,657 [pool-228-thread-1] INFO > r.s.d.m.c.m.ClusterLifecycleBean:61 - IgniteLifecycleBean 331582218 handle > event: BEFORE_NODE_STOP > > > > 2019-12-03 18:49:53,658 [pool-228-thread-1] INFO > r.s.d.m.c.w.j.JettyStarter:145 - Stop http interface. > > > > 2019-12-03 18:49:53,658 [pool-228-thread-1] INFO > r.s.d.m.c.w.j.JettyStarter:149 - Http interface stopped. > > > > 2019-12-03 18:49:53,658 [pool-228-thread-1] INFO > r.s.d.m.c.m.ClusterLifecycleBean:105 - IgniteLifecycleBean 331582218 > finish handle event: BEFORE_NODE_STOP > > > > [18:49:53] Topology snapshot [ver=4, locNode=a8ee74de, servers=2, > clients=0, state=INACTIVE, CPUs=4, offheap=7.2GB, heap=4.0GB] > > > > [18:49:53] Coordinator changed [prev=TcpDiscoveryNode > [id=e45190a6-de52-4e43-b710-1ee8cd1f60a6, addrs=[127.0.0.1], sockAddrs=[/ > 127.0.0.1:47500], discPort=47500, order=1, intOrder=1, > lastExchangeTime=1575398986885, loc=false, > ver=2.7.5#20190603-sha1:be4f2a15, isClient=false], cur=TcpDiscoveryNode > [id=a8ee74de-6d37-4413-9418-08ec903bb974, addrs=[127.0.0.1], sockAddrs=[/ > 127.0.0.1:47501], discPort=47501, order=2, intOrder=2, > lastExchangeTime=1575398993662, loc=true, ver=2.7.5#20190603-sha1:be4f2a15, > isClient=false]] > > > > [18:49:53] ^-- Baseline [id=0, size=3, online=2, offline=1] > > > > [18:49:53] Topology snapshot [ver=4, locNode=67aa9f86, servers=2, > clients=0, state=INACTIVE, CPUs=4, offheap=7.2GB, heap=4.0GB] > > > > [18:49:53] Coordinator changed [prev=TcpDiscoveryNode > [id=e45190a6-de52-4e43-b710-1ee8cd1f60a6, addrs=[127.0.0.1], sockAddrs=[/ > 127.0.0.1:47500], discPort=47500, order=1, intOrder=1, > lastExchangeTime=1575398989326, loc=false, > ver=2.7.5#20190603-sha1:be4f2a15, isClient=false], cur=TcpDiscoveryNode > [id=a8ee74de-6d37-4413-9418-08ec903bb974, addrs=[127.0.0.1], sockAddrs=[/ > 127.0.0.1:47501], discPort=47501, order=2, intOrder=2, > lastExchangeTime=1575398989326, loc=false, > ver=2.7.5#20190603-sha1:be4f2a15, isClient=false]] > > > > [18:49:53] ^-- Baseline [id=0, size=3, online=2, offline=1] > > > > 2019-12-03 18:49:53,689 [grid-nio-worker-tcp-comm-1-#1107%TestNode-0%] > ERROR :135 - Critical system error detected. Will be handled accordingly > to configured handler [hnd=StopNodeOrHaltFailureHandler [tryStop=false, > timeout=0, super=AbstractFailureHandler > [ignoredFailureTypes=[SYSTEM_WORKER_BLOCKED, > SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], failureCtx=FailureContext > [type=SYSTEM_WORKER_TERMINATION, err=java.lang.InterruptedException]] > > > > java.lang.InterruptedException: null > > > > at > org.apache.ignite.internal.util.nio.GridNioServer$AbstractNioClientWorker.bodyInternal(GridNioServer.java:2158) > ~[ignite-core-2.7.5.jar:2.7.5] > > > > at > org.apache.ignite.internal.util.nio.GridNioServer$AbstractNioClientWorker.body(GridNioServer.java:1794) > [ignite-core-2.7.5.jar:2.7.5] > > > > at > org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:120) > [ignite-core-2.7.5.jar:2.7.5] > > > > at java.lang.Thread.run(Thread.java:748) [?:1.8.0_232] > > > > 2019-12-03 18:49:54,033 [grid-nio-worker-tcp-comm-1-#1107%TestNode-0%] > ERROR :127 - JVM will be halted immediately due to the failure: > [failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, > err=java.lang.InterruptedException]] > > > > > > > > We can’t find any way for reproduce. > > > > > > > > Andrey. > > > > > > > > -- > Best regards, > Ivan Pavlukhin >
