Hi,
I have embedded Ignite into my application and using it for distributed caches. 
I am running Ignite cluster in my lab environment. I have two nodes in the 
cluster. Time to time I get node segmented event and the node which receives it 
dies abruptly.

The application is registered to three discovery events node_left, node_failed 
and node_segmented. On receiving these events each node checks if it is now 
oldest node is the cluster. This is to check if oldest node has left/failed.

I am also listening to life cycle events. I am interested in before_node_stop 
and after_node_stop events. On receiving these events, I need to stop another 
component of the application.


  1.  What are the reasons of getting node_segmented event?
     *   One reason is obviously network glitch. Node losing connectivity with 
other members
     *   Can high memory usage/long GC pause be reason for segmentation?
     *   Is there a way to get cause of the segmentation?
  2.  After getting node_segmented event, I immediately got before_node_stop 
event. But after_node_stop did not follow. So node was kind of left in some 
inconsistent state. Never recovered from that.
     *   Is it possible that on receiving node_segmented event, when I tried to 
get the oldest node in the cluster caused the node to stop?
Event timeline from both nodes:

Application 1:
08/19/17  10:40:10 :  received node failed event. The event was caused by 
application 2.
08/19/17  10:40:10 :  [10:40:10] Topology snapshot [ver=3, servers=1, 
clients=0, CPUs=32, heap=14.0GB]

Application 2:
08/19/17 10:40:28 : received node segmented event. The event was caused by 
application 2
08/19/17 10:40:28 : Checking if oldest has changed
08/19/17 10:40:28 : Ignite Lifecycle event received: BEFORE_NODE_STOP. Fires 
event to stop another component
08/19/17 10:40:28 : dependent component stops
08/19/17 10:40:28 : received node failed event. The event was caused by 
application 1.
08/19/17 10:40:10 : Topology snapshot [ver=3, servers=1, clients=0, CPUs=32, 
heap=14.0GB]

Thanks,
Biren




--
View this message in context: 
http://apache-ignite-users.70518.x6.nabble.com/Cluster-segmentation-tp16314.html
Sent from the Apache Ignite Users mailing list archive at Nabble.com.

Reply via email to