I don't believe that should be an issue, but it's puzzling that it just stops working.

Can you try switching to unicast to see if that makes a difference? Here is the quickstart document:

https://github.com/ganglia/monitor-core/wiki/Ganglia-Quick-Start
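
As a rough sketch of what the unicast switch could look like in gmond.conf on the DB_CLUS nodes: the multicast `mcast_join` settings are replaced by a `host` entry that points at one collector node. The host name `db-collector.example.com` is a placeholder and the port assumes the default 8649; adjust both to your setup.

```
/* Every DB_CLUS node sends its metrics directly to one collector node. */
/* db-collector.example.com is a placeholder, not a real host. */
udp_send_channel {
  host = db-collector.example.com
  port = 8649
  ttl = 1
}

/* Needed on the collector node only: receive on the same port, */
/* with no mcast_join or bind line. */
udp_recv_channel {
  port = 8649
}

/* In the existing globals section, a non-zero send_metadata_interval */
/* (for example 30) is recommended with unicast, so that metadata is */
/* resent after the collector's gmond restarts. */
```

With this layout, the gmetad data_source line for DB_CLUS would presumably need to point at the collector node rather than at an arbitrary cluster member.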

Vladimir

On 12/07/2016 at 05:59 AM, Vc, Sarathchandran (Nokia - IN/Bangalore) wrote:
 
We have a 100-node cluster monitored by Ganglia. We are facing an issue with the setup, described below.

Here are the details.

Three cluster groups have been created, each monitored through its own multicast IP:
 
Cluster    Multicast IP   Nodes
APP_CLUS   239.2.11.71    10
DB_CLUS    239.2.11.72    80
SUP_CLUS   239.2.11.73    10
 
Once I start the gmond service on all 100 nodes, the Ganglia web page shows the status of every node properly.
After exactly 5 minutes, 70 of the DB_CLUS nodes are reported as dead and critical, even though the nodes are up and running. I changed all the configuration, including the multicast IPs, but the result is the same.
 
If I restart gmond on all the nodes, the same issue comes back after 5 minutes.
 
Is there any limitation in multicast IP or Ganglia? We have a 20-node cluster that works fine without any issues. The only difference between the two setups is that the 100-node cluster uses DNS and the 20-node cluster uses local host names.

_______________________________________________
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general
