On Jan 12, 2011, at 9:39 AM, John Williams wrote:

> I have also taken this one step further by installing our server on a brand
> new Dell R710 with 6x240GB SSD (RAID5). Ganglia is the only thing running on
> the server. I received the same errors after just a few minutes of running.
>
> I have also ran xmllint against the output from port 8651 and it reports no
> errors.
>
> Any help is appreciated.

Are all your data_source groups on separate subnets? Are you using multicast? Your gmetad.conf gives no ports, which means everyone is using 8649, and if the various machines are on the same wire, their data is going to get piled together.

I don't know if gmetad 3.0 or newer is better about this (because I always define ports these days), but in the 2.5 era it would get hopelessly confused if multiple data_sources were using the same port. It may be worth giving each data_source its own port to see if things improve.

The port is what ganglia uses to differentiate clusters. Cluster name, data_source, IP address: none of those matter. If two machines are using the same port to relay metrics, gmetad is going to think they're in the same cluster.

_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general
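
One way to check the point above about data_sources sharing a port is to scan gmetad.conf for it. The following is only a rough sketch in Python, not part of Ganglia: the function names and the sample host names are made up for illustration, and it assumes the usual `data_source "name" [polling interval] host[:port] ...` line layout, with 8649 as the port when none is given. It does not handle IPv6 addresses or other unusual entries.

```python
import re
from collections import defaultdict

DEFAULT_PORT = 8649  # port assumed when a host is listed without one


def ports_by_source(conf_text):
    """Map each port to the set of data_source names that poll it."""
    sources = defaultdict(set)
    for line in conf_text.splitlines():
        line = line.strip()
        # Only data_source lines matter; comments and other settings are skipped.
        match = re.match(r'data_source\s+"([^"]+)"\s*(.*)', line)
        if not match:
            continue
        name, rest = match.groups()
        for token in rest.split():
            if token.isdigit():
                continue  # optional polling interval, not a host
            host, _, port = token.partition(":")
            sources[int(port) if port else DEFAULT_PORT].add(name)
    return sources


def shared_ports(conf_text):
    """Return {port: sorted names} for ports used by more than one data_source."""
    return {
        port: sorted(names)
        for port, names in ports_by_source(conf_text).items()
        if len(names) > 1
    }


# Hypothetical configuration: "web" and "db" both end up on 8649.
example = '''
data_source "web" web1 web2
data_source "db" 30 db1 db2:8649
data_source "batch" batch1:8655
'''

print(shared_ports(example))  # {8649: ['db', 'web']}
```

Any port reported here is a candidate for the mixing described above; moving one of the listed data_sources (and the gmond instances behind it) to its own port would be the thing to try.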

