On Jan 12, 2011, at 9:39 AM, John Williams wrote:

> I have also taken this one step further by installing our server on a brand 
> new Dell R710 with 6x240GB SSD (RAID5). Ganglia is the only thing running on 
> the server. I received the same errors after just a few minutes of running.
> 
> I have also ran xmllint against the output from port 8651 and it reports no 
> errors.
> 
> Any help is appreciated.


Are all your data_source groups on separate subnets? Are you using multicast?

Your gmetad.conf has you giving no ports, which means everyone is using 8649, 
and if the various machines are on the same wire, their data is going to get 
piled together. 

I don't know if gmetad 3.0 or newer is better about this (because I always 
define ports these days), but in the 2.5 era it would get hopelessly confused 
if multiple data_sources were using the same port. It may be worth giving each 
data_source its own port and see if things improve.

The port is what ganglia uses to differentiate clusters. Cluster name, 
data_source, ip address.. doesn't matter. If Two machines are using the same 
port to relay metrics, gmetad is going to think they're in the same cluster.


------------------------------------------------------------------------------
Protect Your Site and Customers from Malware Attacks
Learn about various malware tactics and how to avoid them. Understand 
malware threats, the impact they can have on your business, and how you 
can protect your company and customers by using code signing.
http://p.sf.net/sfu/oracle-sfdevnl
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to