Re: [Ganglia-general] Question about scaling

2012-11-01 Thread Chris Burroughs
flat. From: Nicholas Satterly [nfsatte...@gmail.com] Sent: Thursday, October 25, 2012 15:19 To: Potter,Mark L Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Question about scaling Hi Mark, I wouldn't be so quick to dismiss

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
Hi Mark, I assume cnode340 is the head node that all ~340 other gmond's send their data to. If so, you could reduce the amount of redundant metadata flying around by increasing send_metadata_interval to 120 seconds or higher. That is correct, cnode340 is the head node for ganglia. I have

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
Well things blew up ~184 hosts. The web interface shows a random number of hosts down each refresh, although sometimes there are all up. It reports just ~1 second to download and process the XML: Downloading and parsing ganglia's XML tree took 0.9751s. So I don't think timeouts are the problem.

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
] Question about scaling Hi Mark, I wouldn't be so quick to dismiss timeouts as the problem. The 0.9751s it took to download and parse ganglia's XML tree refers to the time it took the PHP web frontend to query the gmetad XML whereas the timeout's I was referring to occur when the gmetad polls

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Vladimir Vuksan
] Sent: Thursday, October 25, 2012 15:19 To: Potter,Mark L Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Question about scaling Hi Mark, I wouldn't be so quick to dismiss timeouts as the problem. The 0.9751s it took to download and parse ganglia's XML tree refers

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
...@veus.hr] Sent: Thursday, October 25, 2012 16:20 To: Potter,Mark L; Nicholas Satterly Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Question about scaling 60 seconds is likely the problem. I would leave it at default ie 15. I can explain later. Potter,Mark L mlpot

Re: [Ganglia-general] Question about scaling

2012-10-24 Thread Peter Phaal
Subject: Re: [Ganglia-general] Question about scaling Please send thru your gmetad.conf file so we can see how things are configured on the server side. * --Nick. * Be sure to anonymise any sensitive info. On 23 Oct 2012, at 19:21, Potter,Mark L mlpot...@mdanderson.org wrote: I am using what

[Ganglia-general] Question about scaling

2012-10-23 Thread Potter,Mark L
I am using what I think to be a fairly standard gmond.conf: globals { daemonize = yes setuid = yes user = nobody debug_level = 0 max_udp_msg_len = 1472 mute = no deaf = no allow_extra_data = yes host_dmax = 86400 /*secs. Expires (removes from web interface) hosts in 1 day */

Re: [Ganglia-general] Question about scaling

2012-10-23 Thread Nicholas Satterly
Please send thru your gmetad.conf file so we can see how things are configured on the server side. * --Nick. * Be sure to anonymise any sensitive info. On 23 Oct 2012, at 19:21, Potter,Mark L mlpot...@mdanderson.org wrote: I am using what I think to be a fairly standard gmond.conf: globals

Re: [Ganglia-general] Question about scaling

2012-10-23 Thread Potter,Mark L
@lists.sourceforge.net Subject: Re: [Ganglia-general] Question about scaling Please send thru your gmetad.conf file so we can see how things are configured on the server side. * --Nick. * Be sure to anonymise any sensitive info. On 23 Oct 2012, at 19:21, Potter,Mark L mlpot...@mdanderson.org wrote: I

Re: [Ganglia-general] Question about scaling

2012-10-23 Thread Nicholas Satterly
Satterly [nfsatte...@gmail.com] Sent: Tuesday, October 23, 2012 13:33 To: Potter,Mark L Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Question about scaling Please send thru your gmetad.conf file so we can see how things are configured on the server side