Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Vitaly Karasik
It seems like we have different numbers in gmond: HOST NAME=5.5.5.5 IP=5.5.5.5 REPORTED=1168934873 TN=2 TMAX=20 DMAX=0 LOCATION=unspecified GMOND_STARTED=1166534354 .. METRIC NAME=proc_total VAL=185 TYPE=uint32 UNITS= TN=229 TMAX=950 DMAX=0 SLOPE=both SOURCE=gmond/ .. METRIC

Re: [Ganglia-general] PBS Queue visualisation

2007-01-16 Thread Martin Knoblauch
Adam, look at the report/compound graphs in web/graph.php They should basically do what you want. Cheers Martin --- Adam Gray [EMAIL PROTECTED] wrote: I'm running ganglia on a cluster managed with OpenPBS. I have made a few extra metrics for monitoring CPU temp and batch system jobs on each

Re: [Ganglia-general] XML error: no element found at 1

2007-01-16 Thread Martin Knoblauch
Ashutok, you need to do a query if you use port 8562 (the web interface does). What happens if you do telnet localhost 8561. That should give you the complete gmetad XML stream. Is the rrdroot directory writable to the owner of the gmetad process? It should belong to e.g. nobody. This is a

Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Martin Knoblauch
Vitaly, in this case try to run gmond with a debug level higher that 2. Maybe this sheds some light on it. Or, you could add debug statements to the proc_run_func and proc_total_func code. But: first of all show us the output of cat /proc/loadavg on both nodes. cheers Martin --- Vitaly

Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Vitaly Karasik
.5: cat /proc/loadavg 0.04 0.06 0.01 1/185 10512 .6: cat /proc/loadavg 1.03 1.01 1.00 2/441 19965 Oops! I think I'm starting to understand - number of processes on both machines are the same, but number the threads are different. probably gmond counts threads, not processes: .5: ps -ef|wc

Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Martin Knoblauch
Vitaly, gmond on Linux just interprets the fourth filed of /proc/loadavg. The number in front of the slash is the number of running processes, the number following the slash is the total number of processes. Cheers Martin --- Vitaly Karasik [EMAIL PROTECTED] wrote: .5: cat /proc/loadavg

Re: [Ganglia-general] Two similar linux hosts provides different metrics

2007-01-16 Thread Vitaly Karasik
Martin, Thank a lot - now I understand that different number of threads causes this difference. Vitaly -Original Message- From: Martin Knoblauch [mailto:[EMAIL PROTECTED] Sent: Tuesday, January 16, 2007 12:29 PM To: Vitaly Karasik; ganglia-general@lists.sourceforge.net Subject:

[Ganglia-general] HELP : How to interoperate between a web server and a cluster ?

2007-01-16 Thread Shameer Khadar
Dear All, Currently we are developing an Integrated Web Server for protein sequence and structure analysis. We are planning to provide a dedicated cluster to support the server. I need help to set up a web application that can manage the interoperation web server and our cluster. Is this possible

Re: [Ganglia-general] XML error: no element found at 1

2007-01-16 Thread Ashutosh Mahajan
Quoting Martin Knoblauch [EMAIL PROTECTED]: Ashutok, you need to do a query if you use port 8562 (the web interface does). What happens if you do telnet localhost 8561. That should give you the complete gmetad XML stream. thanks for the prompt reply. you meant 8651, rather than 8561?

Re: [Ganglia-general] HELP : How to interoperate between a web server and a cluster ?

2007-01-16 Thread Vladimir
Shameer Khadar wrote: We are planning to provide a dedicated cluster to support the server. I need help to set up a web application that can manage the interoperation web server and our cluster. Is this possible with perl cgi ? For example if the user is submitting a huge set of data for

[Ganglia-general] Ganglia and Shorewall?

2007-01-16 Thread Ian Wootten
Has anyone had any experience in configuring ganglia with the shorewall firewall? Ganglia is successfully multicasting to other hosts on the same network, yet the XML tree is not returned to localhost. When my shorewall firewall is switched off, it all works brilliantly. My rules for

Re: [Ganglia-general] VirtMem Utilization

2007-01-16 Thread Matthias Blankenhaus
On Mon, 15 Jan 2007, matt massie wrote: On Thu, 2007-01-11 at 17:31 -0800, Matthias Blankenhaus wrote: Hello ! I have read through the Paper The ganglia distributed monitoring system... (Massie, Chun, Culler) from 2004 and found lots of extremely valuable information. I am

Re: [Ganglia-general] XML error: no element found at 1

2007-01-16 Thread Martin Knoblauch
Hi Ashutosh, sorry for the wrong port. I meant of course 8651. You could try to run gmetad with a high debug level. This could help to track down the problem. Also, could you please post the gmetad.conf file? Cheers Martin --- Ashutosh Mahajan [EMAIL PROTECTED] wrote: Quoting Martin