Hello,

here is a strange occurence. I have two (infact, more than two, but
let's consider just a pair) identical servers running identical setups
- identical OS, identical gmond with identical config files, identical
disks, identical everything. However, one of those servers is
perfectly well, and another one has trouble reporting default metrics.

Here's what the "normal" one shows in node view:

xx.xx.xx.172

Location: Unknown
Cluster local time Wed Apr 27 19:05:32 2011
Last heartbeat received 5 seconds ago.
Uptime 9 days, 9:22:38
Load:   0.00    0.00    0.00
1m      5m      15m

CPU Utilization:        0.1     0.2     99.7
user    sys     idle
Hardware
CPUs: 4 x 1.95 GHz
Memory (RAM): 7.80 GB
Local Disk: Using 16.532 of 142.835 GB
Most Full Disk Partition: 11.6% used.   Software
OS: Linux 2.6.18-238.9.1.el5 (x86_64)
Booted: April 18, 2011, 9:42 am
Uptime: 9 days, 9:22:38
Swap: Using 0.0 of 12001.6 MB swap.


and here's what the "problem one" shows:

xx.xx.xx.171

Location: Unknown
Cluster local time Wed Apr 27 19:07:32 2011
Last heartbeat received 10 seconds ago.
Uptime 9 days, 9:20:01
Load:   0.00    0.00    0.00
1m      5m      15m

CPU Utilization:        0.1     0.2     99.7
user    sys     idle
Hardware
CPUs: 4 x 1.95 GHz
Memory (RAM): 7.80 GB
Local Disk: Unknown
Most Full Disk Partition: 6.2% used.    Software
OS: Linux 2.6.18-238.9.1.el5 (x86_64)
Booted: April 18, 2011, 9:47 am
Uptime: 9 days, 9:20:01
Swap: Using 12001.6 of 12001.6 MB swap.



both are running gmond 3.1.7 and talk to a third host which also runs
gmond 3.1.7 (which is getting polled by the web frontend host with
gmetad 3.1.7)

at a glance, there's something confusing gmond on the problem server,
so it mismatches disk partitions, or something.

as a result, the problem node reports not all of the default metrics,
and those it does are somewhat off-kilter, as you can see (unknown
local disk?)

Any idea what might be going wrong and/or how to pinpoint the problem?

-- 
Michael Bravo

------------------------------------------------------------------------------
WhatsUp Gold - Download Free Network Management Software
The most intuitive, comprehensive, and cost-effective network 
management toolset available today.  Delivers lowest initial 
acquisition cost and overall TCO of any competing solution.
http://p.sf.net/sfu/whatsupgold-sd
_______________________________________________
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to