Dear All,

I am getting huge numbers of errors like the following in my
/var/log/messages file, and in the debug output from gmetad:

Nov 24 13:34:54 behemoth /usr/sbin/gmetad[11186]: RRD_update
(/var/lib/ganglia/rrds/NEMO cluster @ POL/nemo.beowulf.cluster/CPU2_Vcore.rrd):
reading the cookie off /var/lib/ganglia/rrds/NEMO cluster @
POL/nemo.beowulf.cluster/CPU2_Vcore.rrd faild

The errors all relate to one particular cluster and " __SummaryInfo__ ".
The cluster in question just happens to be the one with the corrupted Web
frontend view described in this thread from last October:
http://sourceforge.net/mailarchive/message.php?msg_name=200710301958.30971.dab%40mail.nerc-essc.ac.uk

The errors do not say that the rrd files do not exist or are not readable,
and indeed the files are there and the cluster is displayed on the Web
frontend (albeit mixing up nodes from other clusters - see previous posting
on the subject and this URL:
http://behemoth.nerc-essc.ac.uk/ganglia_3.1.1/?c=NEMO%20cluster%20%40%20POL&m=load_one&r=hour&s=descending&hc=4).
The partition where /var/lib/ganglia is located is not full.
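
A rough way to check the files themselves, rather than only their existence
and permissions, is sketched below. It rests on two assumptions that are not
confirmed anywhere in this thread: that the "cookie" in the message refers to
the header at the start of each RRD file, and that a healthy file begins with
the four bytes "RRD" followed by a NUL. The function name find_suspect_rrds is
made up for illustration. The script walks the rrds directory and lists files
that are empty, unreadable, or start with something else.

```python
import os

# Assumption: a healthy RRD file starts with these four bytes.
RRD_MAGIC = b"RRD\x00"


def find_suspect_rrds(root):
    """Return (path, reason) pairs for .rrd files under root that look damaged."""
    suspects = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            if not name.endswith(".rrd"):
                continue
            path = os.path.join(dirpath, name)
            try:
                # Only the first few bytes are needed for this check.
                with open(path, "rb") as handle:
                    head = handle.read(len(RRD_MAGIC))
            except OSError as exc:
                suspects.append((path, "unreadable: %s" % exc.strerror))
                continue
            if not head:
                suspects.append((path, "empty file"))
            elif head != RRD_MAGIC:
                suspects.append((path, "unexpected header bytes"))
    return suspects


if __name__ == "__main__":
    # Path taken from the log message above; adjust for other installations.
    for path, reason in find_suspect_rrds("/var/lib/ganglia/rrds"):
        print("%s: %s" % (path, reason))
```

A file that passes this check can still be damaged further in, so an empty
report only rules out the simplest kinds of corruption. It would need to be
run as a user that can read the rrds directory.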

I am using the following versions:
gmetad 3.1.1 (I had the same problem with 3.0.7)
Web frontend 3.1.1 (I had the same problem with 3.0.7)
rrdtool 1.1.12 (from a SuSE RPM)

Does anybody have any suggestions?  I would like to get rid of these errors
because they make it hard to spot other problems in /var/log/messages, and I
hope that fixing them also leads to a solution to the problem with the Web
frontend's view of that particular cluster.

Regards,
Dan Bretherton

-- 
Mr. D.A. Bretherton
Reading e-Science Centre
Environmental Systems Science Centre
Harry Pitt Building
3 Earley Gate
University of Reading
Reading, RG6 6AL
UK

Tel. +44 118 378 7722
Fax: +44 118 378 6413
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general
