Hi there,

I just installed Clic Linux on a number of computers here and I'm having
trouble with the Ganglia package.  The installation instructions say
that I should just be able to point my web browser at the master node
and I will see stats about my cluster.  Instead I get this error:
There was an error collecting gmetad data: XML error: no element found at 1

I've been trying to figure out what's wrong and it seems like gmetad's
child process is crashing when it tries to start up.  Running the ps
command shows these processes:
root     13869 95.8  0.4  6136 4240 ?        R    13:57   3:10 /usr/bin/perl -Tw /usr/sbin/gmetad
root     13870  0.0  0.0     0    0 ?        Z    13:57   0:00 [gmetad <defunct>]

And top shows gmetad using 99.9% of one of my processors.  Perhaps
that's just normal though?

Looking inside /var/log/ganglia/gmetad/gmetad.log I see this:
no element found at line 1, column 0, byte 0 at /usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi/XML/Parser.pm line 185
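
As far as I can tell, that message comes from expat, the parser underneath
XML::Parser, and it is what expat reports when it is handed no data at all,
which would fit "byte 0".  Here is a minimal sketch of that behaviour, using
Python's expat binding instead of Perl purely for convenience.  The function
name parse_error_for and the sample document are made up for illustration and
are not taken from gmetad:

```python
import xml.parsers.expat


def parse_error_for(data: bytes):
    """Return expat's error message for `data`, or None if it parses cleanly."""
    parser = xml.parsers.expat.ParserCreate()
    try:
        # The second argument marks this as the final (and only) chunk.
        parser.Parse(data, True)
    except xml.parsers.expat.ExpatError as err:
        return str(err)
    return None


# Empty input: expat never sees a root element.
print(parse_error_for(b""))

# A tiny well-formed document (illustrative only) parses without error.
print(parse_error_for(b"<GANGLIA_XML></GANGLIA_XML>"))
```

If the empty case produces the same "no element found" wording, that would
suggest gmetad is passing an empty string to the parser rather than choking
on malformed XML.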

By adding a few print statements to the gmetad script, I've found that
the child process is crashing inside the collect_data function, at
line 269:
$p->parse($xml_data);

The function looks like it is supposed to collect data for each source,
but I believe I only have one source, as my /etc/gmetad_sources contains
only this line:
CLIC 10.0.0.199 8649
which is the internal IP address of my master node.

Running lsof shows that several instances of gmond are up and running
and listening on port 8649.  The Perl script never fails to connect; it
just never gets past the parse step.
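
To separate "gmond sends nothing" from "gmond sends something that doesn't
parse", a small standalone check could read whatever comes back on that port
and report its size and whether it is well-formed.  This is only a sketch in
Python (not the gmetad code), and it assumes gmond writes its XML and then
closes the connection.  The names fetch_xml and describe are made up for
illustration:

```python
import socket
import xml.parsers.expat


def fetch_xml(host: str, port: int, timeout: float = 5.0) -> bytes:
    """Connect, read until the peer closes the connection, return raw bytes."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            chunk = sock.recv(4096)
            if not chunk:  # empty read means the peer closed the socket
                break
            chunks.append(chunk)
    return b"".join(chunks)


def describe(data: bytes) -> str:
    """Say whether the reply is empty, well-formed XML, or broken XML."""
    if not data:
        return "empty reply (0 bytes)"
    parser = xml.parsers.expat.ParserCreate()
    try:
        parser.Parse(data, True)
    except xml.parsers.expat.ExpatError as err:
        return f"{len(data)} bytes, not well-formed: {err}"
    return f"{len(data)} bytes, well-formed XML"
```

On the master node this would be called as
describe(fetch_xml("10.0.0.199", 8649)), using the host and port from
/etc/gmetad_sources.  An "empty reply" result would point at gmond rather
than at the parser.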

One thing that strikes me as really odd, though, is that lsof also shows
multiple connections from gmond on my master node to the IP address
239.2.11.71 on port 8649.  I'm kinda worried I've got some sort of trojan.
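
One thing I can at least check is whether that address is an ordinary host
address at all.  Here is a small sketch, assuming Python's standard ipaddress
module (nothing to do with Ganglia itself), that tests whether it falls in
the IPv4 multicast range:

```python
import ipaddress

# The address lsof reported for the gmond connections.
addr = ipaddress.ip_address("239.2.11.71")

# True when the address lies in 224.0.0.0/4, the IPv4 multicast block.
print(addr.is_multicast)

# True when it lies in 239.0.0.0/8, the administratively scoped part of it.
print(addr in ipaddress.ip_network("239.0.0.0/8"))
```

If both checks report True, the address is a multicast group rather than a
single remote machine, so this may just be gmond talking to the other nodes.
I haven't confirmed that against the gmond configuration, though.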

So anyway, if anyone can shed some light on my problems, it would be
greatly appreciated.

Oh, and this might help: here are the relevant RPMs that are installed
on my master node:

ganglia-monitor-gmetad-0.1.1-8mdk
ganglia-monitor-core-2.4.1-10mdk
ganglia-monitor-script-2.4.1-10mdk
libganglia-monitor1-2.4.1-10mdk
ganglia-monitor-gmetad-web-frontend-0.1.1-2mdk


Thanks for your time,

Glen Kaukola
CE-CERT
UC Riverside

