Hi folks,
I am trying to configure ganglia to monitor our cluster but I am experiencing a 
lot of problems.   Initially, after a lot of compilation for different packages 
(rrdtool, libart, freetype, and, of course, ganglia) I have the php application 
running but the information of each node is not loaded correctly on the 
graphics.  All the graphics are empty!
When I look at the apache logfiles, I got these messages:
server# tail -f /usr/local/apache2/logs/access_log
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-102.data&l=e2ecff&v=0.02&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6274
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-100.data&l=e2ecff&v=0.02&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6224
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-104.data&l=e2ecff&v=0.02&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6274
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-106.data&l=e2ecff&v=0.02&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6325
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-109.data&l=e2ecff&v=0.02&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6332
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-107.data&l=e2ecff&v=0.02&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6251
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-099.data&l=e2ecff&v=0.02&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6300
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-087.data&l=e2ecff&v=0.01&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6327
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-105.data&l=e2ecff&v=0.01&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6247
10.1.2.41 - - [08/Nov/2006:11:02:34 -0800] "GET 
/ganglia/graph.php?m=load_one&z=small&c=cerebro&h=cerebro-A-110.data&l=e2ecff&v=0.01&x=0&n=0&r=hour&st=1163012545
 HTTP/1.1" 200 6176

server #  tail -f /usr/local/apache2/logs/error_log
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-023.data/load_one.rrd': 
No such file or directory
ERROR: This RRD was created on other architecture
ERROR: This RRD was created on other architecture
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-029.data/load_one.rrd': 
No such file or directory
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-030.data/load_one.rrd': 
No such file or directory
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-031.data/load_one.rrd': 
No such file or directory
ERROR: This RRD was created on other architecture
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-035.data/load_one.rrd': 
No such file or directory
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-036.data/load_one.rrd': 
No such file or directory
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-038.data/load_one.rrd': 
No such file or directory
ERROR: This RRD was created on other architecture
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-040.data/load_one.rrd': 
No such file or directory
ERROR: opening '/var/lib/ganglia/rrds/cerebro/cerebro-B-041.data/load_one.rrd': 
No such file or directory

I don't know exactly what means "This RRD was created on other architecture" 
because as I know, I compiled the rrdtool following the instructions on 
http://apstc.sun.com.sg/downloads/s10/README/rrdtool-1.2.11-sol10-x86.txt

Also, I don't know why most of the information is not generated by the server: 
i.e.: ERROR: opening 
'/var/lib/ganglia/rrds/cerebro/cerebro-B-041.data/load_one.rrd': No such file 
or directory

Looking the status of "Hosts up" and "Hosts down", the application is not 
working properly because for sure all my nodes are up and running but the 
application is reporting that most of them are down:
CPUs Total:     74
Hosts up:       38
Hosts down:     230

We have 306 nodes and are reported only 268 (230 down).

For sure I am running on each node gmond but also I am running, and I know I 
don't need it, gmetad.   On the server I am running both, gmond and gmetad.


Could somebody help me to figure out how to solve my problem?   I appreciate in 
advance.
- Hugo
 
 
This message posted from opensolaris.org
_______________________________________________
opensolaris-discuss mailing list
[email protected]

Reply via email to