We still have no luck with gmetad reporting any data from the
cluster. We have also tried to get to gmetad's log file, but we have
no luck with it even creating the file. We believe the problem lies
somewhere with the cause of this debug message:
The remote machine closed connection for [VisLab Mini Cluster] data
source after 1023 bytes read
Would anyone have any ideas that would help us along with this?
-Chris
On Jan 20, 2006, at 4:33 PM, Christopher Waters wrote:
After a fresh install of Ganglia, I am unable to get any of the
data from the gmond's to gmetad. When polled for XML, gmetad
returns an XML file with an empty <GRID> block; not only that, the
RRD folder remains empty.
All programs are compiled/run on OS X 10.4.4 (server on monitor,
non-server on nodes). We're using the same configuration that we
used when installing Ganglia last summer, which worked just fine.
I'm at a loss as to what could be wrong.. my only idea is that
gmetad isnt getting the data from the gmond running on the same
machine.
For clarity's sake, here are my .conf files (summarized):
gmond.conf (all machines):
cluster {
name = "VisLab Mini Cluster"
owner = "MSU CSE Dept."
}
udp_send_channel {
mcast_join = 239.2.11.71
port = 8649
}
udp_recv_channel {
mcast_join = 239.2.11.71
bind = 239.2.11.71
port = 8649
}
tcp_accept_channel {
port = 8649
}
gmetad.conf (monitor only):
data_source "VisLab Mini Cluster" localhost
gridname "VisLab Cluster"
all_trusted on
setuid on
setuid_username "nobody"
rrd_rootdir "/var/ganglia/rrds" # permissions are nobody:nogroup
After starting gmond on all nodes, I start gmond and gmetad on the
monitor server as follows:
# gmond
# gmetad -d1
Sources are ...
Source: [VisLab Mini Cluster, step 15] has 1 sources
127.0.0.1
Data thread 25177600 is monitoring [VisLab Mini Cluster] data source
127.0.0.1
The remote machine closed connection for [VisLab Mini Cluster] data
source after 1023 bytes read
The remote machine closed connection for [VisLab Mini Cluster] data
source after 1023 bytes read
... (repeats)...
Polling gmond/gmetad on the monitor server:
# telnet localhost 8649
...(large xml output with -all- node data)...
Connection closed by foreign host.
# telnet localhost 8651
...(long doctype header)...
<GANGLIA_XML VERSION="3.0.2" SOURCE="gmetad">
<GRID NAME="VisLab Cluster" AUTHORITY="http://
mini.vislab.cse.msstate.edu/ganglia/" LOCALTIME="1137795457">
</GRID>
</GANGLIA_XML>
Connection closed by foreign host.
Any help with this situation would be appreciated.
-Christopher Waters
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through
log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD
SPLUNK!
http://sel.as-us.falkag.net/sel?
cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general