We are still having no luck getting gmetad to report any data from the cluster. We also tried to look at gmetad's log file, but gmetad never even creates it. We believe the problem is related to whatever causes this debug message: The remote machine closed connection for [VisLab Mini Cluster] data source after 1023 bytes read

Would anyone have any ideas that would help us along with this?

 -Chris


On Jan 20, 2006, at 4:33 PM, Christopher Waters wrote:

After a fresh install of Ganglia, I am unable to get any data from the gmond daemons into gmetad. When polled for XML, gmetad returns an XML document with an empty <GRID> block, and the RRD folder remains empty.

All programs are compiled and run on OS X 10.4.4 (server on the monitor, non-server on the nodes). We're using the same configuration that we used when installing Ganglia last summer, which worked just fine. I'm at a loss as to what could be wrong. My only idea is that gmetad isn't getting the data from the gmond running on the same machine.

For clarity's sake, here are my .conf files (summarized):
gmond.conf (all machines):
 cluster {
   name = "VisLab Mini Cluster"
   owner = "MSU CSE Dept."
 }
 udp_send_channel {
   mcast_join = 239.2.11.71
   port       = 8649
 }
 udp_recv_channel {
   mcast_join = 239.2.11.71
   bind       = 239.2.11.71
   port       = 8649
 }
 tcp_accept_channel {
   port = 8649
 }

gmetad.conf (monitor only):
 data_source "VisLab Mini Cluster" localhost
 gridname "VisLab Cluster"
 all_trusted on
 setuid on
 setuid_username "nobody"
 rrd_rootdir "/var/ganglia/rrds" # permissions are nobody:nogroup

After starting gmond on all nodes, I start gmond and gmetad on the monitor server as follows:
# gmond
# gmetad -d1
Sources are ...
Source: [VisLab Mini Cluster, step 15] has 1 sources
        127.0.0.1
Data thread 25177600 is monitoring [VisLab Mini Cluster] data source
        127.0.0.1
The remote machine closed connection for [VisLab Mini Cluster] data source after 1023 bytes read
The remote machine closed connection for [VisLab Mini Cluster] data source after 1023 bytes read
... (repeats)...

Polling gmond/gmetad on the monitor server:
# telnet localhost 8649
...(large xml output with -all- node data)...
Connection closed by foreign host.

# telnet localhost 8651
...(long doctype header)...
<GANGLIA_XML VERSION="3.0.2" SOURCE="gmetad">
<GRID NAME="VisLab Cluster" AUTHORITY="http://mini.vislab.cse.msstate.edu/ganglia/" LOCALTIME="1137795457">
</GRID>
</GANGLIA_XML>
Connection closed by foreign host.
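
The two telnet checks above can also be scripted, which makes it easier to compare how many bytes each port actually returns against the "1023 bytes read" that gmetad reports. This is only a minimal sketch in Python: the function names read_all and summarize are made up for illustration, and the only values taken from the setup above are localhost and ports 8649 (gmond) and 8651 (gmetad).

```python
import socket
import xml.etree.ElementTree as ET


def read_all(host="localhost", port=8649, timeout=10.0):
    """Read from a TCP port until the remote side closes the connection."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            chunk = sock.recv(4096)
            if not chunk:  # an empty read means the peer closed the connection
                break
            chunks.append(chunk)
    return b"".join(chunks)


def summarize(xml_bytes):
    """Return byte count, well-formedness, and CLUSTER/HOST element counts."""
    summary = {"bytes": len(xml_bytes), "well_formed": False,
               "clusters": 0, "hosts": 0}
    try:
        root = ET.fromstring(xml_bytes)
    except ET.ParseError:
        # Truncated or otherwise broken XML ends up here.
        return summary
    summary["well_formed"] = True
    summary["clusters"] = sum(1 for _ in root.iter("CLUSTER"))
    summary["hosts"] = sum(1 for _ in root.iter("HOST"))
    return summary
```

Calling summarize(read_all("localhost", 8649)) and summarize(read_all("localhost", 8651)) on the monitor server would show whether gmond hands back a complete document with every host in it, and whether gmetad's own output contains any CLUSTER or HOST elements at all.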

Any help with this situation would be appreciated.

 -Christopher Waters


_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

