Hi Alex,

In my case I only have head nodes, since we just have a handful of servers, each running a lot of services, and there's no use in putting two of them together in a cluster.

In general, the IP for the head nodes is the server's IP for udp_send_channel, udp_recv_channel and tcp_accept_channel. The gmetad is configured for the correct IP and port. The web config is configured for the gmetad's IP/xml_port correctly, so these 3 points are definitely okay.

As I understand it, the web frontend should get all its information from the gmetad's XML. The XML looks fine, but the website mixes up the clusters and hosts. Generally I am pretty puzzled by the strange behaviour of the web frontend. Is there any way of debugging this part more closely? The gmond/gmetad don't show any errors even in debug mode.

Best regards
Manuel

Tuesday, December 21, 2010, 11:22:30 PM, you wrote:

AD> There are a few places you can verify that everything is matching
AD> up. Everything I'm writing here is for unicast. I'm not sure how
AD> much of the gmond info applies to multicast, but the gmetad & web
AD> parts should be the same for uni/multicast.

AD> Unicast clusters have normal nodes & head nodes. A head node is
AD> one which knows the state of all other nodes in the cluster, and
AD> can be polled by gmetad. Just for redundancy and clarity and
AD> redundancy: A head node is also a 'normal' node, in that it knows
AD> its own state as well as others'.

AD> 1. normal gmond udp_send_channel (host&port) must match a head-node udp_recv_channel.
AD> 2. gmetad's data_source must match a head-node gmond tcp_accept_channel.
AD> 3. web's conf.php $ganglia_ip/$ganglia_port must match gmetad's xml_port.

AD> Your XML looks like 1 & 2 are true, but I'd still double-check. What about #3?

AD> I've never seen the symptoms you're describing, so I'm guessing
AD> to some degree. You might try shutting down all the daemons, and
AD> bringing up just 1 gmond & gmetad. See how that looks, then bring
AD> up more gmonds one at a time, and see where things start to fail.
AD> Check gmond & gmetad XML output at every stage, as well as the web.

AD> alex

AD> On Dec 21, 2010, at 3:52 PM, Tarabas wrote:

>> Hi Bernard,
>>
>> I am setting it up with unicast. The structure is as follows:
>>
>> server-A - Port 8649 gmond/gmetad
>> server-B - Port 8661 gmond
>>
>> Both clusters have just one host each (same server).
>>
>> Gmetad on Server-A collects data from Server-A and Server-B gmond.
>>
>> data_source "server-A" <ip-server-A>:8649
>> data_source "server-B" <ip-server-B>:8660
>>
>> The XML from the gmetad looks like this, which in my view looks okay:
>>
>> [...]
>> <GANGLIA_XML VERSION="3.1.7" SOURCE="gmetad">
>> <GRID NAME="mediaskill" AUTHORITY="http://smurfette/ganglia/"
>> LOCALTIME="1292967617">
>> <CLUSTER NAME="server-A" LOCALTIME="1292967606" OWNER="mediaskill"
>> LATLONG="unspecified" URL="unspecified">
>> <HOST NAME="smurfette" IP="<ip-smurfette>"
>> REPORTED="1292967599" TN="18" TMAX="20" DMAX="0" LOCATION="Berlin"
>> GMOND_STARTED="1292965419">
>> [...]
>> </HOST>
>> </CLUSTER>
>> <CLUSTER NAME="server-B" LOCALTIME="1292967609" OWNER="mediaskill"
>> LATLONG="unspecified" URL="unspecified">
>> <HOST NAME="eva" IP="<ip-eva>" REPORTED="1292967606" TN="11"
>> TMAX="20" DMAX="0" LOCATION="Berlin" GMOND_STARTED="1292963565">
>> [...]
>> </HOST>
>> </CLUSTER>
>> </GRID>
>> </GANGLIA_XML>
>>
>> The web frontend always defaults to Server-A and is not able to
>> correctly display any other server (B, C, D), which I also added in the
>> same manner with increasing port numbers starting at 8660.
>>
>> I did not see any errors with debug enabled in any of the gmonds or the
>> gmetad ... only the web interface seems to have some problems
>> displaying the hosts. I configured it to the 8651 port of the gmetad
>> on localhost.
>>
>> Best regards
>> Manuel
>>
>>
>> Tuesday, December 21, 2010, 10:10:08 PM, you wrote:
>>
>> BL> Hi Manuel:
>>
>> BL> Can you please clarify whether you are trying to set up Ganglia with
>> BL> unicast or multicast?
>>
>> BL> You can also get more troubleshooting information by running
>> BL> gmetad/gmond in debug mode (-d 2) and looking at your Apache error
>> BL> logs.
>>
>> BL> Cheers,
>>
>> BL> Bernard
>>
>>
>> ------------------------------------------------------------------------------
>> Forrester recently released a report on the Return on Investment (ROI) of
>> Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even
>> within 7 months. Over 3 million businesses have gone Google with Google
>> Apps:
>> an online email calendar, and document program that's accessible from your
>> browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew
>> _______________________________________________
>> Ganglia-general mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/ganglia-general

Regards,
Manuel
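
For anyone wanting to follow Alex's advice to check the gmond and gmetad XML output at every stage, here is a minimal Python sketch. It relies on gmond's tcp_accept_channel and gmetad's xml_port both writing their XML dump to any client that connects and then closing the connection. The function names (fetch_xml, clusters_and_hosts, report) are made up for this example and are not part of Ganglia. It only lists which hosts appear under which cluster, so the result can be compared against what the web frontend shows.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_xml(host, port, timeout=5.0):
    """Read the XML dump that gmond/gmetad write to a client on connect."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(65536)
            if not data:  # the daemon closes the connection after the dump
                break
            chunks.append(data)
    return b"".join(chunks)


def clusters_and_hosts(xml_bytes):
    """Map each CLUSTER name to the sorted HOST names reported inside it."""
    root = ET.fromstring(xml_bytes)
    return {
        cluster.get("NAME"): sorted(h.get("NAME") for h in cluster.iter("HOST"))
        for cluster in root.iter("CLUSTER")
    }


def report(sources):
    """Print the cluster/host layout seen at each (label, host, port)."""
    for label, host, port in sources:
        try:
            layout = clusters_and_hosts(fetch_xml(host, port))
        except (OSError, ET.ParseError) as exc:
            print(f"{label} {host}:{port} -> FAILED: {exc}")
            continue
        print(f"{label} {host}:{port}")
        for cluster, hosts in layout.items():
            print(f"  cluster {cluster!r}: {hosts}")
```

A call such as report([("gmetad", "localhost", 8651), ("gmond", "<ip-server-A>", 8649)]), with the placeholder replaced by a real address, would print one block per daemon. A gmond that fails to answer on the port named in its data_source line, or a cluster that shows up with the wrong hosts, narrows down where the mismatch is before looking at the web frontend itself.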

