Michael,

Use different multicast addresses for each cluster, unless you are sure the multicast can't leak from one cluster to another.

Remember that the hosts you list after a data_source name in gmetad.conf are there for resilience only. You do not have to mention all nodes in the cluster there. Given your symptoms, it might be something else.

I suggest you consider using unicast initially rather than multicast until you get everything going (a udp_send_channel in gmond.conf pointing to a nominated head node on each cluster, then a data_source that polls that head node).

Netcatting hosts can also be very instructive (e.g. nc lsora1006 8649). Are all expected hosts listed in the nc output? Are there unexpected hostnames? (gmond does a reverse DNS lookup to make hostnames.) Is the cluster name returned by nc different for every cluster? (The cluster name in gmetad.conf is not used.)

Good luck,

Richard Grevis
Production Architecture
Barclays Capital, Canary Wharf, London, E14 4BB

> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED] On Behalf Of Michael Steeves
> Sent: 30 March 2007 15:29
> To: [email protected]
> Subject: [Ganglia-general] Not getting something
>
> I'm trying to set up what I hope/think is a pretty straightforward
> configuration -- I'm looking to monitor Oracle RAC via ganglia, and
> I've got three clusters (Prod, Dev and Test). I've got a machine I'm
> using right now for both gmetad and the web front end piece, and I
> can only get one host from each cluster to show up as 'up' in the
> web front end.
>
> Right now, I've got the following setup:
>
> lsora1003 and lsora1006 -- Dev RAC, running just gmond.
>
> cluster {
>   name = "Oracle RAC Dev nodes"
>   owner = "myorg"
>   latlong = "unspecified"
>   url = "unspecified"
> }
>
> /* Feel free to specify as many udp_send_channels as you
>  * like. Gmond used to only support having a single channel */
> udp_send_channel {
>   mcast_join = 239.2.100.71
>   port = 8649
> }
>
> /* You can specify as many udp_recv_channels as you
>  * like as well. */
> udp_recv_channel {
>   mcast_join = 239.2.100.71
>   port = 8649
>   bind = 239.2.100.71
> }
>
> lsora1001, lsora1002 and lsora1005 -- Test RAC, running just gmond.
>
> cluster {
>   name = "Oracle RAC Test nodes"
>   owner = "myorg"
>   latlong = "unspecified"
>   url = "unspecified"
> }
>
> /* Feel free to specify as many udp_send_channels as you
>  * like. Gmond used to only support having a single channel */
> udp_send_channel {
>   mcast_join = 239.2.101.71
>   port = 8649
> }
>
> /* You can specify as many udp_recv_channels as you
>  * like as well. */
> udp_recv_channel {
>   mcast_join = 239.2.101.71
>   port = 8649
>   bind = 239.2.101.71
> }
>
> lsora1004, lsora1007 and lsora1008 -- Prod RAC, running just gmond.
>
> cluster {
>   name = "Oracle RAC Prod nodes"
>   owner = "myorg"
>   latlong = "unspecified"
>   url = "unspecified"
> }
>
> /* Feel free to specify as many udp_send_channels as you
>  * like. Gmond used to only support having a single channel */
> udp_send_channel {
>   mcast_join = 239.2.102.71
>   port = 8649
> }
>
> /* You can specify as many udp_recv_channels as you
>  * like as well. */
> udp_recv_channel {
>   mcast_join = 239.2.102.71
>   port = 8649
>   bind = 239.2.102.71
> }
>
> My gmetad server has gmond running as cluster 'localhost',
> and the following in the gmetad file:
>
> data_source "localhost" localhost
> data_source "Oracle RAC Dev nodes" lsora1003 lsora1006
> data_source "Oracle RAC Test nodes" lsora1001 lsora1002 lsora1005
> data_source "Oracle RAC Prod nodes" lsora1004 lsora1007 lsora1008
>
> Initially, none of the other hosts would show up in their
> clusters, just the first node listed, but over time the other
> nodes do appear, and are active briefly, and then show as
> being down and unable to contact.
>
> -Mike
> --
> Michael Steeves ([EMAIL PROTECTED])
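
The nc check suggested in the reply can also be scripted. What follows is only a minimal Python sketch, assuming gmond answers any TCP connection on port 8649 with an XML dump in which CLUSTER and HOST elements carry a NAME attribute. The function names fetch_gmond_xml and summarize are made up for this illustration and are not part of Ganglia.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_gmond_xml(host, port=8649, timeout=5.0):
    """Read the XML dump gmond sends to a TCP client (what `nc host 8649` prints)."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(65536)
            if not data:  # assumption: gmond closes the socket after the dump
                break
            chunks.append(data)
    return b"".join(chunks)


def summarize(xml_bytes):
    """Map each cluster name in the dump to the sorted host names it reports."""
    root = ET.fromstring(xml_bytes)
    return {
        cluster.get("NAME"): sorted(h.get("NAME") for h in cluster.iter("HOST"))
        for cluster in root.iter("CLUSTER")
    }


# Example use against one node per cluster (host names from the thread):
#   for node in ("lsora1006", "lsora1001", "lsora1004"):
#       print(node, summarize(fetch_gmond_xml(node)))
```

If the same cluster name or the same host list comes back from nodes in different clusters, or hosts are missing or appear under unexpected names, that points at multicast leakage or reverse DNS rather than at gmetad.conf.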

