can you grep for the ip address of the interface specified in the "missing" nodes configuration? i suspect that what might be happening is that the address on the interface you have specified might not be resolvable so you will not get the node246 string but something else or only an ip address.

just a thought.

-matt

Andrea Capriotti wrote:
Hi All,

I have gaglia installed on a cluster [1] with 384 nodes and I just
upgraded from version 2.5.7 to version 3.0. I have two nodes running
gmetad (node247 and node248) and now I cannot see these nodes on the web
front-end and in the XML output (also rrd files are not upgraded).

The only difference in node and gmond configuration is the multicast
interface:

# diff gmond.conf gmond.conf.node247
22c22
<   mcast_if = "eth0"
---

 mcast_if = "eth1"

30c30
<   mcast_if = "eth0"
---

 mcast_if = "eth1"


The part about multicast is the following:

/* channel to send multicast on mcast_channel:mcast_port */
udp_send_channel {
  mcast_join = "239.2.11.71"
  port = "8649"
  mcast_if = "eth0"
}

/* channel to receive multicast from mcast_channel:mcast_port */
udp_recv_channel {
  mcast_join = "239.2.11.71"
  port = "8649"
  bind = "239.2.11.71"
  mcast_if = "eth0"
}

/* channel to export xml on xml_port */
tcp_accept_channel {
  port = "8649"

All other nodes are listed, i.e.:

# telnet localhost 8649 | grep node246

<HOST NAME="node246.clx.cineca.it" IP="10.10.12.246"
REPORTED="1108568138" TN="10" TMAX="20" DMAX="3600"
LOCATION="unspecified" GMOND_STARTED="1108499041">

But nothing about node247 and node248. I also checked with tcpdump and
others nodes in the cluster do not see multicast from node247 and
node248.

node001:~ # tcpdump -i eth0| grep node246
tcpdump: listening on eth0
16:51:59.126916 node246.clx.cineca.it.32774 > 239.2.11.71.8649: udp 8 (DF) [ttl 
1]
16:51:59.126959 node246.clx.cineca.it.32774 > 239.2.11.71.8649: udp 8 (DF) [ttl 
1]
16:51:59.127120 node246.clx.cineca.it.32774 > 239.2.11.71.8649: udp 8 (DF) [ttl 
1]

node001:~ # tcpdump -i eth0| grep node246
tcpdump: listening on eth0

I miss something or is a bug?

[1] SUSE SLES 8 SP3 - kernel 2.4.21-266-smp - rrdtool 1.0.39-168

Best Regards

--
PGP fingerprint 'A7C2 3C2F 8445 AD3C 135E F40B 242A 5984 ACBC 91D3'

   They that can give up essential liberty to obtain a little
      temporary safety deserve neither liberty nor safety.
  --Benjamin Franklin, Historical Review of Pennsylvania, 1759

Attachment: signature.asc
Description: OpenPGP digital signature

Reply via email to