> > I'm configuring a new 2 node cluster using SLES11 and the HAE using
> > openais 0.80.3-26.1 and pacemaker 1.0.3-4.1
> >
> > The problem I'm having is that the nodes do not seem to find each other
> > as the documentation says they should.
> >
> > Here's a brief rundown of what I've done:
> >
> > 1. configured the two nodes using ip addresses 10.1.254.166 and
> > 10.1.254.169.
> > 2. installed the ha_sles pattern
> > 3. updated the following lines in /etc/ais/openais.conf:
> >         bindnetaddr:    10.1.254.0
> >         mcastaddr:      239.252.10.10
> >         mcastport:      5405
> > 4. opened udp/5405 in the firewall
> > 5. generated /etc/ais/authkey using ais-keygen and copied to second node
> > 6. start openais using rcopenais start
> >
> > Here are my questions:
> >
> > 1. How long should I expect to wait before seeing the CLM messages
> > indicating the nodes joining the cluster?  Initially, I waited a few
> > minutes and assumed something was wrong because I never saw these
> > messages.  But last night, the following appeared in the log:
> >
> > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] Members Left:
> > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] Members Joined:
> > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ]       r(0) ip(10.1.254.169)
> > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] got nodejoin message 10.1.254.166
> > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] got nodejoin message 10.1.254.169

> That looks good. Are you sure that a) there's really no firewall
> involvement and b) your network switch can handle multicast?

In my latest attempts, I've cleared all iptables rules to be sure that 
wasn't an issue.  There is no other firewall between these boxes.

I will look into whether multicast is really being passed between the two
nodes, although our network engineers have told me it is enabled on our
switches.

> > 2. Using the GUI, the other node never shows online.  The node where
> > crm_gui is being run from shows online, but the other one never goes
> > green.
> >
> > 3. After a restart of openais this morning, I have not yet
> >
> > I've included the messages from a shutdown/startup of openais this
> > morning.

> Nothing much in the logs, except that nodes don't form a cluster.
> Check if they really communicate using tcpdump or wireshark.
> There's also openais-cfgtool which may display ring status.

I've captured some packets using tcpdump, and indeed, I never see the 
multicast traffic being received, only sent.  The odd thing is that these 
machines respond to other multicast traffic, like pinging 224.0.0.1.
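
For reference, the same receive check can be repeated without openais in the
picture. The following is a minimal sketch, assuming Python 3 is available on
both nodes and that openais is stopped so the port is free. The group and port
are the mcastaddr and mcastport values from openais.conf above; the function
names membership_request and listen are made up for illustration and are not
part of openais or pacemaker.

```python
import ipaddress
import socket

GROUP = "239.252.10.10"  # mcastaddr from /etc/ais/openais.conf
PORT = 5405              # mcastport from /etc/ais/openais.conf


def membership_request(group, local_ip="0.0.0.0"):
    """Build the 8-byte ip_mreq value expected by IP_ADD_MEMBERSHIP.

    local_ip selects the interface to join on; 0.0.0.0 lets the kernel pick.
    """
    if not ipaddress.ip_address(group).is_multicast:
        raise ValueError("%s is not a multicast address" % group)
    return socket.inet_aton(group) + socket.inet_aton(local_ip)


def listen(group=GROUP, port=PORT, local_ip="0.0.0.0", timeout=10.0):
    """Join the group and wait for one datagram.

    Returns (sender_address, payload_length), or None on timeout.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    try:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        sock.bind(("", port))
        sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                        membership_request(group, local_ip))
        sock.settimeout(timeout)
        data, sender = sock.recvfrom(65535)
        return sender, len(data)
    except socket.timeout:
        return None
    finally:
        sock.close()


def send(message=b"multicast test", group=GROUP, port=PORT, ttl=1):
    """Send one datagram to the group (run this on the other node)."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    try:
        sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, ttl)
        sock.sendto(message, (group, port))
    finally:
        sock.close()


if __name__ == "__main__":
    result = listen()
    print("received from %s, %d bytes" % result if result else "nothing received")
```

Running listen() on one node (passing its own address, e.g. 10.1.254.166, as
local_ip to pin the interface) and send() on the other should show whether a
datagram to 239.252.10.10:5405 arrives at all. If it does not, the problem is
probably below openais, in the host or the network.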

Is there a kernel option that anyone is aware of that could be causing the 
boxes to drop multicast?

Thanks,
Justin
