On Wednesday, 16 December 2009 14:43:33, [email protected] wrote:
> > > I'm configuring a new 2 node cluster using SLES11 and the HAE using
> > > openais 0.80.3-26.1 and pacemaker 1.0.3-4.1
> > >
> > > The problem I'm having is that the nodes do not seem to find each other
> > > as the documentation says they should.
> > >
> > > Here's a brief rundown of what I've done:
> > >
> > > 1. configured the two nodes using ip addresses 10.1.254.166 and
> > > 10.1.254.169.
> > > 2. installed the ha_sles pattern
> > > 3. updated the following lines in /etc/ais/openais.conf:
> > >         bindnetaddr:    10.1.254.0
> > >         mcastaddr:      239.252.10.10
> > >         mcastport:      5405
> > > 4. opened udp/5405 in the firewall
> > > 5. generated /etc/ais/authkey using ais-keygen and copied to second node
> > > 6. start openais using rcopenais start
> > >
> > > Here are my questions:
> > >
> > > 1. How long should I expect to wait before seeing the CLM messages
> > > indicating the nodes joining the cluster?  Initially, I waited a few
> > > minutes and assumed something was wrong because I never saw these
> > > messages.  But last night, the following appeared in the log:
> > >
> > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] Members Left:
> > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] Members Joined:
> > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ]       r(0) ip(10.1.254.169)
> > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] got nodejoin message 10.1.254.166
> > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM  ] got nodejoin message 10.1.254.169
> >
> > That looks good. Are you sure that a) there's really no firewall
> > involvement and b) your network switch can handle multicast?
>
> In my latest attempts, I've cleared all iptables rules to be sure that
> wasn't an issue.  There is no other firewall between these boxes.
>
> I will pursue the possibility of no multicast support, although our
> network engineers have told me it is enabled on our switches.
>
> > > 2. Using the GUI, the other node never shows online.  The node where
> > > crm_gui is being run from shows online, but the other one never goes
> > > green.
> > >
> > > 3. After a restart of openais this morning, I have not yet
> > >
> > > I've included the messages from a shutdown/startup of openais this
> > > morning.
> >
> > Nothing much in the logs, except that nodes don't form a cluster.
> > Check if they really communicate using tcpdump or wireshark.
> > There's also openais-cfgtool which may display ring status.
>
> I've captured some packets using tcpdump, and indeed, I never see the
> multicast traffic being received, only sent.  The odd thing is that these
> machines respond to other multicast traffic, like pinging 224.0.0.1.
>
> Is there a kernel option that anyone is aware of that could be causing the
> boxes to drop multicast?
>
> Thanks,
> Justin

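I'd have a serious word with your network guys. That the boxes answer a
ping to 224.0.0.1 proves little: link-local groups in 224.0.0.0/24 are
normally flooded by switches, whereas a group like 239.252.10.10 can be
filtered by IGMP snooping if it is enabled.

Show them your tcpdumps and they will hopefully understand.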
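
If you want something independent of openais to put next to the tcpdumps,
a small multicast test script can help. The following is only a minimal
sketch, not anything shipped with openais or pacemaker: the function names
(build_mreq, receive, send) and the "recv"/"send" command-line arguments
are made up for illustration. It defaults to the mcastaddr and mcastport
from your openais.conf and assumes openais is stopped while you test, so
the test packets do not mix with cluster traffic.

```python
import socket
import sys

GROUP = "239.252.10.10"  # mcastaddr from the openais.conf quoted above
PORT = 5405              # mcastport from the same file (assumes openais is stopped)


def build_mreq(group, iface_ip="0.0.0.0"):
    """Pack an ip_mreq: 4 bytes group address + 4 bytes local interface address."""
    return socket.inet_aton(group) + socket.inet_aton(iface_ip)


def receive(group=GROUP, port=PORT, iface_ip="0.0.0.0"):
    """Join the group and print every datagram that arrives (Ctrl-C to stop)."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind(("", port))
    # Joining the group makes the kernel send an IGMP membership report.
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                    build_mreq(group, iface_ip))
    while True:
        data, sender = sock.recvfrom(1024)
        print("received %r from %s" % (data, sender[0]))


def send(message, group=GROUP, port=PORT, iface_ip=None, ttl=1):
    """Send one datagram to the group; TTL 1 keeps it on the local subnet."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_TTL, ttl)
    if iface_ip:
        # Pin the outgoing interface on multi-homed hosts.
        sock.setsockopt(socket.IPPROTO_IP, socket.IP_MULTICAST_IF,
                        socket.inet_aton(iface_ip))
    sock.sendto(message.encode(), (group, port))
    sock.close()


if __name__ == "__main__":
    if len(sys.argv) >= 2 and sys.argv[1] == "recv":
        receive()
    elif len(sys.argv) >= 2 and sys.argv[1] == "send":
        send("hello from %s" % socket.gethostname())
    else:
        print("usage: recv | send")
```

Run it with "recv" on 10.1.254.166 and with "send" on 10.1.254.169, then
swap the roles. If the receiver stays silent in both directions while
unicast between the boxes works, that points at the network rather than
at openais.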

Greetings,

-- 
Dr. Michael Schwartzkopff
MultiNET Services GmbH
Address: Bretonischer Ring 7; 85630 Grasbrunn; Germany
Tel: +49 - 89 - 45 69 11 0
Fax: +49 - 89 - 45 69 11 21
mob: +49 - 174 - 343 28 75

mail: [email protected]
web: www.multinet.de

Registered office: 85630 Grasbrunn
Court of registration: Amtsgericht München HRB 114375
Managing directors: Günter Jurgeneit, Hubert Martens

---

PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
Skype: misch42
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
