Am Mittwoch, 16. Dezember 2009 14:43:33 schrieb [email protected]: > > > I'm configuring a new 2 node cluster using SLES11 and the HAE using > > > openais 0.80.3-26.1 and pacemaker 1.0.3-4.1 > > > > > > The problem I'm having is that the nodes do not seem to find each > > other as > > > > the documentation says they should. > > > > > > Here's a brief rundown of what I've done: > > > > > > 1. configured the two nodes using ip addresses 10.1.254.166 and > > > 10.1.254.169. > > > 2. installed the ha_sles pattern > > > 3. updated the following lines in /etc/ais/openais.conf: > > > bindnetaddr: 10.1.254.0 > > > mcastaddr: 239.252.10.10 > > > mcastport: 5405 > > > 4. opened udp/5405 in the firewall > > > 5. generated /etc/ais/authkey using ais-keygen and copied to second > > node > > > > 6. start openais using rcopenais start > > > > > > Here are my questions: > > > > > > 1. How long should I expect to wait before seeing the CLM messages > > > indicating the nodes joining the cluster? Initially, I waited a few > > > minutes and assumed something was wrong because I never saw these > > > messages. But last night, the following appeared in the log: > > > > > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM ] Members Left: > > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM ] Members Joined: > > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM ] r(0) > > > ip(10.1.254.169) > > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM ] got nodejoin message > > > 10.1.254.166 > > > Dec 14 18:31:59 plccedir03 openais[3739]: [CLM ] got nodejoin message > > > 10.1.254.169 > > > > That looks good. Are you sure that a) there's really no firewall > > involvement and b) your network switch can handle multicast? > > In my latest attempts, I've cleared all iptables rules to be sure that > wasn't an issue. There is no other firewall between these boxes. > > I will pursue the possibility of no multicast support, although our > network engineers have told me it is enabled on our switches. > > > > 2. Using the GUI, the other node never shows online. The node where > > > crm_gui is being run from shows online, but the other one never goes > > > green. > > > > > > 3. After a restart of openais this morning, I have not yet > > > > > > I've included the messages from a shutdown/startup of openais this > > > morning. > > > > Nothing much in the logs, except that nodes don't form a cluster. > > Check if they really communicate using tcpdump or wireshark. > > There's also openais-cfgtool which may display ring status. > > I've captured some packets using tcpdump, and indeed, I never see the > multicast traffic being received, only sent. The odd thing is that these > machines respond to other multicast traffic, like pinging 224.0.0.1. > > Is there a kernel option that anyone is aware of that could be causing the > boxes to drop multicast? > > Thanks, > Justin
I'd have a serious word with your network guys. Show them your tcpdumps and they hopefully will understand. Greetings, -- Dr. Michael Schwartzkopff MultiNET Services GmbH Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany Tel: +49 - 89 - 45 69 11 0 Fax: +49 - 89 - 45 69 11 21 mob: +49 - 174 - 343 28 75 mail: [email protected] web: www.multinet.de Sitz der Gesellschaft: 85630 Grasbrunn Registergericht: Amtsgericht München HRB 114375 Geschäftsführer: Günter Jurgeneit, Hubert Martens --- PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B Skype: misch42 _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
