Thanks a lot. Somehow I missed this thread. Sounds promising. I'll check it out.
Best regards Claus -----Ursprüngliche Nachricht----- Von: "Alexander Bodnarashik" <[email protected]> Gesendet: Sep 22, 2011 8:55:32 PM An: "General Linux-HA mailing list" <[email protected]> Betreff: Re: [Linux-HA] ERROR: glib: Message too long >Hi. >Please see >http://www.gossamer-threads.com/lists/linuxha/users/68406?do=post_view_threaded#68406 > >2011/9/22 Claus Wimmer <[email protected]> > >> Hello, >> >> I have tried to build up a four nodes cluster with heartbeat and pacemaker. >> Everything is alright as long as the cluster consists of 2 nodes. With 3 or >> 4 nodes suddenly error messages come up during a configuration change: >> >> Sep 06 08:16:56 secomat4 heartbeat: [15956]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15954]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15952]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15952]: ERROR: write_child: write >> failure on ucast hb1.: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15954]: ERROR: write_child: write >> failure on ucast hb1.: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15956]: ERROR: write_child: write >> failure on ucast hb1.: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15958]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15958]: ERROR: write_child: write >> failure on ucast hb1.: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15960]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15960]: ERROR: write_child: write >> failure on ucast hb2.: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15966]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15962]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15964]: ERROR: glib: Unable to send >> [-1] ucast packet: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15962]: ERROR: write_child: write >> failure on ucast hb2.: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15966]: ERROR: write_child: write >> failure on ucast hb2.: Message too long >> Sep 06 08:16:56 secomat4 heartbeat: [15964]: ERROR: write_child: write >> failure on ucast hb2.: Message too long >> ... >> >> The result is a loss of nodes' intra cluster connection. This seems to be >> independent of cluster communication protocol. The example error mesages >> show up with ucast (currently I use bcast again, see ha.cf). I have seen >> the same error messages with bcast (bcast with compression didn't work, >> too). With mcast it was slightly different: I couldn't get a single node up >> and running, so i had to switch back to bcast. >> >> >> Some Information about the setup: >> >> ha.cf: >> use_logd on >> udpport 694 >> keepalive 2 >> warntime 10 >> deadtime 15 >> initdead 90 >> bcast hb1 hb2 >> autojoin none >> node secomat1 secomat2 secomat3 secomat4 >> # debug 1 >> crm yes >> apiauth stonith-ng uid=root >> >> >> crm: >> pacemaker >> >> >> RPMs: >> secomat4:~ # rpm -qi heartbeat >> Name : heartbeat Relocations: (not relocatable) >> Version : 3.0.3 Vendor: (none) >> Release : 2.18 Build Date: Wed Sep 29 18:06:13 >> 2010 >> Install Date: Wed Apr 13 15:25:12 2011 Build Host: f13.beekhof.net >> Group : Productivity/Clustering/HA Source RPM: >> heartbeat-3.0.3-2.18.src.rpm >> Size : 10221126 License: GPL v2 only; LGPL >> v2.1 or later >> Signature : (none) >> URL : http://linux-ha.org/ >> Summary : Messaging and membership subsystem for High-Availability >> Linux >> Description : ... >> >> >> secomat4:~ # rpm -qi pacemaker >> Name : pacemaker Relocations: (not relocatable) >> Version : 1.1.5 Vendor: (none) >> Release : 1.1 Build Date: Mon Feb 14 17:34:14 >> 2011 >> Install Date: Wed Apr 13 15:25:15 2011 Build Host: f13.beekhof.net >> Group : Productivity/Clustering/HA Source RPM: >> pacemaker-1.1.5-1.1.src.rpm >> Size : 13626153 License: GPLv2+ and LGPLv2+ >> Signature : (none) >> URL : http://www.clusterlabs.org >> Summary : Scalable High-Availability cluster resource manager >> Description : ... >> >> >> Hardware: >> 2 Intel(R) Xeon(R) CPU X5650 @ 2.67GHz, 2660 MHz >> 6 cores >> 24 processors >> >> >> OS: >> secomat4:~ # uname >> -a >> >> Linux secomat4 2.6.34.10-0.2-default #1 SMP 2011-07-20 18:48:56 +0200 >> x86_64 x86_64 x86_64 GNU/Linux >> >> >> Best regards >> Claus >> ___________________________________________________________ >> Schon gehört? WEB.DE hat einen genialen Phishing-Filter in die >> Toolbar eingebaut! http://produkte.web.de/go/toolbar >> _______________________________________________ >> Linux-HA mailing list >> [email protected] >> http://lists.linux-ha.org/mailman/listinfo/linux-ha >> See also: http://linux-ha.org/ReportingProblems >_______________________________________________ >Linux-HA mailing list >[email protected] >http://lists.linux-ha.org/mailman/listinfo/linux-ha >See also: http://linux-ha.org/ReportingProblems ___________________________________________________________ Schon gehört? WEB.DE hat einen genialen Phishing-Filter in die Toolbar eingebaut! http://produkte.web.de/go/toolbar _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
