I have a 2 node cluster I'm building in the lab. It's running SLES 10sp1. In
the configuration I have XEN loading on both systems and ocfs2 on an iSCSi
host. The nodes should be identical, one was imaged from the other. The only
difference I can see is on node2 the ethernet adapter is eth1.
When I run without XEN everything works fine. When I add XEN I get the
following errors on one node. The second node is fine.
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[11] : [auth=1
a2e4ee9fb9a5df41fde760a723c2e9aa2599e84e]
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: write failure on bcast eth0.:
No such device
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: glib: Unable to send bcast
[-1] packet(len=201): No such device
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG: Dumping message with 12
fields
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[0] : [t=status]
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[1] : [st=active]
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[2] : [dt=7530]
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[3] : [protocol=1]
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[4] : [src=xen1]
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[5] :
[(1)srcuuid=0x80f4b38(36 27)]
Mar 7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[6] : [seq=6d8]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[7] : [hg=10]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[8] : [ts=47d18ef1]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[9] : [ld=2.10 2.07
1.773/143 6735]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[10] : [ttl=4]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[11] : [auth=1
b72f847c91c22b39a77a1dd2b148864839ad0166]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: write failure on bcast eth0.:
No such device
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: glib: Unable to send bcast
[-1] packet(len=196): No such device
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG: Dumping message with 10
fields
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[0] : [t=NS_ackmsg]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[1] : [dest=xen1]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[2] : [ackseq=6d9]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[3] :
[(1)destuuid=0x80f4458(37 28)]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[4] : [src=xen1]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[5] :
[(1)srcuuid=0x80f3a60(36 27)]
Mar 7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[6] : [hg=10]
Mar 7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG[7] : [ts=47d18ef2]
Mar 7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG[8] : [ttl=4]
Mar 7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG[9] : [auth=1
21fbfcb8a3f56e9da1f7df3eea4c6e78f9565f72]
Mar 7 10:53:37 xen1 heartbeat: [3829]: ERROR: write failure on bcast eth0.:
No such device
Mar 7 10:53:37 xen1 heartbeat: [3829]: ERROR: glib: Unable to send bcast
[-1] packet(len=201): No such device
Mar 7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG: Dumping message with 12
fields
this is my ha.cf
xen1:~ # cat /etc/ha.d/ha.cf
autojoin any
crm true
bcast eth0
node xen2
node xen1
respawn root /sbin/evmsd
apiauth evms uid=hacluster,root
ping 192.168.200.5
respawn root /usr/lib/heartbeat/stonithd
respawn root /usr/lib/heartbeat/pingd -m 100 -d 5s
logfacility local0
use_logd yes
xen1:~ #
And this is my ifconfig and brctl responses
eth0 Link encap:Ethernet HWaddr 00:01:80:66:DB:5B
inet addr:192.168.200.3 Bcast:192.168.200.255 Mask:255.255.255.0
inet6 addr: fe80::201:80ff:fe66:db5b/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:5856 errors:0 dropped:0 overruns:0 frame:0
TX packets:6868 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:2458582 (2.3 Mb) TX bytes:1169888 (1.1 Mb)
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:16436 Metric:1
RX packets:159 errors:0 dropped:0 overruns:0 frame:0
TX packets:159 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:14683 (14.3 Kb) TX bytes:14683 (14.3 Kb)
peth0 Link encap:Ethernet HWaddr FE:FF:FF:FF:FF:FF
inet6 addr: fe80::fcff:ffff:feff:ffff/64 Scope:Link
UP BROADCAST RUNNING NOARP MTU:1500 Metric:1
RX packets:6349 errors:0 dropped:0 overruns:0 frame:0
TX packets:7284 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:3079582 (2.9 Mb) TX bytes:1265965 (1.2 Mb)
Base address:0xef00 Memory:fdde0000-fde00000
vif0.0 Link encap:Ethernet HWaddr FE:FF:FF:FF:FF:FF
inet6 addr: fe80::fcff:ffff:feff:ffff/64 Scope:Link
UP BROADCAST RUNNING NOARP MTU:1500 Metric:1
RX packets:6868 errors:0 dropped:0 overruns:0 frame:0
TX packets:5856 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:1169888 (1.1 Mb) TX bytes:2458582 (2.3 Mb)
xenbr0 Link encap:Ethernet HWaddr FE:FF:FF:FF:FF:FF
inet6 addr: fe80::200:ff:fe00:0/64 Scope:Link
UP BROADCAST RUNNING NOARP MTU:1500 Metric:1
RX packets:1454 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:408076 (398.5 Kb) TX bytes:0 (0.0 b)
xen1:~ # brctl show
bridge name bridge id STP enabled interfaces
xenbr0 8000.feffffffffff no vif0.0
peth0
Thanks in advance
--
Rob Aronson
Storage, Virtualization and Orchestration Practice Manager, Novacoast
USA
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems