Hi,

On Fri, Mar 07, 2008 at 10:58:02AM -0800, Rob Aronson wrote:
> I have a 2-node cluster I'm building in the lab. It's running SLES 10 SP1. In
> the configuration I have Xen loading on both systems and OCFS2 on an iSCSI
> host. The nodes should be identical; one was imaged from the other. The only
> difference I can see is that on node2 the Ethernet adapter is eth1.
> 
> When I run without Xen everything works fine. When I add Xen I get the
> following errors on one node. The second node is fine.
> 
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[11] : [auth=1
> a2e4ee9fb9a5df41fde760a723c2e9aa2599e84e]
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: write failure on bcast eth0.:
> No such device
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: glib: Unable to send bcast
> [-1] packet(len=201): No such device
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG: Dumping message with 12
> fields
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[0] : [t=status]
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[1] : [st=active]
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[2] : [dt=7530]
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[3] : [protocol=1]
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[4] : [src=xen1]
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[5] :
> [(1)srcuuid=0x80f4b38(36 27)]
> Mar  7 10:53:35 xen1 heartbeat: [3829]: ERROR: MSG[6] : [seq=6d8]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[7] : [hg=10]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[8] : [ts=47d18ef1]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[9] : [ld=2.10 2.07
> 1.773/143 6735]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[10] : [ttl=4]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[11] : [auth=1
> b72f847c91c22b39a77a1dd2b148864839ad0166]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: write failure on bcast eth0.:
> No such device
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: glib: Unable to send bcast
> [-1] packet(len=196): No such device
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG: Dumping message with 10
> fields
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[0] : [t=NS_ackmsg]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[1] : [dest=xen1]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[2] : [ackseq=6d9]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[3] :
> [(1)destuuid=0x80f4458(37 28)]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[4] : [src=xen1]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[5] :
> [(1)srcuuid=0x80f3a60(36 27)]
> Mar  7 10:53:36 xen1 heartbeat: [3829]: ERROR: MSG[6] : [hg=10]
> Mar  7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG[7] : [ts=47d18ef2]
> Mar  7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG[8] : [ttl=4]
> Mar  7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG[9] : [auth=1
> 21fbfcb8a3f56e9da1f7df3eea4c6e78f9565f72]
> Mar  7 10:53:37 xen1 heartbeat: [3829]: ERROR: write failure on bcast eth0.:
> No such device
> Mar  7 10:53:37 xen1 heartbeat: [3829]: ERROR: glib: Unable to send bcast
> [-1] packet(len=201): No such device
> Mar  7 10:53:37 xen1 heartbeat: [3829]: ERROR: MSG: Dumping message with 12
> fields
> 
> This is my ha.cf:
> xen1:~ # cat /etc/ha.d/ha.cf
> autojoin any
> crm true
> bcast eth0
> node xen2
> node xen1
> respawn root /sbin/evmsd
> apiauth evms uid=hacluster,root
> ping 192.168.200.5
> respawn root /usr/lib/heartbeat/stonithd
> respawn root /usr/lib/heartbeat/pingd -m 100 -d 5s
> logfacility     local0
> use_logd        yes
> xen1:~ #
> 
> And this is my ifconfig and brctl output:
> eth0      Link encap:Ethernet  HWaddr 00:01:80:66:DB:5B
>           inet addr:192.168.200.3  Bcast:192.168.200.255  Mask:255.255.255.0
>           inet6 addr: fe80::201:80ff:fe66:db5b/64 Scope:Link
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           RX packets:5856 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:6868 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:0
>           RX bytes:2458582 (2.3 Mb)  TX bytes:1169888 (1.1 Mb)
> 
> lo        Link encap:Local Loopback
>           inet addr:127.0.0.1  Mask:255.0.0.0
>           inet6 addr: ::1/128 Scope:Host
>           UP LOOPBACK RUNNING  MTU:16436  Metric:1
>           RX packets:159 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:159 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:0
>           RX bytes:14683 (14.3 Kb)  TX bytes:14683 (14.3 Kb)
> 
> peth0     Link encap:Ethernet  HWaddr FE:FF:FF:FF:FF:FF
>           inet6 addr: fe80::fcff:ffff:feff:ffff/64 Scope:Link
>           UP BROADCAST RUNNING NOARP  MTU:1500  Metric:1
>           RX packets:6349 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:7284 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:1000
>           RX bytes:3079582 (2.9 Mb)  TX bytes:1265965 (1.2 Mb)
>           Base address:0xef00 Memory:fdde0000-fde00000
> 
> vif0.0    Link encap:Ethernet  HWaddr FE:FF:FF:FF:FF:FF
>           inet6 addr: fe80::fcff:ffff:feff:ffff/64 Scope:Link
>           UP BROADCAST RUNNING NOARP  MTU:1500  Metric:1
>           RX packets:6868 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:5856 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:0
>           RX bytes:1169888 (1.1 Mb)  TX bytes:2458582 (2.3 Mb)
> 
> xenbr0    Link encap:Ethernet  HWaddr FE:FF:FF:FF:FF:FF
>           inet6 addr: fe80::200:ff:fe00:0/64 Scope:Link
>           UP BROADCAST RUNNING NOARP  MTU:1500  Metric:1
>           RX packets:1454 errors:0 dropped:0 overruns:0 frame:0
>           TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
>           collisions:0 txqueuelen:0
>           RX bytes:408076 (398.5 Kb)  TX bytes:0 (0.0 b)
> 
> xen1:~ # brctl show
> bridge name     bridge id               STP enabled     interfaces
> xenbr0          8000.feffffffffff       no              vif0.0
>                                                         peth0
> 
> Thanks in advance

Is there a firewall running? Other than that, I really can't
think of anything. If nothing else helps, try unicast (ucast)
instead of bcast.
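
For the unicast variant, a minimal sketch of the ha.cf change might
look like the fragment below. It assumes the ucast directive takes a
device and a peer IP address, and that xen2's address is
192.168.200.4. That address is a guess for illustration only, since
only xen1's address (192.168.200.3) appears in the ifconfig output.
As far as I know, a ucast line that points at the local node is
ignored, so the same file should be usable on both nodes, but please
check that against your heartbeat version.

```
# /etc/ha.d/ha.cf (fragment)
# Replaces the "bcast eth0" line; everything else stays as it is.
# Syntax: ucast <device> <peer-ip-address>

# xen1 (address taken from the ifconfig output in the original post)
ucast eth0 192.168.200.3

# xen2 (ASSUMED address, substitute the real one)
ucast eth0 192.168.200.4
```

On node2 the device name would presumably have to be eth1 rather
than eth0, given what you said about the adapter naming there.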

Thanks,

Dejan

> 
> -- 
> Rob Aronson
> Storage, Virtualization and Orchestration Practice Manager, Novacoast
> USA
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
