Add the general Openstack list, sorry for folks who are on both lists...
On 11/5/2013 2:42 PM, Xin Zhao wrote:
Hello,
On my grizzly quantum/OVS network node, after I start the
quantum-openvswitch-agent, the system log shows errors as below,
and it repeats every second since then... and the panic messages
continue on even after I stop all openstack daemons, only a system reboot
can clear it out.
Nov 5 14:13:58 cldnet01 kernel: qg-581539d2-ac: hw csum failure.
Nov 5 14:13:58 cldnet01 kernel: Pid: 0, comm: swapper Not tainted
2.6.32-358.123.2.openstack.el6.x86_64 #1
Nov 5 14:13:58 cldnet01 kernel: Call Trace:
Nov 5 14:13:58 cldnet01 kernel: <IRQ> [<ffffffff8144a252>] ?
netdev_rx_csum_fault+0x42/0x50
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81442cc0>] ?
__skb_checksum_complete_head+0x60/0x70
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81442ce1>] ?
__skb_checksum_complete+0x11/0x20
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff814c8b7d>] ?
nf_ip_checksum+0x5d/0x130
Nov 5 14:13:58 cldnet01 kernel: [<ffffffffa01b4d31>] ?
udp_error+0xb1/0x1e0 [nf_conntrack]
Nov 5 14:13:58 cldnet01 kernel: [<ffffffffa01aec98>] ?
nf_conntrack_in+0x138/0xa00 [nf_conntrack]
Nov 5 14:13:58 cldnet01 kernel: [<ffffffffa00721bb>] ?
alloc_null_binding+0x5b/0xa0 [iptable_nat]
Nov 5 14:13:58 cldnet01 kernel: [<ffffffffa0072441>] ?
nf_nat_fn+0x91/0x260 [iptable_nat]
Nov 5 14:13:58 cldnet01 kernel: [<ffffffffa01cc721>] ?
ipv4_conntrack_in+0x21/0x30 [nf_conntrack_ipv4]
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81477459>] ?
nf_iterate+0x69/0xb0
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff814819e9>] ?
ip_rcv_finish+0x199/0x440
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81481850>] ?
ip_rcv_finish+0x0/0x440
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81477614>] ?
nf_hook_slow+0x74/0x110
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81481850>] ?
ip_rcv_finish+0x0/0x440
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81481ef4>] ?
ip_rcv+0x264/0x350
Nov 5 14:13:58 cldnet01 kernel: [<ffffffffa024b503>] ?
ovs_netdev_frame_hook+0xb3/0x110 [openvswitch]
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81449e6b>] ?
__netif_receive_skb+0x4ab/0x750
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8144a1aa>] ?
process_backlog+0x9a/0x100
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8144f483>] ?
net_rx_action+0x103/0x2f0
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff810770b1>] ?
__do_softirq+0xc1/0x1e0
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff810e1bb0>] ?
handle_IRQ_event+0x60/0x170
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8100c1cc>] ?
call_softirq+0x1c/0x30
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8100de05>] ?
do_softirq+0x65/0xa0
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81076e95>] ?
irq_exit+0x85/0x90
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8151cd75>] ? do_IRQ+0x75/0xf0
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8100b9d3>] ?
ret_from_intr+0x0/0x11
Nov 5 14:13:58 cldnet01 kernel: <EOI> [<ffffffff81014907>] ?
mwait_idle+0x77/0xd0
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8151931a>] ?
atomic_notifier_call_chain+0x1a/0x20
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff81009fc6>] ?
cpu_idle+0xb6/0x110
Nov 5 14:13:58 cldnet01 kernel: [<ffffffff8150cc00>] ?
start_secondary+0x2ac/0x2ef
The only other message in the syslog that's related to CSUM is the
following:
Nov 5 14:10:44 cldnet01 kernel: lo: Dropping TSO features since no
CSUM feature.
Nov 5 14:10:44 cldnet01 kernel: lo: Disabled Privacy Extensions
(this message appears after starting the l3-agent)
The network host is RHEL6.4, kernel is
2.6.32-358.123.2.openstack.el6.x86_64
All the daemons appear to being running, an instance can start, but
network doesn't work for the instance.
Any wisdom on what's going on?
Thanks,
Xin
_______________________________________________
rhos-list mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/rhos-list
_______________________________________________
Mailing list: http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack
Post to : [email protected]
Unsubscribe : http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack