[dpdk-dev] Issues with openvswitch2.5+dpdk2.2+kvm/virtio-pci
On Tue, Mar 22, 2016 at 9:47 PM, Daniele Di Proietto wrote: > Hi Christian, > > thanks for the report. I've managed to reproduce the problem and I > observed two separate issues: > > 1) short version: it appears to be a problem in DPDK 2.2 and it should be > fixed by 9a0615af7746("virtio: fix restart"), now on master. > [...] > It appears that simply backporting 9a0615af7746("virtio: fix restart") > fixes the issue. > Yeah debugging it I came to the double configuration as cause as well yesterday. Eventually I saw that virtqueue->vq_ring structs got reallocated but stayed zeroed out by the second call. I didn't realize there was a fix for it already - thanks so much for pointing it out. I'll give a backport a try and let you know if it worked. > 2) When ovs-vswitchd is started with --monitor, there will be a parent > process (the monitor) that simply restarts the child if it crashes, while > the child does the actual ovs-vswitchd job. > > It appears that the monitor feature is broken with DPDK, because the DPDK > library is wrongly initialized in the parent process. If the child > crashes, all the DPDK memzones are preserved, meaning that new allocations > will likely fail. > > The fix for this is calling rte_eal_init() in the child process, after > parsing other ovs-vswitchd command line options. This is done (among other > things) in Aaron's series currently under review: > > http://openvswitch.org/pipermail/dev/2016-March/067422.html > > I think we need a separate fix for branch-2.5. > Yeah I was on discussing and testing that series already as the series was also related to a socket ownership/permission issue (back in time). http://openvswitch.org/pipermail/dev/2015-December/063567.html I think eventually we need separate fixes for both on the branch-2.5 [...]
[dpdk-dev] Issues with openvswitch2.5+dpdk2.2+kvm/virtio-pci
Hi Christian, thanks for the report. I've managed to reproduce the problem and I observed two separate issues: 1) short version: it appears to be a problem in DPDK 2.2 and it should be fixed by 9a0615af7746("virtio: fix restart"), now on master. long version: for every DPDK device added to OVS, these calls are issued: netdev_open() ... rte_eth_dev_configure() rte_eth_rx_queue_setup() rte_eth_dev_start() before receiving packets, the datapath calls netdev_set_multiq, which reinitializes the device netdev_set_multiq() ... rte_eth_dev_stop() rte_eth_dev_configure() rte_eth_rx_queue_setup() rte_eth_dev_start() The problem is that the second rte_eth_dev_start() doesn't actually initializes dev->data->rx_queues[0], because it (wrongly?) assumes that it's already initialized. This causes the next call to rte_eth_ex_burst() to crash. It appears that simply backporting 9a0615af7746("virtio: fix restart") fixes the issue. 2) When ovs-vswitchd is started with --monitor, there will be a parent process (the monitor) that simply restarts the child if it crashes, while the child does the actual ovs-vswitchd job. It appears that the monitor feature is broken with DPDK, because the DPDK library is wrongly initialized in the parent process. If the child crashes, all the DPDK memzones are preserved, meaning that new allocations will likely fail. The fix for this is calling rte_eal_init() in the child process, after parsing other ovs-vswitchd command line options. This is done (among other things) in Aaron's series currently under review: http://openvswitch.org/pipermail/dev/2016-March/067422.html I think we need a separate fix for branch-2.5. On 18/03/2016 08:20, "Christian Ehrhardt" wrote: >Hi, >I was trying to replicate a setup that I have working on physical devices >(ixgbe) under kvm since there is a virtio pmd driver. > > >TL;DR: >- under KVM with virtio-pci (working on baremetal with ixgbe cards) >- adding dpdk port to ovs fails with memzone already exists >and causes a segfault >- I couldn't find a solution in similar mails that popped up here >recently, any help or pointer appreciated. > > >## Details ## >I thought I've read that others have it working I thought that would be a >great way to gain more debug control of the environment, but something >seems to be eluding me. > > >There were quite some similar mails on the List recently, but none seemed >to hit the same issue as I do. At least none of the tunings/workarounds >seemed to apply to me. > >As versions I have Openvswitch 2.5, DPDK 2.2, Qemu 2.5, Kernel 4.4 - so a >fairly recent software stack. > > >The super-short repro summary is: >1. starting ovs-dpdk like >ovs-vswitchd --dpdk -c 0x1 -n 4 --pci-blacklist :00:03.0 -m 2048 -- >unix:/var/run/openvswitch/db.sock -vconsole:emer -vsyslog:err -vfile:info >--mlockall --no-chdir --log-file=/var/log/openvswitch/ovs-vswitchd.log >--pidfile=/var/run/openvswitch/ovs-vswitchd.pid > --detach --monitor >2. add a bridge and a ovs dpdk port > >ovs-vsctl add-port ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk > >ovs-vsctl add-port ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk > > > >The log of the initialization after #1 looks good to me - I can see two >of my three virtio devices recognized and one blacklisted. >Memory allocation looks good, ... I'll attach the log at the end of the >mail > > > > >## ISSUE ## >But when I add a port and refer to one of the dpdk ports it fails with >the following: >ovs-vsctl[14023]: ovs|1|vsctl|INFO|Called as ovs-vsctl add-port >ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk >ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): >memzone already exists >ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): >memzone already exists >ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): >memzone already exists >kernel: show_signal_msg: 18 callbacks suppressed >kernel: pmd12[14025]: segfault at 2 ip 7f3eb205eab2 sp >7f3e3dffa590 error 4 in libdpdk.so.0[7f3eb1fdf000+1e9000] >ovs-vswitchd[13902]: ovs|3|daemon_unix(monitor)|ERR|1 crashes: pid >13903 died, killed (Segmentation fault), core dumped, restarting >systemd-udevd[14040]: Could not generate persistent MAC address for >ovs-netdev: No such file or directory >kernel: device ovs-netdev entered promiscuous mode >ovs-vswitchd[14036]: EAL: memzone_reserve_aligned_thread_unsafe(): >memzone already exists >ovs-vswitchd[14036]: RING: Cannot reserve memory >kernel: device ovsdpdkbr0 entered promiscuous mode >ovs-vswitchd[14036]: EAL: memzone_reserve_aligned_thread_unsafe(): >memzone already exists >ovs-vswitchd[14036]: RING: Cannot reserve memory > > > > > >## Experiments (failed) ## >I thought it could be related to all the multiqueue chances that recently >going in. >My usual setup has 4 vCPUs and 4 queues per virtio-net device. >I tried them with only 1 of 4 queues, also with only 1 queue defined and >only 1 CPU - but all fail the same
[dpdk-dev] Issues with openvswitch2.5+dpdk2.2+kvm/virtio-pci
FYI - a user also reported that the same issue also affects non-virtual environments when using the BNX2X_PMD Christian Ehrhardt Software Engineer, Ubuntu Server Canonical Ltd On Fri, Mar 18, 2016 at 4:20 PM, Christian Ehrhardt < christian.ehrhardt at canonical.com> wrote: > Hi, > I was trying to replicate a setup that I have working on physical devices > (ixgbe) under kvm since there is a virtio pmd driver. > > TL;DR: > - under KVM with virtio-pci (working on baremetal with ixgbe cards) > - adding dpdk port to ovs fails with memzone already exists > and causes a segfault > - I couldn't find a solution in similar mails that popped up here > recently, any help or pointer appreciated. > > ## Details ## > I thought I've read that others have it working I thought that would be a > great way to gain more debug control of the environment, but something > seems to be eluding me. > > There were quite some similar mails on the List recently, but none seemed > to hit the same issue as I do. At least none of the tunings/workarounds > seemed to apply to me. > As versions I have Openvswitch 2.5, DPDK 2.2, Qemu 2.5, Kernel 4.4 - so a > fairly recent software stack. > > The super-short repro summary is: > 1. starting ovs-dpdk like > ovs-vswitchd --dpdk -c 0x1 -n 4 --pci-blacklist :00:03.0 -m 2048 -- > unix:/var/run/openvswitch/db.sock -vconsole:emer -vsyslog:err -vfile:info > --mlockall --no-chdir --log-file=/var/log/openvswitch/ovs-vswitchd.log > --pidfile=/var/run/openvswitch/ovs-vswitchd.pid --detach --monitor > 2. add a bridge and a ovs dpdk port > ovs-vsctl add-port ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk > ovs-vsctl add-port ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk > > The log of the initialization after #1 looks good to me - I can see two of > my three virtio devices recognized and one blacklisted. > Memory allocation looks good, ... I'll attach the log at the end of the > mail > > > ## ISSUE ## > But when I add a port and refer to one of the dpdk ports it fails with the > following: > ovs-vsctl[14023]: ovs|1|vsctl|INFO|Called as ovs-vsctl add-port > ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk > ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone > already exists > ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone > already exists > ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone > already exists > kernel: show_signal_msg: 18 callbacks suppressed > kernel: pmd12[14025]: segfault at 2 ip 7f3eb205eab2 sp > 7f3e3dffa590 error 4 in libdpdk.so.0[7f3eb1fdf000+1e9000] > ovs-vswitchd[13902]: ovs|3|daemon_unix(monitor)|ERR|1 crashes: pid > 13903 died, killed (Segmentation fault), core dumped, restarting > systemd-udevd[14040]: Could not generate persistent MAC address for > ovs-netdev: No such file or directory > kernel: device ovs-netdev entered promiscuous mode > ovs-vswitchd[14036]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone > already exists > ovs-vswitchd[14036]: RING: Cannot reserve memory > kernel: device ovsdpdkbr0 entered promiscuous mode > ovs-vswitchd[14036]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone > already exists > ovs-vswitchd[14036]: RING: Cannot reserve memory > > > ## Experiments (failed) ## > I thought it could be related to all the multiqueue chances that recently > going in. > My usual setup has 4 vCPUs and 4 queues per virtio-net device. > I tried them with only 1 of 4 queues, also with only 1 queue defined and > only 1 CPU - but all fail the same way. > I have testpmd and l2fwd on the same devices working, so I hope they are > not totally set up badly. > > I also tried hilarious things like reassigning to uio_pci_generic before, > but well its virtio_pmd eventually anyways - so it made no difference. > > From how it appears I felt that it could be related to the old discussions > around > [1] http://dpdk.org/ml/archives/dev/2015-May/017589.html > [2] http://openvswitch.org/pipermail/dev/2015-March/052344.html > But they are (partially) applied upstream already and the issue doesn't > 100% match the old discussions. > > > ## Logs ## > [3] log of openvswitch start: > systemd[1]: Starting Open vSwitch Internal Unit... > ovs-ctl[13868]: * Starting ovsdb-server > ovs-vsctl[13893]: ovs|1|vsctl|INFO|Called as ovs-vsctl --no-wait -- > init -- set Open_vSwitch . db-version=7.12.1 > ovs-vsctl[13898]: ovs|1|vsctl|INFO|Called as ovs-vsctl --no-wait set > Open_vSwitch . ovs-version=2.5.0 > "external-ids:system-id=\"8ddb892c-53a5-410d-a765-0031ad6eb1be\"" > "system-type=\"Ubuntu\"" "system-version=\"16.04-xenial\"" > ovs-ctl[13868]: * Configuring Open vSwitch system IDs > ovs-ctl[13868]: 2016-03-18T14:28:28Z|1|dpdk|INFO|No -vhost_sock_dir > provided - defaulting to /var/run/openvswitch > ovs-vswitchd[13900]: ovs|1|dpdk|INFO|No -vhost_sock_dir provided - > defaulting to /var/run/openvswitch > ovs-ctl[13868]: EAL: Detected lcore 0 as core 0 on
[dpdk-dev] Issues with openvswitch2.5+dpdk2.2+kvm/virtio-pci
Hi, I was trying to replicate a setup that I have working on physical devices (ixgbe) under kvm since there is a virtio pmd driver. TL;DR: - under KVM with virtio-pci (working on baremetal with ixgbe cards) - adding dpdk port to ovs fails with memzone already exists and causes a segfault - I couldn't find a solution in similar mails that popped up here recently, any help or pointer appreciated. ## Details ## I thought I've read that others have it working I thought that would be a great way to gain more debug control of the environment, but something seems to be eluding me. There were quite some similar mails on the List recently, but none seemed to hit the same issue as I do. At least none of the tunings/workarounds seemed to apply to me. As versions I have Openvswitch 2.5, DPDK 2.2, Qemu 2.5, Kernel 4.4 - so a fairly recent software stack. The super-short repro summary is: 1. starting ovs-dpdk like ovs-vswitchd --dpdk -c 0x1 -n 4 --pci-blacklist :00:03.0 -m 2048 -- unix:/var/run/openvswitch/db.sock -vconsole:emer -vsyslog:err -vfile:info --mlockall --no-chdir --log-file=/var/log/openvswitch/ovs-vswitchd.log --pidfile=/var/run/openvswitch/ovs-vswitchd.pid --detach --monitor 2. add a bridge and a ovs dpdk port ovs-vsctl add-port ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk ovs-vsctl add-port ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk The log of the initialization after #1 looks good to me - I can see two of my three virtio devices recognized and one blacklisted. Memory allocation looks good, ... I'll attach the log at the end of the mail ## ISSUE ## But when I add a port and refer to one of the dpdk ports it fails with the following: ovs-vsctl[14023]: ovs|1|vsctl|INFO|Called as ovs-vsctl add-port ovsdpdkbr0 dpdk0 -- set Interface dpdk0 type=dpdk ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone already exists ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone already exists ovs-vswitchd[13903]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone already exists kernel: show_signal_msg: 18 callbacks suppressed kernel: pmd12[14025]: segfault at 2 ip 7f3eb205eab2 sp 7f3e3dffa590 error 4 in libdpdk.so.0[7f3eb1fdf000+1e9000] ovs-vswitchd[13902]: ovs|3|daemon_unix(monitor)|ERR|1 crashes: pid 13903 died, killed (Segmentation fault), core dumped, restarting systemd-udevd[14040]: Could not generate persistent MAC address for ovs-netdev: No such file or directory kernel: device ovs-netdev entered promiscuous mode ovs-vswitchd[14036]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone already exists ovs-vswitchd[14036]: RING: Cannot reserve memory kernel: device ovsdpdkbr0 entered promiscuous mode ovs-vswitchd[14036]: EAL: memzone_reserve_aligned_thread_unsafe(): memzone already exists ovs-vswitchd[14036]: RING: Cannot reserve memory ## Experiments (failed) ## I thought it could be related to all the multiqueue chances that recently going in. My usual setup has 4 vCPUs and 4 queues per virtio-net device. I tried them with only 1 of 4 queues, also with only 1 queue defined and only 1 CPU - but all fail the same way. I have testpmd and l2fwd on the same devices working, so I hope they are not totally set up badly. I also tried hilarious things like reassigning to uio_pci_generic before, but well its virtio_pmd eventually anyways - so it made no difference. >From how it appears I felt that it could be related to the old discussions around [1] http://dpdk.org/ml/archives/dev/2015-May/017589.html [2] http://openvswitch.org/pipermail/dev/2015-March/052344.html But they are (partially) applied upstream already and the issue doesn't 100% match the old discussions. ## Logs ## [3] log of openvswitch start: systemd[1]: Starting Open vSwitch Internal Unit... ovs-ctl[13868]: * Starting ovsdb-server ovs-vsctl[13893]: ovs|1|vsctl|INFO|Called as ovs-vsctl --no-wait -- init -- set Open_vSwitch . db-version=7.12.1 ovs-vsctl[13898]: ovs|1|vsctl|INFO|Called as ovs-vsctl --no-wait set Open_vSwitch . ovs-version=2.5.0 "external-ids:system-id=\"8ddb892c-53a5-410d-a765-0031ad6eb1be\"" "system-type=\"Ubuntu\"" "system-version=\"16.04-xenial\"" ovs-ctl[13868]: * Configuring Open vSwitch system IDs ovs-ctl[13868]: 2016-03-18T14:28:28Z|1|dpdk|INFO|No -vhost_sock_dir provided - defaulting to /var/run/openvswitch ovs-vswitchd[13900]: ovs|1|dpdk|INFO|No -vhost_sock_dir provided - defaulting to /var/run/openvswitch ovs-ctl[13868]: EAL: Detected lcore 0 as core 0 on socket 0 ovs-ctl[13868]: EAL: Detected lcore 1 as core 0 on socket 0 ovs-ctl[13868]: EAL: Detected lcore 2 as core 0 on socket 0 ovs-ctl[13868]: EAL: Detected lcore 3 as core 0 on socket 0 ovs-ctl[13868]: EAL: Support maximum 128 logical core(s) by configuration. ovs-ctl[13868]: EAL: Detected 4 lcore(s) ovs-ctl[13868]: EAL: No free hugepages reported in hugepages-1048576kB ovs-ctl[13868]: EAL: VFIO modules not all loaded, skip VFIO support... ovs-ctl[13868]: EAL: