Hi Si-Wei, the 5.11 kernel has reached EOL in Feb 2022. Kernel 5.15 is
the one currently being used for linux-oracle kernel on Focal (20.04)
and Jammy (22.04), and it has the commit that you mentioned above:

$ git log --oneline | grep -i "Fix page DMA map/unmap attributes"
a865fe280b96 net/mlx5e: Fix page DMA map/unmap attributes

$ git tag --contains a865fe280b96
Ubuntu-oracle-5.15.0-1001.1
Ubuntu-oracle-5.15.0-1001.2
Ubuntu-oracle-5.15.0-1001.3
Ubuntu-oracle-5.15.0-1002.4
Ubuntu-oracle-5.15.0-1003.5
Ubuntu-oracle-5.15.0-1004.6
Ubuntu-oracle-5.15.0-1005.7
Ubuntu-oracle-5.15.0-1006.8
Ubuntu-oracle-5.15.0-1007.9
Ubuntu-oracle-5.15.0-1009.12
Ubuntu-oracle-5.15.0-1011.15
Ubuntu-oracle-5.15.0-1012.16
Ubuntu-oracle-5.15.0-1013.17

Can you test a guest running 5.15 to see if this addresses the problem?

Regards,
Fabio Martins

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-oracle-5.11 in Ubuntu.
https://bugs.launchpad.net/bugs/1980884

Title:
  ubuntu guest kernel panics when a sev guest with passthrough mlx5 VF
  is used

Status in linux-oracle-5.11 package in Ubuntu:
  New

Bug description:
  Guest kernel panic can be observed when Ubuntu SEV guest with mlx5 vfio-pci 
is started
  as iperf3 server using "iperf3 -s" and as soon as the client tries to connect
  with it.

  Steps to reproduce:

  HOST INFO
  Host type     : OCI (Oracle Cloud) Bare-Metal Server
  Server/Machine: ORACLE SERVER E4-2c
  CPU model     : AMD EPYC 7J13 64-Core Processor
  Architecture  : x86_64
  Host OS       : Oracle Linux Server release 7.9
  Host Kernel   : 5.4.17-2136.309.3.el7uek.x86_64 #2 SMP Tue Jun 14 21:58:29 
PDT 2022
  Hypervisor    : QEMU emulator version 4.2.1 (qemu-4.2.1-17.1.el7)
  OVMF/AAVMF    : OVMF-1.6.2-2.el7.noarch
  libiscsi      : libiscsi-1.19.0-1.el7.x86_64
  Guest Kernel  : 5.11.0-1028-ORACLE

  1) Start Ubuntu 20.04/18.04 SEV guest with vfio-pci:

  /usr/bin/qemu-system-x86_64 -machine q35 -name OL20.04-uefi -enable-kvm
  -nodefaults -cpu host,+host-phys-bits -m 8G -smp 8,maxcpus=240 -monitor stdio
  -vnc 0.0.0.0:0,to=999 -vga std -drive
  
file=/usr/share/OVMF/OVMF_CODE.pure-efi.fd,index=0,if=pflash,format=raw,readonly
     -drive file=OVMF_VARS.pure-efi.fd.ol20.04,index=1,if=pflash,format=raw
  -device
  virtio-scsi-pci,id=virtio-scsi-pci0,disable-legacy=on,iommu_platform=true
  -drive
  
file=/systest/atanveer/scripts/Ubuntu-20.04-2022.02.15-0-uefi-x86_64.qcow2,if=none,id=local_disk0,format=qcow2,media=disk
 -device
  ide-hd,drive=local_disk0,id=local_disk1,bootindex=0      -net none -device
  vfio-pci,host=0000:21:10.1 -qmp tcp:127.0.0.1:3334,server,nowait -serial
  telnet:127.0.0.1:3333,server,nowait  -D ./OL20.04-uefi.log     -device
  virtio-rng-pci,disable-legacy=on,iommu_platform=true -object
  sev-guest,id=sev0,cbitpos=51,reduced-phys-bits=1 -machine
  memory-encryption=sev0

  2) Start a client guest OL/Ubuntu:

  /usr/bin/qemu-system-x86_64 -machine q35 -name OL18.04-uefi -enable-kvm
  -nodefaults -cpu host,+host-phys-bits -m 8G -smp 8,maxcpus=240 -monitor stdio
  -vnc 0.0.0.0:0,to=999 -vga std -drive
  
file=/usr/share/OVMF/OVMF_CODE.pure-efi.fd,index=0,if=pflash,format=raw,readonly
     -drive file=OVMF_VARS.pure-efi.fd.ol18.04,index=1,if=pflash,format=raw
  -device
  virtio-scsi-pci,id=virtio-scsi-pci0,disable-legacy=on,iommu_platform=true
  -drive
  
file=/systest/atanveer/scripts/Ubuntu-18.04-2022.02.13-0-uefi-x86_64.qcow2,if=none,id=local_disk0,format=qcow2,media=disk
 -device
  ide-hd,drive=local_disk0,id=local_disk1,bootindex=0      -net none -device
  vfio-pci,host=0000:21:10.2 -qmp tcp:127.0.0.1:6666,server,nowait -serial
  telnet:127.0.0.1:5555,server,nowait  -D ./OL18.04-uefi.log     -device
  virtio-rng-pci,disable-legacy=on,iommu_platform=true -object
  sev-guest,id=sev0,cbitpos=51,reduced-phys-bits=1 -machine
  memory-encryption=sev0

  3) Flush iptables on both the VMs using "iptables -F"

  4) Start the iperf3 server on the first VM using "iperf3 -s"

  5) Start the iperf3 client on the second VM using "iperf3 -c <server_ip> -4
  -f M -i 0 -t 70 -O 10 -P 64"

  The kernel panic is seen on the first VM i.e. Ubuntu 20.04 with iperf3 also
  showing "Bad Address" error.

  Console logs:

  root@ubuntu-20-04:~# iperf3 -s
  -----------------------------------------------------------
  Server listening on 5201
  -----------------------------------------------------------
  Accepted connection from 10.196.246.104, port 33732
  [  5] local 10.196.247.88 port 5201 connected to 10.196.246.104 port 33734
  [  8] local 10.196.247.88 port 5201 connected to 10.196.246.104 port 33736
  [ 10] local 10.196.247.88 port 5201 connected to 10.196.246.104 port 33738
  iperf3: error - unable to read from stream socket: Bad address
  -----------------------------------------------------------
  Server listening on 5201
  -----------------------------------------------------------
  [   91.083856] general protection fault: 0000 [#1] SMP NOPTI
  [   91.084591] CPU: 4 PID: 0 Comm: swapper/4 Not tainted 5.11.0-1028-oracle
  #31~20.04.1-Ubuntu
  [   91.085393] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.6.2
  06/01/2022
  [   91.086205] RIP: 0010:memcpy_erms+0x6/0x10
  [   91.086640] Code: cc cc cc cc eb 1e 0f 1f 00 48 89 f8 48 89 d1 48 c1 e9 03
  83 e2 07 f3 48 a5 89 d1 f3 a4 c3 66 0f 1f 44 00 00 48 89 f8 48 89 d1 <f3> a4
  c3 0f 1f 80 00 00 00 00 48 89 f8 48 83 fa 20 72 7e 40 38 fe
  [   91.088559] RSP: 0018:ffffa9c1408e4b60 EFLAGS: 00010282
  [   91.089105] RAX: ffff938cd8e48000 RBX: 0000000000001000 RCX:
  0000000000001000
  [   91.089843] RDX: 0000000000001000 RSI: bb62fcf4fd5bf3d6 RDI:
  ffff938cd8e48000
  [   91.090578] RBP: ffffa9c1408e4c00 R08: ffffef2745639200 R09:
  0000000000000000
  [   91.091309] R10: ffffef27456399c8 R11: 0000000000004209 R12:
  0000000000001000
  [   91.092043] R13: ffffef2745639200 R14: 0000000000001000 R15:
  000000000d558380
  [   91.092782] FS:  0000000000000000(0000) GS:ffff938df4300000(0000)
  knlGS:0000000000000000
  [   91.093615] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [   91.094206] CR2: 00005573bf7ac958 CR3: 0008000105c12006 CR4:
  0000000000770ee0
  [   91.094943] PKRU: 55555554
  [   91.095230] Call Trace:
  [   91.095490]  <IRQ>
  [   91.095709]  ? skb_copy_ubufs+0x448/0x5e0
  [   91.096130]  __netif_receive_skb_core+0xdbf/0xf60
  [   91.096623]  ? irqentry_exit+0x20/0x30
  [   91.097018]  ? asm_common_interrupt+0x1e/0x40
  [   91.097471]  __netif_receive_skb_list_core+0x102/0x250
  [   91.098007]  netif_receive_skb_list_internal+0x1a1/0x2b0
  [   91.098560]  ? inet_gro_receive+0x24b/0x310
  [   91.098996]  gro_normal_list.part.0+0x1e/0x40
  [   91.099447]  gro_normal_one+0x46/0x50
  [   91.099832]  napi_gro_receive+0x161/0x1a0
  [   91.100251]  mlx5e_handle_rx_cqe_mpwrq+0x127/0x230 [mlx5_core]
  [   91.100886]  mlx5e_poll_rx_cq+0x20c/0xa30 [mlx5_core]
  [   91.101430]  mlx5e_napi_poll+0xda/0x670 [mlx5_core]
  [   91.101958]  ? mlx5_eq_comp_int+0x149/0x1b0 [mlx5_core]
  [   91.102520]  net_rx_action+0x13f/0x3f0
  [   91.102913]  __do_softirq+0xe0/0x29b
  [   91.103288]  asm_call_irq_on_stack+0x12/0x20
  [   91.103736]  </IRQ>
  [   91.103959]  do_softirq_own_stack+0x3d/0x50
  [   91.104394]  irq_exit_rcu+0xa4/0xb0
  [   91.104766]  common_interrupt+0x7d/0x150
  [   91.105177]  asm_common_interrupt+0x1e/0x40
  [   91.105616] RIP: 0010:native_safe_halt+0xe/0x10
  [   91.106087] Code: 7b ff ff ff eb bd cc cc cc cc cc cc e9 07 00 00 00 0f 00
  2d e6 76 59 00 f4 c3 66 90 e9 07 00 00 00 0f 00 2d d6 76 59 00 fb f4 <c3> cc
  0f 1f 44 00 00 55 48 89 e5 53 65 8b 15 5f 3f 1a 4e 0f 1f 44
  [   91.107593] RSP: 0018:ffffa9c1400b3e90 EFLAGS: 00000202
  [   91.108133] RAX: ffffffffb1e75470 RBX: 0000000000000004 RCX:
  ffff938df4334fc0
  [   91.109235] RDX: 00000000000181c6 RSI: 000000151f3f7720 RDI:
  0000000000000082
  [   91.110329] RBP: ffffa9c1400b3e98 R08: 000000cd42e4dffb R09:
  0000001532524720
  [   91.111419] R10: 0000000000000001 R11: 000000000000000c R12:
  ffff938c8034b100
  [   91.112503] R13: ffff938c8034b100 R14: 0000000000000000 R15:
  0000000000000000
  [   91.113587]  ? __cpuidle_text_start+0x8/0x8
  [   91.114358]  ? default_idle+0xe/0x20
  [   91.115068]  arch_cpu_idle+0x15/0x20
  [   91.115765]  default_idle_call+0x38/0xc0
  [   91.116481]  do_idle+0x1f8/0x260
  [   91.117130]  ? complete+0x3f/0x50
  [   91.117776]  cpu_startup_entry+0x20/0x30
  [   91.118479]  start_secondary+0x11f/0x160
  [   91.119183]  secondary_startup_64_no_verify+0xc2/0xcb
  [   91.120003] Modules linked in: ip6table_filter ip6_tables xt_comment
  xt_owner ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_state xt_conntrack
  nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter nls_iso8859_1
  dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua intel_rapl_msr
  intel_rapl_common amd_energy kvm lz4hc lz4hc_compress joydev input_leds
  efi_pstore serio_raw qemu_fw_cfg mac_hid sch_fq_codel msr sunrpc virtio_rng
  ip_tables x_tables autofs4 btrfs blake2b_generic iscsi_tcp libiscsi_tcp
  libiscsi scsi_transport_iscsi iscsi_ibft iscsi_boot_sysfs raid10 raid456
  async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1
  raid0 multipath linear mlx5_ib ib_uverbs ib_core bochs_drm drm_vram_helper
  crct10dif_pclmul drm_ttm_helper crc32_pclmul ghash_clmulni_intel ttm
  virtio_scsi aesni_intel crypto_simd cryptd glue_helper drm_kms_helper
  mlx5_core syscopyarea sysfillrect sysimgblt fb_sys_fops ahci pci_hyperv_intf
  i2c_i801 mlxfw i2c_smbus drm psmouse libahci lpc_ich
  [   91.131465] ---[ end trace 742180202e4ffeea ]---
  [   91.578040] RIP: 0010:memcpy_erms+0x6/0x10
  [   91.578993] Code: cc cc cc cc eb 1e 0f 1f 00 48 89 f8 48 89 d1 48 c1 e9 03
  83 e2 07 f3 48 a5 89 d1 f3 a4 c3 66 0f 1f 44 00 00 48 89 f8 48 89 d1 <f3> a4
  c3 0f 1f 80 00 00 00 00 48 89 f8 48 83 fa 20 72 7e 40 38 fe
  [   91.581955] RSP: 0018:ffffa9c1408e4b60 EFLAGS: 00010282
  [   91.582991] RAX: ffff938cd8e48000 RBX: 0000000000001000 RCX:
  0000000000001000
  [   91.584231] RDX: 0000000000001000 RSI: bb62fcf4fd5bf3d6 RDI:
  ffff938cd8e48000
  [   91.585471] RBP: ffffa9c1408e4c00 R08: ffffef2745639200 R09:
  0000000000000000
  [   91.586720] R10: ffffef27456399c8 R11: 0000000000004209 R12:
  0000000000001000
  [   91.587967] R13: ffffef2745639200 R14: 0000000000001000 R15:
  000000000d558380
  [   91.589212] FS:  0000000000000000(0000) GS:ffff938df4300000(0000)
  knlGS:0000000000000000
  [   91.590574] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [   91.591673] CR2: 00005573bf7ac958 CR3: 0008000105c12006 CR4:
  0000000000770ee0
  [   91.592931] PKRU: 55555554
  [   91.593709] Kernel panic - not syncing: Fatal exception in interrupt
  [   91.604082] Kernel Offset: 0x30200000 from 0xffffffff81000000 (relocation
  range: 0xffffffff80000000-0xffffffffbfffffff)
  [   92.049672] ---[ end Kernel panic - not syncing: Fatal exception in
  interrupt ]---

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-oracle-5.11/+bug/1980884/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to