FTR: I'm seeing exactly the same behavior when testing with "MT2892
Family [ConnectX-6 Dx]" on 24.04 LTS. Was this actually ever working in
Ubuntu on this hardware model?

ubuntu@romano:~$ lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 24.04.2 LTS
Release:        24.04
Codename:       noble
ubuntu@romano:~$ uname -a
Linux romano 6.8.0-57-generic #59-Ubuntu SMP PREEMPT_DYNAMIC Sat Mar 15 
17:40:59 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux

ubuntu@romano:~$ sudo cat /sys/kernel/debug/mlx5/0000:61:00.0/lag/state
disabled
ubuntu@romano:~$ sudo cat /sys/kernel/debug/mlx5/0000:61:00.1/lag/state
disabled
ubuntu@romano:~$ sudo devlink dev eswitch show pci/0000:61:00.0
pci/0000:61:00.0: mode legacy inline-mode none encap-mode basic
ubuntu@romano:~$ sudo devlink dev eswitch show pci/0000:61:00.1
pci/0000:61:00.1: mode legacy inline-mode none encap-mode basic

=> Broken. switchdev & VF-LAG disabled after reboot.

ubuntu@romano:~$ sudo netplan apply --sriov-only
ubuntu@romano:~$ sudo devlink dev eswitch show pci/0000:61:00.1
pci/0000:61:00.1: mode switchdev inline-mode none encap-mode basic
ubuntu@romano:~$ sudo devlink dev eswitch show pci/0000:61:00.0
pci/0000:61:00.0: mode switchdev inline-mode none encap-mode basic
ubuntu@romano:~$ sudo cat /sys/kernel/debug/mlx5/0000:61:00.1/lag/state
active
ubuntu@romano:~$ sudo cat /sys/kernel/debug/mlx5/0000:61:00.0/lag/state
active

=> Fixed state after issuing "netplan apply --sriov-only"


Reference:
ubuntu@romano:~$ sudo netplan get
network:
  version: 2
  ethernets:
    ens13f0np0:
      match:
        macaddress: "84:16:0c:3d:63:ce"
      addresses:
      - "10.241.7.26/24"
      nameservers:
        addresses:
        - 10.239.8.12
        - 10.239.8.13
        - 10.239.8.11
        - 10.176.2.3
        - 10.176.2.2
        - 10.176.2.4
        search:
        - maas
        - dh1-j8-1.tor3-sqa-shared-maas.solutionsqa
        - dh1-j8-2.tor3-sqa-shared-maas.solutionsqa
        - dh1-j9-1.tor3-sqa-shared-maas.solutionsqa
        - dh1-j9-2.tor3-sqa-shared-maas.solutionsqa
      gateway4: 10.241.7.1
      set-name: "ens13f0np0"
      mtu: 1500
    ens13f1np1:
      match:
        macaddress: "84:16:0c:3d:63:cf"
      set-name: "ens13f1np1"
      mtu: 1500
    ens7f0:
      match:
        macaddress: "b8:3f:d2:2d:68:7e"
      optional: true
      set-name: "ens7f0"
      mtu: 1500
      virtual-function-count: 8
      embedded-switch-mode: "switchdev"
      delay-virtual-functions-rebind: true
    ens7f1:
      match:
        macaddress: "b8:3f:d2:2d:68:7f"
      set-name: "ens7f1"
      mtu: 1500
      virtual-function-count: 8
      embedded-switch-mode: "switchdev"
      delay-virtual-functions-rebind: true
  bonds:
    bond0:
      interfaces:
      - ens7f0
      - ens7f1
      parameters:
        mode: "active-backup"

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1988018

Title:
  [SRU][mlx5] Intermittent VF-LAG activation failure

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1988018/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to