[Bug 1958952] Re: ARM64 node dmesg spammed with "mlx5_core 0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 0x5a5aa9"

2022-01-27 Thread Po-Hsu Lin
** Also affects: ubuntu-kernel-tests
   Importance: Undecided
   Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1958952

Title:
  ARM64 node dmesg spammed with "mlx5_core 0005:01:00.0:
  mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ
  0x5a5aa9"

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1958952/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1958952] Re: ARM64 node dmesg spammed with "mlx5_core 0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 0x5a5aa9"

2022-01-25 Thread Po-Hsu Lin
** Description changed:

  While investigating the SRU deployment failure, I noticed the dmesg will
  be spammed with:
  
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885627] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 
0x5a5aa9
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885628] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1218): Completion event for bogus CQ 
0x5a5aa9
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885629] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 
0x5a5aa9
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885631] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 
0x5a5aa9
  
  Issue found with Focal 5.4.0-96-generic
  
  Please find attachment for the syslog.
  
  Not sure if this is cause of our deployment issue, but it seems odd to me.
- Deployment issue:
-   1. System successfully deployed with Focal
-   2. Deployment process hangs with "Enabling PPA" stage
-   3. I cannot connect to this system manually, ssh hangs (soft lockup maybe?) 
after:
- Warning: Permanently added '10.229.50.13' (ECDSA) to the list of 
known hosts.
- 
+ And here is our deployment issue:
+   1. System successfully deployed with Focal
+   2. Deployment process hangs with "Enabling PPA" stage
+   3. I cannot connect to this system manually, ssh hangs (soft lockup maybe?) 
after:
+ Warning: Permanently added '10.229.50.13' (ECDSA) to the list of 
known hosts.
  
  ProblemType: Bug
  DistroRelease: Ubuntu 20.04
  Package: linux-image-5.4.0-96-generic 5.4.0-96.109
  ProcVersionSignature: Ubuntu 5.4.0-96.109-generic 5.4.157
  Uname: Linux 5.4.0-96-generic aarch64
  AlsaDevices:
   total 0
   crw-rw 1 root audio 116,  1 Jan 25 07:48 seq
   crw-rw 1 root audio 116, 33 Jan 25 07:48 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.11-0ubuntu27.21
  Architecture: arm64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CasperMD5CheckResult: skip
  Date: Tue Jan 25 07:53:33 2022
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb:
   Bus 001 Device 004: ID 12d1:0003 Huawei Technologies Co., Ltd.
   Bus 001 Device 003: ID 0424:2514 Microchip Technology, Inc. (formerly SMSC) 
USB 2.0 Hub
   Bus 001 Device 002: ID 0424:2514 Microchip Technology, Inc. (formerly SMSC) 
USB 2.0 Hub
   Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  Lsusb-t:
   /:  Bus 01.Port 1: Dev 1, Class=root_hub, Driver=ehci-platform/2p, 480M
   |__ Port 1: Dev 2, If 0, Class=Hub, Driver=hub/4p, 480M
   |__ Port 2: Dev 3, If 0, Class=Hub, Driver=hub/4p, 480M
   |__ Port 1: Dev 4, If 1, Class=Human Interface Device, 
Driver=usbhid, 12M
   |__ Port 1: Dev 4, If 0, Class=Human Interface Device, 
Driver=usbhid, 12M
  MachineType: Hisilicon D05
  PciMultimedia:
  
  ProcFB: 0 hibmcdrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.4.0-96-generic 
root=UUID=3abb8e5a-2f46-4221-b664-cb02a273a249 ro sysrq_always_enabled
  RelatedPackageVersions:
   linux-restricted-modules-5.4.0-96-generic N/A
   linux-backports-modules-5.4.0-96-generic  N/A
   linux-firmware1.187.25
  RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 06/01/2018
  dmi.bios.vendor: Huawei
  dmi.bios.version: 1.50
  dmi.board.asset.tag: To be filled by O.E.M.
  dmi.board.name: BC11SPCD
  dmi.board.vendor: Huawei
  dmi.board.version: VER.A
  dmi.chassis.asset.tag: To be filled by O.E.M.
  dmi.chassis.type: 17
  dmi.chassis.vendor: Hisilicon
  dmi.chassis.version: To be filled by O.E.M.
  dmi.modalias: 
dmi:bvnHuawei:bvr1.50:bd06/01/2018:svnHisilicon:pnD05:pvrV100R001C00:rvnHuawei:rnBC11SPCD:rvrVER.A:cvnHisilicon:ct17:cvrTobefilledbyO.E.M.:
  dmi.product.family: To be filled by O.E.M.
  dmi.product.name: D05
  dmi.product.sku: To be filled by O.E.M.
  dmi.product.version: V100R001C00
  dmi.sys.vendor: Hisilicon

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1958952

Title:
  ARM64 node dmesg spammed with "mlx5_core 0005:01:00.0:
  mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ
  0x5a5aa9"

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1958952/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1958952] Re: ARM64 node dmesg spammed with "mlx5_core 0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 0x5a5aa9"

2022-01-25 Thread Po-Hsu Lin
** Description changed:

  While investigating the SRU deployment failure, I noticed the dmesg will
  be spammed with:
  
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885627] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 
0x5a5aa9
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885628] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1218): Completion event for bogus CQ 
0x5a5aa9
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885629] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 
0x5a5aa9
  Jan 25 07:48:36 appleton-kernel kernel: [   22.885631] mlx5_core 
0005:01:00.0: mlx5_eq_comp_int:159:(pid 1180): Completion event for bogus CQ 
0x5a5aa9
  
- 
  Issue found with Focal 5.4.0-96-generic
  
- Please find attachment for syslog
+ Please find attachment for the syslog.
+ 
+ Not sure if this is cause of our deployment issue, but it seems odd to me.
+ Deployment issue:
+   1. System successfully deployed with Focal
+   2. Deployment process hangs with "Enabling PPA" stage
+   3. I cannot connect to this system manually, ssh hangs (soft lockup maybe?) 
after:
+ Warning: Permanently added '10.229.50.13' (ECDSA) to the list of 
known hosts.
+ 
  
  ProblemType: Bug
  DistroRelease: Ubuntu 20.04
  Package: linux-image-5.4.0-96-generic 5.4.0-96.109
  ProcVersionSignature: Ubuntu 5.4.0-96.109-generic 5.4.157
  Uname: Linux 5.4.0-96-generic aarch64
  AlsaDevices:
-  total 0
-  crw-rw 1 root audio 116,  1 Jan 25 07:48 seq
-  crw-rw 1 root audio 116, 33 Jan 25 07:48 timer
+  total 0
+  crw-rw 1 root audio 116,  1 Jan 25 07:48 seq
+  crw-rw 1 root audio 116, 33 Jan 25 07:48 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.20.11-0ubuntu27.21
  Architecture: arm64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', 
'/dev/snd/timer'] failed with exit code 1:
  CasperMD5CheckResult: skip
  Date: Tue Jan 25 07:53:33 2022
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
  Lsusb:
-  Bus 001 Device 004: ID 12d1:0003 Huawei Technologies Co., Ltd. 
-  Bus 001 Device 003: ID 0424:2514 Microchip Technology, Inc. (formerly SMSC) 
USB 2.0 Hub
-  Bus 001 Device 002: ID 0424:2514 Microchip Technology, Inc. (formerly SMSC) 
USB 2.0 Hub
-  Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
+  Bus 001 Device 004: ID 12d1:0003 Huawei Technologies Co., Ltd.
+  Bus 001 Device 003: ID 0424:2514 Microchip Technology, Inc. (formerly SMSC) 
USB 2.0 Hub
+  Bus 001 Device 002: ID 0424:2514 Microchip Technology, Inc. (formerly SMSC) 
USB 2.0 Hub
+  Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  Lsusb-t:
-  /:  Bus 01.Port 1: Dev 1, Class=root_hub, Driver=ehci-platform/2p, 480M
-  |__ Port 1: Dev 2, If 0, Class=Hub, Driver=hub/4p, 480M
-  |__ Port 2: Dev 3, If 0, Class=Hub, Driver=hub/4p, 480M
-  |__ Port 1: Dev 4, If 1, Class=Human Interface Device, 
Driver=usbhid, 12M
-  |__ Port 1: Dev 4, If 0, Class=Human Interface Device, 
Driver=usbhid, 12M
+  /:  Bus 01.Port 1: Dev 1, Class=root_hub, Driver=ehci-platform/2p, 480M
+  |__ Port 1: Dev 2, If 0, Class=Hub, Driver=hub/4p, 480M
+  |__ Port 2: Dev 3, If 0, Class=Hub, Driver=hub/4p, 480M
+  |__ Port 1: Dev 4, If 1, Class=Human Interface Device, 
Driver=usbhid, 12M
+  |__ Port 1: Dev 4, If 0, Class=Human Interface Device, 
Driver=usbhid, 12M
  MachineType: Hisilicon D05
  PciMultimedia:
-  
+ 
  ProcFB: 0 hibmcdrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.4.0-96-generic 
root=UUID=3abb8e5a-2f46-4221-b664-cb02a273a249 ro sysrq_always_enabled
  RelatedPackageVersions:
-  linux-restricted-modules-5.4.0-96-generic N/A
-  linux-backports-modules-5.4.0-96-generic  N/A
-  linux-firmware1.187.25
+  linux-restricted-modules-5.4.0-96-generic N/A
+  linux-backports-modules-5.4.0-96-generic  N/A
+  linux-firmware1.187.25
  RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 06/01/2018
  dmi.bios.vendor: Huawei
  dmi.bios.version: 1.50
  dmi.board.asset.tag: To be filled by O.E.M.
  dmi.board.name: BC11SPCD
  dmi.board.vendor: Huawei
  dmi.board.version: VER.A
  dmi.chassis.asset.tag: To be filled by O.E.M.
  dmi.chassis.type: 17
  dmi.chassis.vendor: Hisilicon
  dmi.chassis.version: To be filled by O.E.M.
  dmi.modalias: 
dmi:bvnHuawei:bvr1.50:bd06/01/2018:svnHisilicon:pnD05:pvrV100R001C00:rvnHuawei:rnBC11SPCD:rvrVER.A:cvnHisilicon:ct17:cvrTobefilledbyO.E.M.:
  dmi.product.family: To be filled by O.E.M.
  dmi.product.name: D05
  dmi.product.sku: To be filled by O.E.M.
  dmi.product.version: V100R001C00
  dmi.sys.vendor: Hisilicon

-- 
You received this bug notification because you are a member of Ubun