Public bug reported:

[Impact]
The system crash after suspend on Intel E810 ethernet card with ice driver.

Error log:
<1>[  105.054720] BUG: kernel NULL pointer dereference, address: 
0000000000000048
<1>[  105.054721] #PF: supervisor read access in kernel mode
<1>[  105.054722] #PF: error_code(0x0000) - not-present page
<6>[  105.054723] PGD 0 P4D 0
<4>[  105.054725] Oops: Oops: 0000 [#1] PREEMPT SMP NOPTI
<4>[  105.054728] CPU: 7 UID: 0 PID: 3449 Comm: rtcwake Not tainted 
6.11.0-1022-oem #22-Ubuntu
<4>[  105.054729] Hardware name: LENOVO 94938272/3376, BIOS M5OKT18A 01/03/2025
<4>[  105.054730] RIP: 0010:netif_queue_set_napi+0x22/0x130
<4>[  105.054737] Code: 90 90 90 90 90 90 90 90 0f 1f 44 00 00 55 48 89 e5 41 
57 41 56 49 89 fe 41 55 41 89 d5 41 54 49 89 cc 53 89 f3 48 85 c9 74 07 <48> 83 
79 38 00 74 60 41 80 be 88 05 00 00 00 0f 85 9e 00 00 00 45
Oops#1 Part4
<4>[  105.054738] RSP: 0018:ffffb31f8d917950 EFLAGS: 00010202
<4>[  105.054739] RAX: ffff9f48ce0fca00 RBX: 0000000000000000 RCX: 
0000000000000010
<4>[  105.054740] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 
ffff9f48c934f000
<4>[  105.054740] RBP: ffffb31f8d917978 R08: 0000000000000000 R09: 
0000000000000000
<4>[  105.054740] R10: 0000000000000000 R11: 0000000000000000 R12: 
0000000000000010
<4>[  105.054741] R13: 0000000000000000 R14: ffff9f48c934f000 R15: 
0000000000000000
<4>[  105.054741] FS:  000072855f4c5740(0000) GS:ffff9f501ff80000(0000) 
knlGS:0000000000000000
<4>[  105.054742] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[  105.054742] CR2: 0000000000000048 CR3: 0000000323e8c006 CR4: 
0000000000f70ef0
<4>[  105.054743] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
<4>[  105.054743] DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 
0000000000000400
<4>[  105.054743] PKRU: 55555554
<4>[  105.054744] Call Trace:
<4>[  105.054746]  <TASK>
<4>[  105.054751]  ? show_regs+0x6c/0x80
<4>[  105.054755]  ? __die+0x24/0x80
<4>[  105.054756]  ? page_fault_oops+0x96/0x1b0
<4>[  105.054759]  ? do_user_addr_fault+0x4b2/0x870
<4>[  105.054760]  ? pci_write+0x53/0x90
<4>[  105.054763]  ? exc_page_fault+0x85/0x1c0
<4>[  105.054767]  ? asm_exc_page_fault+0x27/0x30
<4>[  105.054771]  ? netif_queue_set_napi+0x22/0x130
<4>[  105.054775]  ice_vsi_set_napi_queues+0x4b/0xe0 [ice]
<4>[  105.054815]  ice_resume+0xf8/0x290 [ice]

[Fix]
Add NULL pointer checks in ice_vsi_set_napi_queues() to prevent crashes
during resume from suspend when rings[q_idx]->q_vector is NULL.

After wakeup from suspend, IRDMA is initialized with error:
    
    kernel: ice 0000:60:00.0: IRDMA hardware initialization FAILED init_state=4 
status=-110
    kernel: irdma.gen_2 ice.roce.1: probe with driver irdma.gen_2 failed with 
error -110
    
IRDMA times out because the initialization before the schedule reset.
Initialize RDMA after rebuild to fix it.

Patches:
https://lore.kernel.org/lkml/[email protected]/

[Test]
Suspend 30 times by the command:
$ sudo rtcwake -m mem -s 20

[Where problems could occur]
It may break intel ethernet driver ice.

The patches are reviewed in intel-wired-lan maillist, but not merged yet.
So SRU for oem-6.17 only for now.

** Affects: hwe-next
     Importance: Medium
     Assignee: AaronMa (mapengyu)
         Status: In Progress

** Affects: linux (Ubuntu)
     Importance: Medium
         Status: In Progress

** Affects: linux-oem-6.17 (Ubuntu)
     Importance: Undecided
         Status: Invalid

** Affects: linux (Ubuntu Noble)
     Importance: Medium
         Status: In Progress

** Affects: linux-oem-6.17 (Ubuntu Noble)
     Importance: Medium
     Assignee: AaronMa (mapengyu)
         Status: In Progress

** Affects: linux (Ubuntu Plucky)
     Importance: Undecided
         Status: Invalid

** Affects: linux-oem-6.17 (Ubuntu Plucky)
     Importance: Undecided
         Status: Invalid

** Affects: linux (Ubuntu Questing)
     Importance: Medium
         Status: In Progress

** Affects: linux-oem-6.17 (Ubuntu Questing)
     Importance: Undecided
         Status: Invalid

** Affects: linux (Ubuntu Resolute)
     Importance: Medium
         Status: In Progress

** Affects: linux-oem-6.17 (Ubuntu Resolute)
     Importance: Undecided
         Status: Invalid


** Tags: jira-somerville-2718 jira-sutton-3182 oem-priority

** Tags added: jira-somerville-2718 oem-priority

** Tags added: jira-sutton-3182

** Also affects: linux-oem-6.17 (Ubuntu)
   Importance: Undecided
       Status: New

** Also affects: linux (Ubuntu Noble)
   Importance: Undecided
       Status: New

** Also affects: linux-oem-6.17 (Ubuntu Noble)
   Importance: Undecided
       Status: New

** Also affects: linux (Ubuntu Plucky)
   Importance: Undecided
       Status: New

** Also affects: linux-oem-6.17 (Ubuntu Plucky)
   Importance: Undecided
       Status: New

** Also affects: linux (Ubuntu Questing)
   Importance: Undecided
       Status: New

** Also affects: linux-oem-6.17 (Ubuntu Questing)
   Importance: Undecided
       Status: New

** Also affects: linux (Ubuntu Resolute)
   Importance: Undecided
       Status: New

** Also affects: linux-oem-6.17 (Ubuntu Resolute)
   Importance: Undecided
       Status: New

** Changed in: linux-oem-6.17 (Ubuntu Plucky)
       Status: New => Invalid

** Changed in: linux-oem-6.17 (Ubuntu Questing)
       Status: New => Invalid

** Changed in: linux-oem-6.17 (Ubuntu Resolute)
       Status: New => Invalid

** Changed in: linux-oem-6.17 (Ubuntu Noble)
       Status: New => In Progress

** Changed in: linux-oem-6.17 (Ubuntu Noble)
   Importance: Undecided => Medium

** Changed in: linux-oem-6.17 (Ubuntu Noble)
     Assignee: (unassigned) => AaronMa (mapengyu)

** Changed in: hwe-next
       Status: New => In Progress

** Changed in: hwe-next
     Assignee: (unassigned) => AaronMa (mapengyu)

** Changed in: hwe-next
   Importance: Undecided => Medium

** Changed in: linux (Ubuntu Noble)
   Importance: Undecided => Medium

** Changed in: linux (Ubuntu Noble)
       Status: New => In Progress

** Changed in: linux (Ubuntu Plucky)
       Status: New => Invalid

** Changed in: linux (Ubuntu Questing)
   Importance: Undecided => Medium

** Changed in: linux (Ubuntu Questing)
       Status: New => In Progress

** Changed in: linux (Ubuntu Resolute)
   Importance: Undecided => Medium

** Changed in: linux (Ubuntu Resolute)
       Status: New => In Progress

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2134584

Title:
  [SRU] Fix system crash of intel ice ethernet on E810

To manage notifications about this bug go to:
https://bugs.launchpad.net/hwe-next/+bug/2134584/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to