Public bug reported:
Description: s390/pci: Don't abort recovery for user-space drivers
Symptom:
When a PCI device under the control of a vfio-pci based user-space driver
encounters a PCI error event, the subsequent error recovery flow in the kernel
is aborted because the vfio-pci driver implements only the error_detected() PCI
error handler callback. This leaves the PCI device in the error state, so the
driver must be unbound and re-bound to make the device operational again,
instead of only the user-space driver having to be re-initialized.
Problem:
According to the kernel documentation, implementing only the error_detected()
callback from the error handling operations should be enough for minimal
recovery support. Contrary to this, s390 has so far also required the
slot_reset() and resume() callbacks to be implemented; otherwise recovery was
aborted.
Solution:
Remove the requirement for the additional operations, bringing s390 in line with
the AER and EEH error recovery flows.
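The change in the callback requirement can be illustrated with a small
stand-alone sketch. This is not the kernel code from the upstream commit: the
struct and function names below are hypothetical and only loosely modeled on
the kernel's struct pci_error_handlers, reduced to the three callbacks
mentioned in this report.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for the kernel's error handler table; only the
 * three callbacks discussed in this report are modeled. */
struct err_handlers_sketch {
	int (*error_detected)(void);
	int (*slot_reset)(void);
	void (*resume)(void);
};

/* Stricter check as described under "Problem": recovery is only
 * attempted if all three callbacks are present. */
static bool can_recover_strict(const struct err_handlers_sketch *h)
{
	return h && h->error_detected && h->slot_reset && h->resume;
}

/* Relaxed check as described under "Solution": error_detected() alone
 * is enough for minimal recovery support. */
static bool can_recover_minimal(const struct err_handlers_sketch *h)
{
	return h && h->error_detected;
}

/* Dummy callbacks so the handler tables below can be populated. */
static int demo_error_detected(void) { return 0; }
static int demo_slot_reset(void) { return 0; }
static void demo_resume(void) { }

/* A driver shaped like vfio-pci as described above: it provides
 * error_detected() only. */
static const struct err_handlers_sketch vfio_like = {
	.error_detected = demo_error_detected,
};

/* A driver that provides all three callbacks. */
static const struct err_handlers_sketch full = {
	.error_detected = demo_error_detected,
	.slot_reset = demo_slot_reset,
	.resume = demo_resume,
};
```

With these definitions, can_recover_strict(&vfio_like) is false while
can_recover_minimal(&vfio_like) is true, which mirrors the behaviour change
described above. A driver providing all three callbacks passes both checks.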
Reproduction:
The problem can be reproduced with any user-space PCI driver, such as the NVMe
user-space driver built into QEMU.
Required Fix / Upstream-ID:
62355f1f87b8c7f8785a8dd3cd5ca6e5b513566a
** Affects: linux (Ubuntu)
Importance: Undecided
Assignee: Skipper Bug Screeners (skipper-screen-team)
Status: New
** Tags: architecture-s39064 bugnameltc-214782 severity-high
targetmilestone-inin---
** Tags added: architecture-s39064 bugnameltc-214782 severity-high
targetmilestone-inin---
** Changed in: ubuntu
Assignee: (unassigned) => Skipper Bug Screeners (skipper-screen-team)
** Package changed: ubuntu => linux (Ubuntu)
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2121150
Title:
[UBUNTU 24.04] s390/pci: Don't abort recovery for user-space drivers