On Fri, Feb 06, 2026 at 10:23:11AM -0800, Breno Leitao wrote: > When a device lacks an error_detected callback, AER recovery fails and > the device is left in a disconnected state. This can mask serious > hardware issues during development and testing. > > Add a module parameter 'aer_unrecoverable_fatal' that panics the kernel > instead, making such failures immediately visible. The parameter > defaults to false to preserve existing behavior.
There's a parallel effort by Terry Bowman (+cc) to introduce a PCI_ERS_RESULT_PANIC return value for error handling: https://lore.kernel.org/all/[email protected]/ Please consider using that as the basis for your needs. Thanks, Lukas
