[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
** Changed in: ubuntu-kernel-tests Status: In Progress => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: Fix Released Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Fix Released Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
This bug was fixed in the package linux - 5.4.0-71.79 --- linux (5.4.0-71.79) focal; urgency=medium * focal/linux: 5.4.0-71.79 -proposed tracker (LP: #1921040) * selftests: bpf verifier fails after sanitize_ptr_alu fixes (LP: #1920995) - bpf: Simplify alu_limit masking for pointer arithmetic - bpf: Add sanity check for upper ptr_limit - bpf, selftests: Fix up some test_verifier cases for unprivileged * Packaging resync (LP: #1786013) - update dkms package versions * Fix missing HDMI/DP audio on NVidia card after S3 (LP: #1918228) - ALSA: hda/hdmi: Reduce hda_jack_tbl lookup at unsol event handling - ALSA: hda/hdmi: Don't use standard hda_jack for generic HDMI jacks - ALSA: hda/hdmi: Move runtime PM resume into hdmi_present_sense_via_verbs() - ALSA: hda/hdmi: Move ELD parse and jack reporting into update_eld() * Focal update: v5.4.101 upstream stable release (LP: #1918170) - HID: make arrays usage and value to be the same - USB: quirks: sort quirk entries - usb: quirks: add quirk to start video capture on ELMO L-12F document camera reliable - ntfs: check for valid standard information attribute - arm64: tegra: Add power-domain for Tegra210 HDA - scripts: use pkg-config to locate libcrypto - scripts: set proper OpenSSL include dir also for sign-file - mm: unexport follow_pte_pmd - mm: simplify follow_pte{,pmd} - KVM: do not assume PTE is writable after follow_pfn - mm: provide a saner PTE walking API for modules - KVM: Use kvm_pfn_t for local PFN variable in hva_to_pfn_remapped() - NET: usb: qmi_wwan: Adding support for Cinterion MV31 - cxgb4: Add new T6 PCI device id 0x6092 - cifs: Set CIFS_MOUNT_USE_PREFIX_PATH flag on setting cifs_sb->prepath. - scripts/recordmcount.pl: support big endian for ARCH sh - Linux 5.4.101 * Focal update: v5.4.100 upstream stable release (LP: #1918168) - KVM: SEV: fix double locking due to incorrect backport - net: qrtr: Fix port ID for control messages - net: bridge: Fix a warning when del bridge sysfs - Xen/x86: don't bail early from clear_foreign_p2m_mapping() - Xen/x86: also check kernel mapping in set_foreign_p2m_mapping() - Xen/gntdev: correct dev_bus_addr handling in gntdev_map_grant_pages() - Xen/gntdev: correct error checking in gntdev_map_grant_pages() - xen/arm: don't ignore return errors from set_phys_to_machine - xen-blkback: don't "handle" error by BUG() - xen-netback: don't "handle" error by BUG() - xen-scsiback: don't "handle" error by BUG() - xen-blkback: fix error handling in xen_blkbk_map() - media: pwc: Use correct device for DMA - btrfs: fix backport of 2175bf57dc952 in 5.4.95 - Linux 5.4.100 * Focal update: v5.4.99 upstream stable release (LP: #1918167) - gpio: ep93xx: fix BUG_ON port F usage - gpio: ep93xx: Fix single irqchip with multi gpiochips - tracing: Do not count ftrace events in top level enable output - tracing: Check length before giving out the filter buffer - arm/xen: Don't probe xenbus as part of an early initcall - cgroup: fix psi monitor for root cgroup - arm64: dts: rockchip: Fix PCIe DT properties on rk3399 - arm64: dts: qcom: sdm845: Reserve LPASS clocks in gcc - ARM: OMAP2+: Fix suspcious RCU usage splats for omap_enter_idle_coupled - platform/x86: hp-wmi: Disable tablet-mode reporting by default - ovl: perform vfs_getxattr() with mounter creds - cap: fix conversions on getxattr - ovl: skip getxattr of security labels - nvme-pci: ignore the subsysem NQN on Phison E16 - drm/amd/display: Add more Clock Sources to DCN2.1 - drm/amd/display: Fix dc_sink kref count in emulated_link_detect - drm/amd/display: Free atomic state after drm_atomic_commit - drm/amd/display: Decrement refcount of dc_sink before reassignment - riscv: virt_addr_valid must check the address belongs to linear mapping - bfq-iosched: Revert "bfq: Fix computation of shallow depth" - ARM: dts: lpc32xx: Revert set default clock rate of HCLK PLL - ARM: ensure the signal page contains defined contents - ARM: kexec: fix oops after TLB are invalidated - vmlinux.lds.h: Create section for protection against instrumentation - lkdtm: don't move ctors to .rodata - mt76: dma: fix a possible memory leak in mt76_add_fragment() - drm/vc4: hvs: Fix buffer overflow with the dlist handling - bpf: Check for integer overflow when using roundup_pow_of_two() - netfilter: xt_recent: Fix attempt to update deleted entry - netfilter: nftables: fix possible UAF over chains from packet path in netns - netfilter: flowtable: fix tcp and udp header checksum update - xen/netback: avoid race in xenvif_rx_ring_slots_available() - net: enetc: initialize the RFS and RSS memories - selftests: txtimestamp: fix compilation issue - net: stmmac: set TxQ mode back to DCB after
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
Verified on node entei with Focal kernel, AHCI skipped as expected: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Skipped: ahci doesn't support recovery 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 3 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:10:00.0... 0021:10:00.0, Recovered after 0 seconds Breaking 0022:01:00.0... 0022:01:00.0, waited 0/60 0022:01:00.0, waited 1/60 0022:01:00.0, waited 2/60 0022:01:00.0, waited 3/60 0022:01:00.0, waited 4/60 0022:01:00.0, Recovered after 5 seconds 0 devices failed to recover (3 tested) ./eeh-basic.sh: 89: test: 0: unexpected operator For the unexpected operator issue, please check bug 1909428 ** Tags removed: verification-needed-focal ** Tags added: verification-done-focal -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: In Progress Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Fix Committed Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed- focal' to 'verification-done-focal'. If the problem still exists, change the tag 'verification-needed-focal' to 'verification-failed-focal'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: verification-needed-focal -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: In Progress Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Fix Committed Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
** Tags added: ubuntu-kernel-selftests ** Tags added: 5.4 focal ppc64el -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: In Progress Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Fix Committed Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
** Changed in: linux (Ubuntu Focal) Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: In Progress Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Fix Committed Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
https://lists.ubuntu.com/archives/kernel-team/2021-February/117502.html ** Changed in: ubuntu-kernel-tests Status: New => In Progress ** Changed in: ubuntu-kernel-tests Assignee: (unassigned) => Po-Hsu Lin (cypressyew) ** Changed in: linux (Ubuntu Focal) Assignee: (unassigned) => Po-Hsu Lin (cypressyew) ** Changed in: linux (Ubuntu Focal) Status: Incomplete => In Progress -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: In Progress Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: In Progress Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
** Description changed: - Issue found on node entei with Focal kernel. + [Impact] + When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver: - When trying to run this test, it will try to break 4 devices on Focal, - and one of them is using the AHCI driver: - - $ sudo ./eeh-basic.sh + $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error - 0021:0e:00.0, waited 3/60 - ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found - And drop into read-only state, dmesg can be found in the attachment. + And drop into a read-only state + + [Fixes] + * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") + + This is only affecting Focal and it can be cherry-picked. + + [Test case] + Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. + + [Where problems could occur] + This fix is limited to PowerPC testing tool, it should not cause any issue. ** Description changed: - [Impact] - When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver: + [Impact] + When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
Hi Guilherme, yes this should be skipped in the test. Thanks ** Changed in: linux (Ubuntu) Status: Incomplete => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: New Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Incomplete Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
** Also affects: ubuntu-kernel-tests Importance: Undecided Status: New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: New Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Incomplete Bug description: Issue found on node entei with Focal kernel. When trying to run this test, it will try to break 4 devices on Focal, and one of them is using the AHCI driver: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 3/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into read-only state, dmesg can be found in the attachment. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1916468] Re: powerpc/eeh-basic.sh in kselftest make P8 node stopped working
Hi Po-Hsu Lin, I think AHCI has no native support for EEH; the last news I found is an attempt to include such support from 2015, but got denied upstream [0]. When a driver has no native support, EEH works by using what is called the hotplug approach, which is to PCI-remove the device. When it comes to storage devices with filesystem mounted and in-flight I/O, this is very dangerous and prone to failure. So, I'm not sure how this test works, but one alternative would be skip testing with AHCI, or at least test it with no/idle filesystem mounted. Cheers, Guilherme [0] https://patchwork.ozlabs.org/project/linux-ide/patch/1431622517-5851-1-git-send-email-wenxi...@linux.vnet.ibm.com/ -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in linux package in Ubuntu: Incomplete Status in linux source package in Focal: Incomplete Bug description: Issue found on node entei with Focal kernel. When trying to run this test, it will try to break 4 devices on Focal, and one of them is using the AHCI driver: $ sudo ./eeh-basic.sh :00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 3/60 ./eeh-basic.sh: 74: sleep: Input/output error ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into read-only state, dmesg can be found in the attachment. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp