** Description changed:
- Issue found on node entei with Focal kernel.
+ [Impact]
+ When trying to run this test on P8 node entei with Focal kernel, it will try
to break 4 devices on Focal, and one of them is using the AHCI driver:
- When trying to run this test, it will try to break 4 devices on Focal,
- and one of them is using the AHCI driver:
-
- $ sudo ./eeh-basic.sh
+ $ sudo ./eeh-basic.sh
0000:00:00.0, Skipped: bridge
0001:00:00.0, Skipped: bridge
0020:00:00.0, Skipped: bridge
0021:00:00.0, Skipped: bridge
0021:01:00.0, Skipped: bridge
0021:02:01.0, Skipped: bridge
0021:02:08.0, Skipped: bridge
0021:02:09.0, Skipped: bridge
0021:02:0a.0, Skipped: bridge
0021:02:0b.0, Skipped: bridge
0021:02:0c.0, Skipped: bridge
0021:0d:00.0, Added
0021:0e:00.0, Added
0021:0f:00.0, Skipped: bridge
0021:10:00.0, Added
0022:00:00.0, Skipped: bridge
0022:01:00.0, Added
Found 4 breakable devices...
Breaking 0021:0d:00.0...
0021:0d:00.0, waited 0/60
0021:0d:00.0, waited 1/60
0021:0d:00.0, waited 2/60
0021:0d:00.0, waited 3/60
0021:0d:00.0, waited 4/60
0021:0d:00.0, waited 5/60
0021:0d:00.0, waited 6/60
0021:0d:00.0, waited 7/60
0021:0d:00.0, waited 8/60
0021:0d:00.0, Recovered after 9 seconds
Breaking 0021:0e:00.0...
0021:0e:00.0, waited 0/60
0021:0e:00.0, waited 1/60
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, waited 2/60
./eeh-basic.sh: 74: sleep: Input/output error
- 0021:0e:00.0, waited 3/60
- ./eeh-basic.sh: 74: sleep: Input/output error
....
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, waited 59/60
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, waited 60/60
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, Failed to recover!
Breaking 0021:10:00.0...
Skipping 0021:10:00.0, Initial PE state is not ok
Breaking 0022:01:00.0...
Skipping 0022:01:00.0, Initial PE state is not ok
3 devices failed to recover (4 tested)
./eeh-basic.sh: 81: lspci: Input/output error
./eeh-basic.sh: 81: diff: Input/output error
./eeh-basic.sh: 82: rm: Input/output error
./eeh-basic.sh: 84: test: 3: unexpected operator
With the driver failed to recovery, the system will start acting up.
$ ls
ls: command not found
- And drop into read-only state, dmesg can be found in the attachment.
+ And drop into a read-only state
+
+ [Fixes]
+ * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters")
+
+ This is only affecting Focal and it can be cherry-picked.
+
+ [Test case]
+ Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the
affected P8 node, the test should pass without any issue.
+
+ [Where problems could occur]
+ This fix is limited to PowerPC testing tool, it should not cause any issue.
** Description changed:
- [Impact]
- When trying to run this test on P8 node entei with Focal kernel, it will try
to break 4 devices on Focal, and one of them is using the AHCI driver:
+ [Impact]
+ When trying to run this test on P8 node entei with Focal kernel, it will try
to break 4 devices on Focal, and one of them is using the AHCI driver which
doesn't support error recovery:
$ sudo ./eeh-basic.sh
0000:00:00.0, Skipped: bridge
0001:00:00.0, Skipped: bridge
0020:00:00.0, Skipped: bridge
0021:00:00.0, Skipped: bridge
0021:01:00.0, Skipped: bridge
0021:02:01.0, Skipped: bridge
0021:02:08.0, Skipped: bridge
0021:02:09.0, Skipped: bridge
0021:02:0a.0, Skipped: bridge
0021:02:0b.0, Skipped: bridge
0021:02:0c.0, Skipped: bridge
0021:0d:00.0, Added
0021:0e:00.0, Added
0021:0f:00.0, Skipped: bridge
0021:10:00.0, Added
0022:00:00.0, Skipped: bridge
0022:01:00.0, Added
Found 4 breakable devices...
Breaking 0021:0d:00.0...
0021:0d:00.0, waited 0/60
0021:0d:00.0, waited 1/60
0021:0d:00.0, waited 2/60
0021:0d:00.0, waited 3/60
0021:0d:00.0, waited 4/60
0021:0d:00.0, waited 5/60
0021:0d:00.0, waited 6/60
0021:0d:00.0, waited 7/60
0021:0d:00.0, waited 8/60
0021:0d:00.0, Recovered after 9 seconds
Breaking 0021:0e:00.0...
0021:0e:00.0, waited 0/60
0021:0e:00.0, waited 1/60
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, waited 2/60
./eeh-basic.sh: 74: sleep: Input/output error
....
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, waited 59/60
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, waited 60/60
./eeh-basic.sh: 74: sleep: Input/output error
0021:0e:00.0, Failed to recover!
Breaking 0021:10:00.0...
Skipping 0021:10:00.0, Initial PE state is not ok
Breaking 0022:01:00.0...
Skipping 0022:01:00.0, Initial PE state is not ok
3 devices failed to recover (4 tested)
./eeh-basic.sh: 81: lspci: Input/output error
./eeh-basic.sh: 81: diff: Input/output error
./eeh-basic.sh: 82: rm: Input/output error
./eeh-basic.sh: 84: test: 3: unexpected operator
With the driver failed to recovery, the system will start acting up.
$ ls
ls: command not found
And drop into a read-only state
[Fixes]
* bbe9064f30f06e ("selftests/eeh: Skip ahci adapters")
This is only affecting Focal and it can be cherry-picked.
[Test case]
Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the
affected P8 node, the test should pass without any issue.
[Where problems could occur]
This fix is limited to PowerPC testing tool, it should not cause any issue.
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1916468
Title:
powerpc/eeh-basic.sh in kselftest make P8 node stopped working
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs