[Bug 1869032] Re: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8
Eoan EOL, closing this bug. ** Changed in: ubuntu-kernel-tests Status: New => Invalid ** Changed in: linux (Ubuntu) Status: Confirmed => Invalid -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1869032 Title: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1869032/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1869032] Re: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8
The jenkins job for sru-misc has successfully completed on node modoc with 5.3.0-43 without any hang issue. (I don't have a chance to check for syslog, but since it's not hanging I guess it's fine). I've also tested the net/tls test in the selftest for 100 times on modoc with 5.3.0-43, passed with oops: for i in $(seq 1 100); do sudo ./tls; done Furthermore, I had it tested with the net test suite for 100 time, passed without oops: for i in $(seq 1 100); do echo "== cycle $i ==" | sudo tee /dev/kmsg; sudo make run_tests TARGETS=net; done -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1869032 Title: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1869032/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1869032] Re: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8
I can finish running this test manually with autotest framework locally on this affected node modoc with 5.3.0-43: AUTOTEST_PATH=/home/ubuntu/autotest sudo -E autotest/client/autotest-local --verbose autotest/client/tests/ubuntu_kernel_selftests/control And the test suite can finish without tripping this issue. But during the test I noticed that there will be another "missing remote IOA" error in dmesg: [ 353.854103] test_bpf: Summary: 378 PASSED, 0 FAILED, [366/366 JIT'ed] [ 353.854127] test_bpf: test_skb_segment: success in skb_segment! [ 359.982427] u32 classifier [ 359.982431] input device check on [ 359.982432] Actions configured [ 360.023690] gre: GRE over IPv4 demultiplexor driver [ 360.027718] ip_gre: GRE over IPv4 tunneling driver [ 360.231910] ip6_gre: GRE over IPv6 tunneling driver [ 361.139317] bridge: filtering via arp/ip/ip6tables is no longer available by default. Update your scripts to load br_netfilter if you need this. [ 361.141793] test-br0: port 1(test-dummy0) entered blocking state [ 361.141796] test-br0: port 1(test-dummy0) entered disabled state [ 361.141929] device test-dummy0 entered promiscuous mode [ 361.143982] test-br0: port 1(test-dummy0) entered blocking state [ 361.143984] test-br0: port 1(test-dummy0) entered forwarding state [ 361.166931] 8021q: 802.1Q VLAN Support v1.8 [ 361.276750] device test-dummy0 left promiscuous mode [ 361.276826] test-br0: port 1(test-dummy0) entered disabled state [ 363.318680] MACsec IEEE 802.1AE [ 363.401018] Initializing XFRM netlink socket [ 365.457161] IPv6: ADDRCONF(NETDEV_CHANGE): veth0: link becomes ready [ 365.462267] IPv6: ADDRCONF(NETDEV_CHANGE): veth0: link becomes ready [ 365.464936] IPv6: ADDRCONF(NETDEV_CHANGE): veth0: link becomes ready [ 365.550548] bpfilter: Loaded bpfilter_umh pid 20653 [ 366.415447] IPv6: ADDRCONF(NETDEV_CHANGE): eth1: link becomes ready [ 397.070039] ipr 0001:08:00.0: 9076: Configuration error, missing remote IOA [ 397.070068] ipr 0001:08:00.0: Attached Adapter not discovered within allotted time [PRC: 17101541] [ 397.070077] ipr 0001:08:00.0: Remote IOA VPID/SN: [ 397.070084] ipr 0001:08:00.0: Remote IOA WWN: Maybe it's some combination issue with the sru-misc test suite, which contains the following tests and will be executed in the following order: 'hwclock', 'libhugetlbfs', 'ubuntu_bpf_jit', 'ubuntu_kernel_selftests', 'ubuntu_lxc', 'ubuntu_seccomp', 'ubuntu_unionmount_ovlfs', 'ubuntu_cts_kernel', 'ubuntu_kvm_unit_tests', -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1869032 Title: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1869032/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1869032] Re: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8
I can manually run the net suite in kselftest on 5.3.0-43-generic with modoc, by running the following command in Eoan tree: sudo make run_tests TARGETS=net The test can finish without tripping this issue. Also, I can see a "[ 28.249600] ipr 0001:08:00.0: 8150: Permanent IOA failure" message in boot dmesg, not sure if this means HW issue? https://www.ibm.com/support/knowledgecenter/TI0003N/p8ebk/urc_tables.htm ** Attachment added: "modoc.log" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1869032/+attachment/5341833/+files/modoc.log -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1869032 Title: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1869032/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1869032] Re: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8
This is what I saw on this Eoan P8 node modoc on the last cycle: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1867155 Which makes it failed to finish the test. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1869032 Title: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1869032/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 1869032] Re: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8
A quick search with keywork "tls" in Eoan tree brought me this: commit 299dfeeb7a216fe4dcfdd6ad0461ea93db72d389 Author: Jakub Kicinski Date: Fri Jan 10 04:38:32 2020 -0800 net/tls: fix async operation BugLink: https://bugs.launchpad.net/bugs/1864710 commit db885e66d268884dc72967279b7e84f522556abc upstream. Mallesham reports the TLS with async accelerator was broken by commit d10523d0b3d7 ("net/tls: free the record on encryption error") because encryption can return -EINPROGRESS in such setups, which should not be treated as an error. The error is also present in the BPF path (likely copied from there). Reported-by: Mallesham Jatharakonda Fixes: d3b18ad31f93 ("tls: add bpf support to sk_msg handling") Fixes: d10523d0b3d7 ("net/tls: free the record on encryption error") Signed-off-by: Jakub Kicinski Reviewed-by: Simon Horman Signed-off-by: David S. Miller Signed-off-by: Greg Kroah-Hartman Signed-off-by: Kamal Mostafa Signed-off-by: Khalid Elmously $ git tag --contains 299dfeeb7a216fe4dcfdd6ad0461ea93db72d389 Ubuntu-5.3.0-43.35 Ubuntu-5.3.0-43.36 Need to check if this is the cause. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1869032 Title: Kernel oopsed and null pointer dereference while running ubuntu_kernel_selftests on Eoan Power8 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1869032/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs