Bug#1025537: nfsd: Kernel Oops while serving NFS

2023-03-02 Thread Olivier Mehani
Source: linux
Followup-For: Bug #1025537

The 6.1.0 Changelog had some notes about a gnarly bug in the NFS bug 
having been fixed, so I thought I'd give it a try. Unfortunately, the 
problem reported here is still present. It has made the machine quite 
unstable, but I was able to retrieve logs of previous Oopses.

   2023-03-02T12:51:02.923421+11:00 supahwinch kernel: [0.00] Linux 
version 6.1.0-5-amd64 (debian-kernel@lists.debian.org) (gcc-12 (Debian 
12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40) #1 SMP 
PREEMPT_DYNAMIC Debian 6.1.12-1 (2023-02-15)
   [SNIP]-8<
   2023-03-02T13:11:21.468088+11:00 supahwinch kernel: [ 1262.015083] stack 
segment:  [#1] PREEMPT SMP NOPTI
   2023-03-02T13:11:21.473246+11:00 supahwinch kernel: [ 1262.019939] CPU: 0 
PID: 3345 Comm: nfsd Not tainted 6.1.0-5-amd64 #1  Debian 6.1.12-1
   2023-03-02T13:11:21.473258+11:00 supahwinch kernel: [ 1262.024779] Hardware 
name: HP ProLiant MicroServer, BIOS O41 10/01/2013
   2023-03-02T13:11:21.473261+11:00 supahwinch kernel: [ 1262.029613] RIP: 
0010:release_pages+0xcd/0x4d0
   2023-03-02T13:11:21.473264+11:00 supahwinch kernel: [ 1262.034410] Code: 84 
c0 74 1a 48 8b 04 24 48 8d 4c 24 30 49 89 46 08 48 89 44 24 30 4c 89 75 08 48 
89 4d 10 49 83 c7 08 4c 39 fb 74 75 49 8b 2f <48> 8b 45 08 a8 01 0f 85 58 01 00 
00 0f 1f 44 00 00 4d 85 ed 74 0e
   2023-03-02T13:11:21.473268+11:00 supahwinch kernel: [ 1262.044534] RSP: 
0018:bc550156fe30 EFLAGS: 00010206
   2023-03-02T13:11:21.473271+11:00 supahwinch kernel: [ 1262.049614] RAX: 
0007 RBX: 97c2ae3b4b78 RCX: bc550156fe60
   2023-03-02T13:11:21.473274+11:00 supahwinch kernel: [ 1262.054703] RDX: 
bc550156fe60 RSI: bc550156fe60 RDI: de0185ba2148
   2023-03-02T13:11:21.473277+11:00 supahwinch kernel: [ 1262.059850] RBP: 
000fc000 R08: de0180d57d08 R09: 00019198
   2023-03-02T13:11:21.473279+11:00 supahwinch kernel: [ 1262.064975] R10: 
0003 R11:  R12: 
   2023-03-02T13:11:21.473282+11:00 supahwinch kernel: [ 1262.070058] R13: 
 R14: de0180d57d08 R15: 97c2ae3b4b28
   2023-03-02T13:11:21.473284+11:00 supahwinch kernel: [ 1262.075101] FS:  
() GS:97c357c0() knlGS:
   2023-03-02T13:11:21.473287+11:00 supahwinch kernel: [ 1262.080166] CS:  0010 
DS:  ES:  CR0: 80050033
   2023-03-02T13:11:21.473290+11:00 supahwinch kernel: [ 1262.085268] CR2: 
7f6e0e512416 CR3: 000109c92000 CR4: 06f0
   2023-03-02T13:11:21.473292+11:00 supahwinch kernel: [ 1262.090377] Call 
Trace:
   2023-03-02T13:11:21.473295+11:00 supahwinch kernel: [ 1262.095402]  
   2023-03-02T13:11:21.473298+11:00 supahwinch kernel: [ 1262.100346]  ? 
nfsd_shutdown_threads+0x90/0x90 [nfsd]
   2023-03-02T13:11:21.473300+11:00 supahwinch kernel: [ 1262.105417]  
__pagevec_release+0x1b/0x30
   2023-03-02T13:11:21.473303+11:00 supahwinch kernel: [ 1262.110399]  
svc_xprt_release+0x1a3/0x1e0 [sunrpc]
   2023-03-02T13:11:21.473306+11:00 supahwinch kernel: [ 1262.115547]  
svc_send+0x59/0x160 [sunrpc]
   2023-03-02T13:11:21.473308+11:00 supahwinch kernel: [ 1262.120602]  
nfsd+0xd5/0x190 [nfsd]
   2023-03-02T13:11:21.473311+11:00 supahwinch kernel: [ 1262.125538]  
kthread+0xe9/0x110
   2023-03-02T13:11:21.473313+11:00 supahwinch kernel: [ 1262.130372]  ? 
kthread_complete_and_exit+0x20/0x20
   2023-03-02T13:11:21.473315+11:00 supahwinch kernel: [ 1262.135195]  
ret_from_fork+0x22/0x30
   2023-03-02T13:11:21.473318+11:00 supahwinch kernel: [ 1262.139953]  
   2023-03-02T13:11:21.473321+11:00 supahwinch kernel: [ 1262.144620] Modules 
linked in: veth xt_nat nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack_netlink 
xfrm_user xfrm_algo xt_addrtype br_netfilter bridge stp llc overlay cmac 
algif_hash ecb algif_skcipher af_alg bnep ip6t_REJECT nf_reject_ipv6 xt_tcpudp 
xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_compat nf_tables 
nfnetlink binfmt_misc amdgpu isofs gpu_sched cdrom drm_buddy xc2028 zl10353 
rc_fusionhdtv_mce ir_kbd_i2c cx23885 altera_ci tda18271 altera_stapl m88ds3103 
i2c_mux btusb btrtl cx2341x btbcm tveeprom btintel videobuf2_dvb dvb_core btmtk 
bluetooth videobuf2_dma_sg videobuf2_memops videobuf2_v4l2 amd64_edac 
jitterentropy_rng videobuf2_common edac_mce_amd radeon sha512_generic kvm_amd 
videodev video ctr ccp wmi mc drbg cp210x rng_core snd_pcm ansi_cprng usbserial 
drm_display_helper ecdh_generic snd_timer joydev snd rfkill kvm cec soundcore 
evdev ecc pcspkr rc_core drm_ttm_helper xfs ttm irqbypass drm_kms_helper 
sp5100_tco watchdog i2c_algo_bit acpi_cpufreq sg button
   2023-03-02T13:11:21.473327+11:00 supahwinch kernel: [ 1262.144868]  k10temp 
w83795 jc42 tun loop nfsd msr auth_rpcgss nfs_acl lockd parport_pc grace ppdev 
sunrpc lp drm parport efi_pstore fuse configfs ip_tables x_tables autofs4 ext4 
crc16 mbcache jbd2 btrfs blake2b_generic zstd_compress raid10 raid456 

Bug#1032262: Kernel builds past 5.3 seem to interfere with AMD dedicated GPU performance on Acer Nitro 5 AN515-42 laptop (almost certainly kernel config related)

2023-03-02 Thread Za Ra
Package: linux-image-amd64

Version: Any that include kernel versions post-5.3

Hello, I current run Debian 12 (Bookworm) on my Acer Nitro 5 AN515-42
laptop, but this issue has been present in Debian for quite a while.
I've noticed that the default kernel has some problems regarding the
hybrid Raven Ridge/RX560x setup in this machine.
No matter what 3D load is run, the performance of the GPU hardware
seems to be locked at its lowest state, resulting in extremely poor
performance. This happens regardless of DRI_PRIME setting.

Interestingly, this did NOT happen when I tested other distributions
on this hardware, such as Fedora and Ubuntu, regardless of kernel
version present.

The problem also disappears if I use the Xanmod kernel, which is a
third-party custom kernel not present in Debian's repositories.

Finally, out of curiosity, I compiled the mainline kernel for myself
the other day (version 6.1) using the kernel config present by default
in the third-party Xanmod kernel, and my compiled kernel resulted in
proper and expected performance across the board.

For these reasons, I believe the kernel config present in Debian is
somehow at fault. Unfortunately, despite looking at the diff between
the default kernel config and the custom one I used to build my
kernel, I don't really have a good idea of which settings were
responsible for the solution to this issue.

I would greatly appreciate it if anyone more knowledgeable than me
could help to investigate this and perhaps change the default Debian
kernel config upstream so that other users who might be affected by
this bug can benefit from the solution without having to compile their
own kernels.


Processed: Re: Processed (with 1 error): Re: Bug#1015272: liburing autopkgtest started to hang containers in Debian and Ubuntu since ~2022-07-11

2023-03-02 Thread Debian Bug Tracking System
Processing commands for cont...@bugs.debian.org:

> fixed 1015272 5.10.162-1
Bug #1015272 [linux] liburing autopkgtest started to hang containers in Debian 
and Ubuntu since ~2022-07-11
There is no source info for the package 'linux' at version '5.10.162-1' with 
architecture ''
Unable to make a source version for version '5.10.162-1'
Marked as fixed in versions 5.10.162-1.
> fixed 1015272 6.1.8-1
Bug #1015272 [linux] liburing autopkgtest started to hang containers in Debian 
and Ubuntu since ~2022-07-11
There is no source info for the package 'linux' at version '6.1.8-1' with 
architecture ''
Unable to make a source version for version '6.1.8-1'
Marked as fixed in versions 6.1.8-1.
> thanks
Stopping processing here.

Please contact me if you need assistance.
-- 
1015272: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1015272
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems



Bug#1015272: liburing autopkgtest started to hang containers in Debian and Ubuntu since ~2022-07-11

2023-03-02 Thread Guillem Jover
Hi!

On Thu, 2023-03-02 at 12:12:06 +0100, Paul Gevers wrote:
> Control: fixed -1 5.10.162-1 6.1.8-1

> On Sun, 21 Aug 2022 21:35:58 +0200 Bastian Blank  wrote:
> > On Sun, Aug 21, 2022 at 07:42:10PM +0200, Guillem Jover wrote:
> > > It seems like there was a regression with the latest stable update
> > > that affects the autopkgtest for liburing. Reassigning.
> > 
> > Please provide enough information to make isolating the problem
> > possible.
> > 
> > https://ci.debian.net/packages/libu/liburing/ is completely silent as
> > there are not results for any of the failed runs.

> I decided to try again to see if I could collect more information. The test
> now passes on amd64, arm64, i386 and ppc64el, all running 5.10.162-1 and on
> riscv64 running unstable. However, on armhf, armel (amd64 kernel) and s390x
> (all running 5.10.158-2), it seems that the observation of brian is still
> true, some test in test-unit test segfaults, the test exits and hangs.
> @Guillem, do you see something more in the output below (armhf log) that may
> be of interest? And maybe spot something to run in isolation?

> Reading the changelog of 5.10.162-1 I see io_uring mentioned a couple of
> times. Therefor I assume this bug is fixed in that version. Is it worth
> pursuing the real issue here?

Thanks for looking into this! As this appears fixed in latest Linux
releases, then I'd honestly not bother further, and just try to get the
remaining hosts upgraded to the fixed versions. If this was to reappear
then it might make sense to look into it again.

Regards,
Guillem



Bug#1015272: liburing autopkgtest started to hang containers in Debian and Ubuntu since ~2022-07-11

2023-03-02 Thread Paul Gevers

Control: fixed -1 5.10.162-1 6.1.8-1

Hi,

On Sun, 21 Aug 2022 21:35:58 +0200 Bastian Blank  wrote:

On Sun, Aug 21, 2022 at 07:42:10PM +0200, Guillem Jover wrote:
> It seems like there was a regression with the latest stable update
> that affects the autopkgtest for liburing. Reassigning.

Please provide enough information to make isolating the problem
possible.

https://ci.debian.net/packages/libu/liburing/ is completely silent as
there are not results for any of the failed runs.


I decided to try again to see if I could collect more information. The 
test now passes on amd64, arm64, i386 and ppc64el, all running 
5.10.162-1 and on riscv64 running unstable. However, on armhf, armel 
(amd64 kernel) and s390x (all running 5.10.158-2), it seems that the 
observation of brian is still true, some test in test-unit test 
segfaults, the test exits and hangs. @Guillem, do you see something more 
in the output below (armhf log) that may be of interest? And maybe spot 
something to run in isolation?


When I try to destroy the lxc, that fails and in ps output I see this:
root 3053528  0.0  0.0   5388  3072 ?Ss   03:34   0:00 [lxc 
monitor] /var/lib/lxc ci-061-8c60e21c
root 3061512  0.0  0.0  0 0 ?Ss   03:35   0:00  \_ 
[systemd]
debian   3110684  0.0  0.0   2140   192 ?DL   03:37   0:00 
\_ ./iopoll-leak.t


Note the "D" state.

Reading the changelog of 5.10.162-1 I see io_uring mentioned a couple of 
times. Therefor I assume this bug is fixed in that version. Is it worth 
pursuing the real issue here?


Paul

root@ci-061-705317d0:/tmp/autopkgtest-lxc.v8gx_5j5/downtmp# cat 
test-unit-stdout

+ [ -n  ]
+ CC=gcc
+ ./configure --cc=gcc
prefix/usr
includedir/usr/include
libdir/usr/lib
libdevdir /usr/lib
relativelibdir
mandir/usr/man
datadir   /usr/share
stringop_overflow yes
array_bounds  yes
__kernel_rwf_tyes
__kernel_timespec yes
open_how  yes
statx yes
glibc_statx   yes
C++   yes
has_ucontext  yes
NVMe uring command supportyes
liburing_nolibc   no
CCgcc
CXX   g++
+ make runtests
make[1]: Entering directory 
'/tmp/autopkgtest-lxc.v8gx_5j5/downtmp/build.ksh/src/src'

 CC setup.ol
 CC queue.ol
 CC register.ol
 CC syscall.ol
 AR liburing.a
ar: creating liburing.a
 RANLIB liburing.a
 CC setup.os
 CC queue.os
 CC register.os
 CC syscall.os
 CC liburing.so.2.3
make[1]: Leaving directory 
'/tmp/autopkgtest-lxc.v8gx_5j5/downtmp/build.ksh/src/src'
make[1]: Entering directory 
'/tmp/autopkgtest-lxc.v8gx_5j5/downtmp/build.ksh/src/test'

 CC helpers.o
 CC 232c93d07b74.t
 CC 35fa71a030ca.t
 CC 500f9fbadef8.t
 CC 7ad0e4b2f83c.t
 CC 8a9973408177.t
 CC 917257daa0fe.t
 CC a0908ae19763.t
 CC a4c0b3decb33.t
 CC accept.t
 CC accept-link.t
 CC accept-reuse.t
 CC accept-test.t
 CC across-fork.t
 CC b19062a56726.t
 CC b5837bd5311d.t
 CC buf-ring.t
 CC ce593a6c480a.t
 CC close-opath.t
 CC connect.t
 CC cq-full.t
 CC cq-overflow.t
 CC cq-peek-batch.t
 CC cq-ready.t
 CC cq-size.t
 CC d4ae271dfaae.t
 CC d77a67ed5f27.t
 CC defer.t
 CC defer-taskrun.t
 CC double-poll-crash.t
 CC drop-submit.t
 CC eeed8b54e0df.t
 CC empty-eownerdead.t
 CC eventfd.t
 CC eventfd-disable.t
 CC eventfd-reg.t
 CC eventfd-ring.t
 CC exec-target.t
 CC exit-no-cleanup.t
 CC fadvise.t
 CC fallocate.t
 CC fc2a85cb02ef.t
 CC fd-pass.t
 CC file-register.t
 CC files-exit-hang-poll.t
 CC files-exit-hang-timeout.t
 CC file-update.t
 CC file-verify.t
 CC fixed-buf-iter.t
 CC fixed-link.t
 CC fixed-reuse.t
 CC fpos.t
 CC fsync.t
 CC hardlink.t
 CC io-cancel.t
 CC iopoll.t
 CC iopoll-leak.t
 CC io_uring_enter.t
 CC io_uring_passthrough.t
 CC io_uring_register.t
 CC io_uring_setup.t
 CC lfs-openat.t
 CC lfs-openat-write.t
 CC link.t
 CC link_drain.t
 CC link-timeout.t
 CC madvise.t
 CC mkdir.t
 CC msg-ring.t
 CC multicqes_drain.t
 CC nolibc.t
 CC nop-all-sizes.t
 CC nop.t
 CC openat2.t
 CC open-close.t
 CC open-direct-link.t
 CC open-direct-pick.t
 CC personality.t
 CC pipe-eof.t
 CC pipe-reuse.t
 CC poll.t
 CC poll-cancel.t
 CC poll-cancel-all.t
 CC poll-cancel-ton.t
 CC poll-link.t
 CC poll-many.t
 CC poll-mshot-update.t
 CC poll-mshot-overflow.t
 CC poll-ring.t
 CC poll-v-poll.t
 CC pollfree.t
 CC probe.t
 CC read-before-exit.t
 CC read-write.t
 CC recv-msgall.t
 CC 

Bug#1032271: Module advansys fails to load on amd64

2023-03-02 Thread Pascal Hambourg

Package: src:linux
Version: 5.10.162-1
Severity: normal

Dear maintainer,

Loading the 'advansys' module fails with any -amd64 kernel:

# modprobe advansys
modprobe: ERROR: could not insert 'advansys': No such device

The module loads successfully with any -686 or -686-pae kernel.

I observed the same behaviour with any stable Debian kernel from 4.9 to 
6.1 I have, regardless of whether the supported device and required 
firmware are present or not.


After adding a few printk() in the module code, it appears that the 
reason is because calls to isa_register_driver() in advansys_init() 
fail. The failure may be related with some ISA-related config option 
missing in amd64 kernels. A quick comparison shows that the following 
options are enabled in -686 config but not in -amd64 config:


CONFIG_ISA=y
CONFIG_ISA_BUS_API=y
CONFIG_ISAPNP=y

However I believe a better solution would be to have the module not fail 
if ISA is disabled, as it also support EISA and PCI devices.




Bug#1032284: Linux-image-6.1.x with rtw88_8723de module

2023-03-02 Thread Philes Philbert
Package: linux-image-6.1.0-5-amd64
Version: 6.1.12-1

Linux-image 6.1 series hangs on boot when rtw88_8723de module is enabled.
Blacklisting the module by creating a file in /etc/modprobe.d allows the
kernel to boot successfully. This is only a workaround I found at
https://forum.endeavouros.com/t/kernel-6-1-stuck-in-boot-before-login-window/35127/11.