Bug#976788: linux-image-5.9.0-4-amd64: nouveau DRM timeout causes machine to freeze

2021-01-07 Thread Philip Stewart
I think this is the upstream issue: 
https://gitlab.freedesktop.org/drm/nouveau/-/issues/14


As of 5.10.3, the regression appears to remain.




2020-12-14 Thread Uros Knuples
Package: src:linux
Version: 5.9.11-1
Followup-For: Bug #976788

Hello,

I have the same problem as described by the previous reporters.

The lock-up appears when using the kwin-x11 window manager with compositing 
enabled. It does not occur on earlier kernel versions (<=5.8).

This is the relevant kernel log:

Dec 15 04:17:39 localhost kernel: [3.758379] fb0: switching to nouveaufb 
from VESA VGA
Dec 15 04:17:39 localhost kernel: [4.258725] nouveau :01:00.0: vgaarb: 
deactivate vga console
Dec 15 04:17:39 localhost kernel: [4.258879] nouveau :01:00.0: NVIDIA 
GT216 (0a5a00a2)
Dec 15 04:17:39 localhost kernel: [4.296522] nouveau :01:00.0: bios: 
version 70.16.26.00.05
Dec 15 04:17:39 localhost kernel: [4.296925] nouveau :01:00.0: bios: 
OOB 1 015f1901 015f1901
Dec 15 04:17:39 localhost kernel: [4.318543] nouveau :01:00.0: fb: 1024 
MiB DDR3
Dec 15 04:17:39 localhost kernel: [4.399251] nouveau :01:00.0: DRM: 
VRAM: 1024 MiB
Dec 15 04:17:39 localhost kernel: [4.399255] nouveau :01:00.0: DRM: 
GART: 1048576 MiB
Dec 15 04:17:39 localhost kernel: [4.399261] nouveau :01:00.0: DRM: 
TMDS table version 2.0
Dec 15 04:17:39 localhost kernel: [4.399264] nouveau :01:00.0: DRM: DCB 
version 4.0
Dec 15 04:17:39 localhost kernel: [4.399268] nouveau :01:00.0: DRM: DCB 
outp 00: 01000323 00010034
Dec 15 04:17:39 localhost kernel: [4.399273] nouveau :01:00.0: DRM: DCB 
outp 01: 02014300 
Dec 15 04:17:39 localhost kernel: [4.399276] nouveau :01:00.0: DRM: DCB 
outp 02: 02021362 00020010
Dec 15 04:17:39 localhost kernel: [4.399283] nouveau :01:00.0: DRM: DCB 
conn 00: 0340
Dec 15 04:17:39 localhost kernel: [4.399286] nouveau :01:00.0: DRM: DCB 
conn 01: 1061
Dec 15 04:17:39 localhost kernel: [4.399289] nouveau :01:00.0: DRM: DCB 
conn 02: 0147
Dec 15 04:17:39 localhost kernel: [4.399293] nouveau :01:00.0: DRM: DCB 
conn 03: 00202346
Dec 15 04:17:39 localhost kernel: [4.399296] nouveau :01:00.0: DRM: DCB 
conn 04: 0400
Dec 15 04:17:39 localhost kernel: [4.399299] nouveau :01:00.0: DRM: DCB 
conn 05: 0210
Dec 15 04:17:39 localhost kernel: [4.399302] nouveau :01:00.0: DRM: DCB 
conn 06: 0211
Dec 15 04:17:39 localhost kernel: [4.399305] nouveau :01:00.0: DRM: DCB 
conn 07: 0213
Dec 15 04:17:39 localhost kernel: [4.402944] nouveau :01:00.0: DRM: MM: 
using COPY for buffer copies
Dec 15 04:17:39 localhost kernel: [4.491176] nouveau :01:00.0: DRM: 
allocated 1366x768 fb: 0x7, bo (ptrval)
Dec 15 04:17:39 localhost kernel: [4.491273] fbcon: nouveaudrmfb (fb0) is 
primary device
Dec 15 04:17:39 localhost kernel: [5.828809] nouveau :01:00.0: [drm] 
fb0: nouveaudrmfb frame buffer device
Dec 15 04:17:39 localhost kernel: [5.846907] [drm] Initialized nouveau 
1.3.1 20120801 for :01:00.0 on minor 0
Dec 15 04:18:10 localhost kernel: [   62.947755] nouveau :01:00.0: bios: 
OOB 1 015f1901 015f1901
Dec 15 04:18:10 localhost kernel: [   62.947992] nouveau :01:00.0: bios: 
OOB 1 015f1901 015f1901
Dec 15 04:18:10 localhost kernel: [   62.948398] nouveau :01:00.0: bios: 
OOB 1 015f1901 015f1901
Dec 15 04:30:57 localhost kernel: [  829.485336] nouveau :01:00.0: disp: 
ERROR 5 [INVALID_STATE] 06 [] chid 1 mthd 0080 data 0001
Dec 15 04:30:57 localhost kernel: [  829.485343] nouveau :01:00.0: disp: 
Base 1:
Dec 15 04:30:57 localhost kernel: [  829.485355] nouveau :01:00.0: disp:
0084:   
Dec 15 04:30:57 localhost kernel: [  829.485362] nouveau :01:00.0: disp:
0088:   
Dec 15 04:30:57 localhost kernel: [  829.485369] nouveau :01:00.0: disp:
008c:   
Dec 15 04:30:57 localhost kernel: [  829.485375] nouveau :01:00.0: disp:
0090:   
Dec 15 04:30:57 localhost kernel: [  829.485381] nouveau :01:00.0: disp:
0094:   
Dec 15 04:30:57 localhost kernel: [  829.485388] nouveau :01:00.0: disp:
00a0: 0070 -> 0060
Dec 15 04:30:57 localhost kernel: [  829.485395] nouveau :01:00.0: disp:
00a4: f000  
Dec 15 04:30:57 localhost kernel: [  829.485402] nouveau :01:00.0: disp:
00c0: fb7a  
Dec 15 04:30:57 localhost kernel: [  829.485409] nouveau :01:00.0: disp:
00c4:   
Dec 15 04:30:57 localhost kernel: [  829.485415] nouveau :01:00.0: disp:
00c8:   
Dec 15 04:30:57 localhost kernel: [  829.485422] nouveau :01:00.0: disp:
00cc:   
Dec 15 04:30:57 localhost kernel: [  829.485429] nouveau :01:00.0: disp:
00e0: 4000  
Dec 15 04:30:57 localhost kernel: [  829.485435] nouveau :01:00.0: disp:
00e4:   
Dec 15 04:30:57 localhost kernel: [  829.485442] nouveau :01:00.0: disp:


2020-12-13 Thread Philip Stewart

Hello again,

I wanted to share a further finding since the bug report was submitted.

Following repeated lock-ups making the system practically unusable, I 
have installed linux-image-5.8.0-2-amd64 (package version 5.8.10-1) 
from the Debian snapshot archive.


Having now tested this 5.8 kernel for several days without incident, it 
appears this is a regression in the 5.9 kernel.


Kind regards,
Phil




2020-12-07 Thread Philip Stewart

Package: src:linux
Version: 5.9.11-1

Hello,

I have been experiencing frequent lock-ups of my machine since installing 
bullseye on Saturday. Each one renders it inoperable for several minutes 
at a time; the most recent followed a launch of GNOME Disk Utility.


Looking in journalctl afterwards, I note:



Dec 07 16:19:00 dbus-daemon[1409]: [session uid=1000 pid=1409] 
Activating service name='org.gnome.DiskUtility' requested by ':1.34' 
(uid=1000 pid=1537 comm="/usr/bin/gnome-shell ")
Dec 07 16:19:00 dbus-daemon[1409]: [session uid=1000 pid=1409] 
Successfully activated service 'org.gnome.DiskUtility'

Dec 07 16:19:03 kernel: nouveau :02:00.0: DRM: base-0: timeout
Dec 07 16:19:05 kernel: nouveau :02:00.0: DRM: base-0: timeout



The timeout message repeats every couple of seconds. Eventually the 
kernel also emits repeated occurrences of the following warning:




Dec 07 16:22:13 kernel: [ cut here ]
Dec 07 16:22:13 kernel: WARNING: CPU: 0 PID: 2126 at 
drivers/gpu/drm/nouveau/dispnv50/disp.c:211 nv50_dmac_wait+0x1e1/0x230 
[nouveau]
Dec 07 16:22:13 kernel: Modules linked in: rfcomm snd_seq_dummy 
snd_hrtimer snd_seq snd_seq_device cmac algif_hash algif_skcipher 
af_alg bnep btusb btrtl btbcm btintel bluetooth joydev efi_pstore 
jitterentropy_rng nls_ascii wl(POE) nls_cp437 uvcvideo vfat coretemp 
nf_log_ipv6 fat drbg ip6t_REJECT snd_hda_codec_cirrus nf_reject_ipv6 
videobuf2_vmalloc kvm_intel snd_hda_codec_generic videobuf2_memops 
videobuf2_v4l2 ledtrig_audio videobuf2_common ansi_cprng videodev 
snd_hda_intel ecdh_generic snd_intel_dspcfg ecc mc kvm snd_hda_codec 
xt_hl cfg80211 snd_hda_core bcm5974 applesmc apple_mfi_fastcharge 
ip6_tables snd_hwdep snd_pcm ip6t_rt irqbypass snd_timer rfkill pcspkr 
snd efivars soundcore sg nv_tco nf_log_ipv4 acpi_als nf_log_common 
kfifo_buf ipt_REJECT nf_reject_ipv4 industrialio xt_LOG sbs sbshc evdev 
ac acpi_cpufreq apple_bl nft_limit xt_limit xt_addrtype xt_tcpudp 
xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c 
nft_compat nft_counter nf_tables nfnetlink firewire_sbp2 fuse msr
Dec 07 16:22:13 kernel: configfs efivarfs ip_tables x_tables autofs4 
ext4 crc16 mbcache jbd2 crc32c_generic ecb aes_generic libaes 
crypto_simd cryptd glue_helper xts hid_apple uas usb_storage 
hid_appleir hid_generic usbhid hid dm_crypt dm_mod sd_mod t10_pi 
crc_t10dif crct10dif_generic sr_mod cdrom crct10dif_common nouveau ahci 
libahci mxm_wmi wmi libata video i2c_algo_bit ohci_pci ttm 
drm_kms_helper cec drm ehci_pci ohci_hcd ehci_hcd firewire_ohci 
scsi_mod firewire_core crc_itu_t usbcore forcedeth usb_common 
i2c_nforce2 button
Dec 07 16:22:13 kernel: CPU: 0 PID: 2126 Comm: kworker/u4:1 Tainted: P 
W OE 5.9.0-4-amd64 #1 Debian 5.9.11-1
Dec 07 16:22:13 kernel: Hardware name: Apple Inc. 
MacBookPro5,5/Mac-F2268AC8, BIOS MBP55.88Z.00AC.B03.0906151708 06/15/09
Dec 07 16:22:13 kernel: Workqueue: events_unbound 
nv50_disp_atomic_commit_work [nouveau]

Dec 07 16:22:13 kernel: RIP: 0010:nv50_dmac_wait+0x1e1/0x230 [nouveau]
Dec 07 16:22:13 kernel: Code: 8d 48 04 48 89 4a 68 c7 00 00 00 00 20 49 
8b 46 38 41 c7 86 20 01 00 00 00 00 00 00 49 89 46 68 e8 e4 fc ff ff e9 
76 fe ff ff <0f> 0b b8 92 ff ff ff e9 ed fe ff ff 49 8b be 80 00 00 00 
e8 c7 fc

Dec 07 16:22:13 kernel: RSP: 0018:a0bec139fd60 EFLAGS: 00010282
Dec 07 16:22:13 kernel: RAX: ff92 RBX: 0002 
RCX: 
Dec 07 16:22:13 kernel: RDX: ff92 RSI: a0bec139fca0 
RDI: a0bec139fd40
Dec 07 16:22:13 kernel: RBP: a0bec139fd60 R08:  
R09: a0bec139fc70
Dec 07 16:22:13 kernel: R10: 0030 R11: a0bec00f97f8 
R12: fffb
Dec 07 16:22:13 kernel: R13: 92626097ab68 R14: 92626097aba8 
R15: 
Dec 07 16:22:13 kernel: FS: () 
GS:926377e0() knlGS:

Dec 07 16:22:13 kernel: CS: 0010 DS:  ES:  CR0: 80050033
Dec 07 16:22:13 kernel: CR2: 56430661b3c0 CR3: 00012040a000 
CR4: 000406f0

Dec 07 16:22:13 kernel: Call Trace:
Dec 07 16:22:13 kernel: base507c_update+0x30/0x70 [nouveau]
Dec 07 16:22:13 kernel: nv50_disp_atomic_commit_wndw+0x5a/0x70 [nouveau]
Dec 07 16:22:13 kernel: nv50_disp_atomic_commit_tail+0x4f9/0x7c0 
[nouveau]

Dec 07 16:22:13 kernel: ? __switch_to+0x7f/0x3f0
Dec 07 16:22:13 kernel: process_one_work+0x1b4/0x370
Dec 07 16:22:13 kernel: worker_thread+0x53/0x3e0
Dec 07 16:22:13 kernel: ? process_one_work+0x370/0x370
Dec 07 16:22:13 kernel: kthread+0x11b/0x140
Dec 07 16:22:13 kernel: ? __kthread_bind_mask+0x60/0x60
Dec 07 16:22:13 kernel: ret_from_fork+0x22/0x30
Dec 07 16:22:13 kernel: ---[ end trace b1da6d2c3f301154 ]---
Dec 07 16:22:13 gnome-shell[1537]: libinput error: client bug: timer 
event7 tap: scheduled expiry is in the past (-3566ms), your system is 
too slow

Dec 07 16:22:17 kernel: nouveau :02:00.0: DRM: base-0: timeout
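
For anyone wanting to check how often the "base-0: timeout" message recurs in their own journal, the following is a minimal sketch, not part of the original report. It assumes journalctl's default short output format as quoted above; the function name timeout_gaps and the year argument are hypothetical, introduced only for illustration (the log lines carry no year, so one has to be supplied).

```python
import re
from datetime import datetime

# Matches journalctl short-format lines such as:
#   Dec 07 16:19:03 kernel: nouveau :02:00.0: DRM: base-0: timeout
TIMEOUT_RE = re.compile(
    r"^(?P<stamp>[A-Z][a-z]{2} \d{2} \d{2}:\d{2}:\d{2}) "
    r".*nouveau .*DRM: base-\d+: timeout\s*$"
)


def timeout_gaps(lines, year=2020):
    """Return the gaps, in seconds, between consecutive nouveau timeout lines.

    The year is an assumption supplied by the caller, since the short
    journal format does not include it.
    """
    stamps = []
    for line in lines:
        match = TIMEOUT_RE.match(line)
        if match:
            stamps.append(
                datetime.strptime(
                    f"{year} {match.group('stamp')}", "%Y %b %d %H:%M:%S"
                )
            )
    # Pair each timestamp with its successor and take the difference.
    return [(b - a).total_seconds() for a, b in zip(stamps, stamps[1:])]


# Sample lines taken from the journal excerpt in the report.
sample = [
    "Dec 07 16:19:00 dbus-daemon[1409]: [session uid=1000 pid=1409] "
    "Successfully activated service 'org.gnome.DiskUtility'",
    "Dec 07 16:19:03 kernel: nouveau :02:00.0: DRM: base-0: timeout",
    "Dec 07 16:19:05 kernel: nouveau :02:00.0: DRM: base-0: timeout",
]
print(timeout_gaps(sample))
```

Feeding it the full output of journalctl -k (one line per list element) should give a rough picture of whether the timeouts arrive at a steady interval or in bursts, which may be useful when comparing kernel versions.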