Bug#901990: linux-image-4.16.0-2-amd64: kernel BUG at startup in usercopy.c ; impossible to boot

2018-07-04 Thread Ludovic Pouzenc
Package: nvidia-kernel-dkms
Version: 390.48-2~bpo9+3
Followup-For: Bug #901990

Dear Maintainer,

It happens to me to, with AMD Ryzen and 4.16.0-0.bpo.2-amd64.
I am fine with 4.16.0-0.bpo.1-amd64. I've rebooted on vmlinux.old to
reportbug.
I'll try driver from sid.


-- Package-specific info:
uname -a:
Linux lud-MN1 4.16.0-0.bpo.1-amd64 #1 SMP Debian 4.16.5-1~bpo9+1 (2018-05-06) 
x86_64 GNU/Linux

/proc/version:
Linux version 4.16.0-0.bpo.1-amd64 (debian-ker...@lists.debian.org) (gcc 
version 6.3.0 20170516 (Debian 6.3.0-18+deb9u1)) #1 SMP Debian 4.16.5-1~bpo9+1 
(2018-05-06)

/proc/driver/nvidia/version:
NVRM version: NVIDIA UNIX x86_64 Kernel Module  390.48  Thu Mar 22 00:42:57 PDT 
2018
GCC version:  gcc version 6.3.0 20170516 (Debian 6.3.0-18+deb9u1) 

lspci 'display controller [030?]':
07:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP106 [GeForce GTX 
1060 3GB] [10de:1c02] (rev a1) (prog-if 00 [VGA controller])
Subsystem: ASUSTeK Computer Inc. GP106 [GeForce GTX 1060 3GB] 
[1043:85b9]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- DisINTx-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR- 
Kernel driver in use: nvidia
Kernel modules: nvidia

dmesg:

Device node permissions:
crw-rw+ 1 root video 226,   0 Jul  4 19:57 /dev/dri/card0
crw-rw+ 1 root video 226, 128 Jul  4 19:57 /dev/dri/renderD128
crw-rw-rw-  1 root root  195, 254 Jul  4 19:57 /dev/nvidia-modeset
crw-rw-rw-  1 root root  195,   0 Jul  4 19:57 /dev/nvidia0
crw-rw-rw-  1 root root  195, 255 Jul  4 19:57 /dev/nvidiactl
video:x:44:lpouzenc

OpenGL and NVIDIA library files installed:
lrwxrwxrwx 1 root root   15 Feb 17 12:02 /etc/alternatives/glx -> 
/usr/lib/nvidia
lrwxrwxrwx 1 root root   42 Feb 17 12:02 
/etc/alternatives/glx--libEGL.so.1-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libEGL.so.1
lrwxrwxrwx 1 root root   44 Feb 17 12:02 
/etc/alternatives/glx--libEGL.so.1-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libEGL.so.1
lrwxrwxrwx 1 root root   41 Feb 17 12:02 
/etc/alternatives/glx--libGL.so.1-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libGL.so.1
lrwxrwxrwx 1 root root   41 Feb 17 12:02 
/etc/alternatives/glx--libGL.so.1-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libGL.so.1
lrwxrwxrwx 1 root root   43 Feb 17 12:02 
/etc/alternatives/glx--libGL.so.1-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libGL.so.1
lrwxrwxrwx 1 root root   43 Feb 17 12:02 
/etc/alternatives/glx--libGL.so.1-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libGL.so.1
lrwxrwxrwx 1 root root   48 Feb 17 12:02 
/etc/alternatives/glx--libGLESv1_CM.so.1-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libGLESv1_CM.so.1
lrwxrwxrwx 1 root root   48 Feb 17 12:02 
/etc/alternatives/glx--libGLESv1_CM.so.1-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libGLESv1_CM.so.1
lrwxrwxrwx 1 root root   50 Feb 17 12:02 
/etc/alternatives/glx--libGLESv1_CM.so.1-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libGLESv1_CM.so.1
lrwxrwxrwx 1 root root   50 Feb 17 12:02 
/etc/alternatives/glx--libGLESv1_CM.so.1-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libGLESv1_CM.so.1
lrwxrwxrwx 1 root root   45 Feb 17 12:02 
/etc/alternatives/glx--libGLESv2.so.2-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libGLESv2.so.2
lrwxrwxrwx 1 root root   45 Feb 17 12:02 
/etc/alternatives/glx--libGLESv2.so.2-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libGLESv2.so.2
lrwxrwxrwx 1 root root   47 Feb 17 12:02 
/etc/alternatives/glx--libGLESv2.so.2-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libGLESv2.so.2
lrwxrwxrwx 1 root root   47 Feb 17 12:02 
/etc/alternatives/glx--libGLESv2.so.2-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libGLESv2.so.2
lrwxrwxrwx 1 root root   49 Feb 17 12:02 
/etc/alternatives/glx--libnvidia-cfg.so.1-i386-linux-gnu -> 
/usr/lib/i386-linux-gnu/nvidia/libnvidia-cfg.so.1
lrwxrwxrwx 1 root root   51 Feb 17 12:02 
/etc/alternatives/glx--libnvidia-cfg.so.1-x86_64-linux-gnu -> 
/usr/lib/x86_64-linux-gnu/nvidia/libnvidia-cfg.so.1
lrwxrwxrwx 1 root root   25 Feb 17 12:02 
/etc/alternatives/glx--linux-libglx.so -> /usr/lib/nvidia/libglx.so
lrwxrwxrwx 1 root root   42 Feb 17 12:02 
/etc/alternatives/glx--nvidia-blacklists-nouveau.conf -> 
/etc/nvidia/nvidia-blacklists-nouveau.conf
lrwxrwxrwx 1 root root   36 Feb 17 12:02 
/etc/alternatives/glx--nvidia-bug-report.sh -> 
/usr/lib/nvidia/nvidia-bug-report.sh
lrwxrwxrwx 1 root root   39 Feb 17 12:02 
/etc/alternatives/glx--nvidia-drm-outputclass.conf -> 
/etc/nvidia/nvidia-drm-outputclass.conf
lrwxrwxrwx 1 root root   28 Feb 17 12:02 
/etc/alternatives/glx--nvidia-load.conf -> /etc/nvidia/nvidia-load.conf
lrwxrwxrwx 1 root root   32 Feb 17 12:02 
/etc/alternatives/glx--nvidia-modprobe.conf -> /etc/nvidia/nvidia-modprobe.conf
lrwxrwxrwx 1 root root 

Bug#901990: linux-image-4.16.0-2-amd64: kernel BUG at startup in usercopy.c ; impossible to boot

2018-06-21 Thread Bastian Blank
Control: reassign -1 src:nvidia-graphics-drivers
Control: severity -1 important

Hi

On Thu, Jun 21, 2018 at 07:34:02AM +0200, Ara Keary wrote:
> after switching to version 4.16.16-1 of linux-image-4.16.0-2-amd64 my system 
> does not boot anymore. I obtain a kernel BUG message in syslog in usercopy.c.

Your messages show that the proprietary nvidia driver is responsible,
re-assigning.

Bastian

-- 
The best diplomat I know is a fully activated phaser bank.
-- Scotty



Bug#901990: linux-image-4.16.0-2-amd64: kernel BUG at startup in usercopy.c ; impossible to boot

2018-06-20 Thread Ara Keary
Package: src:linux
Version: 4.16.16-1
Severity: critical
Justification: breaks the whole system

Dear debian kernel maintainers

after switching to version 4.16.16-1 of linux-image-4.16.0-2-amd64 my system 
does not boot anymore. I obtain a kernel BUG message in syslog in usercopy.c.

Reverting to the current testing version of the amd64 kernel makes the system 
bootable again (and no kernel BUG message appears anymore in syslog).

Since i'm reporting the bug from the recovery startup mode, i can't attach now 
the kernel BUG message, but i'll do it soon after booting with the testing 
version of the kernel.

Best,

Ara

-- Package-specific info:
** Version:
Linux version 4.16.0-2-amd64 (debian-ker...@lists.debian.org) (gcc version 
7.3.0 (Debian 7.3.0-23)) #1 SMP Debian 4.16.16-1 (2018-06-19)

** Command line:
BOOT_IMAGE=/boot/vmlinuz-4.16.0-2-amd64 
root=UUID=745bad16-7720-4350-805d-956d712c90fd ro single

** Tainted: PO (4097)
 * Proprietary module has been loaded.
 * Out-of-tree module has been loaded.

** Kernel log:
[   13.738558] CPU3: Package temperature above threshold, cpu clock throttled 
(total events = 1)
[   13.738559] CPU2: Package temperature above threshold, cpu clock throttled 
(total events = 1)
[   13.738645] CPU5: Package temperature above threshold, cpu clock throttled 
(total events = 1)
[   13.739542] CPU5: Core temperature/speed normal
[   13.739542] CPU2: Package temperature/speed normal
[   13.739543] CPU6: Package temperature/speed normal
[   13.739543] CPU3: Package temperature/speed normal
[   13.739544] CPU7: Package temperature/speed normal
[   13.739544] CPU1: Package temperature/speed normal
[   13.739545] CPU0: Package temperature/speed normal
[   13.739546] CPU4: Core temperature/speed normal
[   13.739621] CPU5: Package temperature/speed normal
[   13.739696] CPU4: Package temperature/speed normal
[   13.749499] input: PC Speaker as /devices/platform/pcspkr/input/input12
[   13.882051] iwlwifi :3e:00.0: Detected Intel(R) Dual Band Wireless N 
7260, REV=0x144
[   13.903339] iwlwifi :3e:00.0: base HW address: a4:c4:94:3f:55:b6
[   13.964115] ACPI: AC Adapter [AC] (off-line)
[   13.964335] input: Sleep Button as 
/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0E:00/input/input13
[   13.964446] ACPI: Sleep Button [SLPB]
[   13.964567] input: Lid Switch as 
/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0D:00/input/input14
[   13.964673] ACPI: Lid Switch [LID]
[   13.964784] input: Power Button as 
/devices/LNXSYSTM:00/LNXPWRBN:00/input/input15
[   13.964828] ACPI: Battery Slot [BAT0] (battery absent)
[   13.964908] ACPI: Power Button [PWRF]
[   13.969747] ACPI: Battery Slot [BAT1] (battery absent)
[   14.141074] ieee80211 phy0: Selected rate control algorithm 'iwl-mvm-rs'
[   14.290474] hp_wmi: query 0xd returned error 0x5
[   14.290609] input: HP WMI hotkeys as /devices/virtual/input/input16
[   14.364058] snd_hda_intel :01:00.1: Disabling MSI
[   14.364141] snd_hda_intel :01:00.1: Handle vga_switcheroo audio client
[   14.461229] media: Linux media interface: v0.10
[   14.528410] Linux video capture interface: v2.00
[   14.619245] usbcore: registered new interface driver snd-usb-audio
[   14.670472] uvcvideo: Found UVC 1.00 device HP HD Webcam (05c8:0374)
[   14.671925] EXT4-fs (nvme0n1p1): mounted filesystem with ordered data mode. 
Opts: (null)
[   14.679051] uvcvideo 1-7:1.0: Entity type for entity Extension 4 was not 
initialized!
[   14.679144] uvcvideo 1-7:1.0: Entity type for entity Extension 3 was not 
initialized!
[   14.679238] uvcvideo 1-7:1.0: Entity type for entity Processing 2 was not 
initialized!
[   14.679344] uvcvideo 1-7:1.0: Entity type for entity Camera 1 was not 
initialized!
[   14.679506] input: HP HD Webcam: HP HD Webcam as 
/devices/pci:00/:00:14.0/usb1/1-7/1-7:1.0/input/input17
[   14.679652] usbcore: registered new interface driver uvcvideo
[   14.679731] USB Video Class driver (1.1.1)
[   14.690168] snd_hda_codec_realtek hdaudioC0D0: autoconfig for ALC3228: 
line_outs=1 (0x14/0x0/0x0/0x0/0x0) type:speaker
[   14.690268] snd_hda_codec_realtek hdaudioC0D0:speaker_outs=0 
(0x0/0x0/0x0/0x0/0x0)
[   14.690363] snd_hda_codec_realtek hdaudioC0D0:hp_outs=1 
(0x15/0x0/0x0/0x0/0x0)
[   14.690456] snd_hda_codec_realtek hdaudioC0D0:mono: mono_out=0x0
[   14.690536] snd_hda_codec_realtek hdaudioC0D0:inputs:
[   14.690615] snd_hda_codec_realtek hdaudioC0D0:  Mic=0x1a
[   14.690694] snd_hda_codec_realtek hdaudioC0D0:  Internal Mic=0x12
[   14.744642] iTCO_vendor_support: vendor-support=0
[   14.745111] iTCO_wdt: Intel TCO WatchDog Timer Driver v1.11
[   14.745228] iTCO_wdt: Found a Lynx Point TCO device (Version=2, 
TCOBASE=0x1860)
[   14.745400] iTCO_wdt: initialized. heartbeat=30 sec (nowayout=0)
[   14.764767] RAPL PMU: API unit is 2^-32 Joules, 4 fixed counters, 655360 ms 
ovfl timer
[   14.764870] RAPL PMU: hw unit of domain pp0-core 2^-14 Joules
[   14.764956] RAPL PMU: hw unit of domain package 2^-14 Joules
[   14.765041]