[Kernel-packages] [Bug 1802050] Re: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10

2019-02-11 Thread kenorb
My OS details:
$ uname -a
Linux Ubuntu-PC 4.18.0-15-generic #16-Ubuntu SMP Thu Feb 7 10:56:39 UTC 2019 
x86_64 x86_64 x86_64 GNU/Linux

$ lsb_release -a 
No LSB modules are available.
Distributor ID: Ubuntu
Description:Ubuntu 18.10
Release:18.10
Codename:   cosmic

$ lsmod | grep nvidia
nvidia_uvm786432  0
nvidia_drm 40960  8
nvidia_modeset   1110016  12 nvidia_drm
nvidia  14368768  696 nvidia_uvm,nvidia_modeset
drm_kms_helper172032  1 nvidia_drm
drm   458752  11 drm_kms_helper,nvidia_drm
ipmi_msghandler   102400  2 ipmi_devintf,nvidia

$ modinfo nvidia
filename:   /lib/modules/4.18.0-15-generic/updates/dkms/nvidia.ko
alias:  char-major-195-*
version:390.87
supported:  external
license:NVIDIA
srcversion: 9680830EEE0BAB392576E99
alias:  pci:v10DEd0E00sv*sd*bc04sc80i00*
alias:  pci:v10DEd*sv*sd*bc03sc02i00*
alias:  pci:v10DEd*sv*sd*bc03sc00i00*
depends:ipmi_msghandler
retpoline:  Y
name:   nvidia
vermagic:   4.18.0-15-generic SMP mod_unload

$ lspci -v | grep NVIDIA
17:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 
Ti] (rev a1) (prog-if 00 [VGA controller])
17:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1)
18:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 
Ti] (rev a1) (prog-if 00 [VGA controller])
18:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1)
65:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 
Ti] (rev a1) (prog-if 00 [VGA controller])
65:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1)
b4:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 
Ti] (rev a1) (prog-if 00 [VGA controller])
b4:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-390 in Ubuntu.
https://bugs.launchpad.net/bugs/1802050

Title:
  Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10

Status in nvidia-graphics-drivers-390 package in Ubuntu:
  Confirmed

Bug description:
  [   56.996976] [ cut here ]
  [   56.996978] Bad or missing usercopy whitelist? Kernel memory exposure 
attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
  [   56.996985] WARNING: CPU: 3 PID: 2547 at mm/usercopy.c:81 
usercopy_warn+0x81/0xa0
  [   56.996985] Modules linked in: ebtable_filter ebtables ip6table_filter 
ip6_tables iptable_filter devlink bpfilter pci_stub vboxpci(OE) vboxnetadp(OE) 
vboxnetflt(OE) vboxdrv(OE) nvidia_uvm(POE) eeepc_wmi asus_wmi 
snd_hda_codec_hdmi mxm_wmi wmi_bmof sparse_keymap nvidia_drm(POE) 
nvidia_modeset(POE) snd_hda_codec_realtek intel_rapl nvidia(POE) 
snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp 
snd_hda_intel kvm_intel kvm snd_hda_codec joydev snd_hda_core snd_usb_audio 
snd_usbmidi_lib irqbypass snd_hwdep input_leds crct10dif_pclmul crc32_pclmul 
ghash_clmulni_intel snd_pcm pcbc aesni_intel snd_seq_midi snd_seq_midi_event 
aes_x86_64 snd_rawmidi crypto_simd cryptd snd_seq glue_helper drm_kms_helper 
intel_cstate drm ipmi_devintf ipmi_msghandler intel_rapl_perf snd_seq_device 
snd_timer fb_sys_fops
  [   56.997003]  syscopyarea sysfillrect sysimgblt snd mac_hid soundcore wmi 
video mei_me mei acpi_pad sch_fq_codel parport_pc ppdev lp parport ip_tables 
x_tables autofs4 raid10 multipath linear raid456 async_raid6_recov async_memcpy 
async_pq async_xor async_tx xor raid6_pq raid0 libcrc32c raid1 hid_generic 
usbhid hid gpio_ich r8169 i2c_i801 mii lpc_ich ahci libahci
  [   56.997016] CPU: 3 PID: 2547 Comm: Xorg Tainted: P   OE 
4.18.0-10-generic #11-Ubuntu
  [   56.997017] Hardware name: ASUS All Series/Z97-K, BIOS 2902 03/31/2016
  [   56.997018] RIP: 0010:usercopy_warn+0x81/0xa0
  [   56.997018] Code: b0 aa 41 51 4d 89 d8 48 c7 c0 89 8d af aa 49 89 f1 48 89 
f9 48 0f 45 c2 48 c7 c7 f0 a1 b0 aa 4c 89 d2 48 89 c6 e8 f1 cf df ff <0f> 0b 48 
83 c4 18 c9 c3 48 c7 c6 b2 8a b2 aa 49 89 f1 49 89 f3 eb 
  [   56.997030] RSP: 0018:b9550248bb08 EFLAGS: 00010286
  [   56.997031] RAX:  RBX: 90fdc8535cb0 RCX: 
0006
  [   56.997032] RDX: 0007 RSI: 0096 RDI: 
90fdded964b0
  [   56.997032] RBP: b9550248bb20 R08: 0001 R09: 
038f
  [   56.997032] R10: 0004 R11:  R12: 
0003
  [   56.997033] R13: 0001 R14: 90fdc8535cb3 R15: 
90fdc8535cf8
  [   56.997033] FS:  7f854792ba80() GS:90fdded8() 
knlGS:
  [   56.997034] CS:  0010 DS:  ES:  CR0: 80050033
  [   56.997034] CR2: 7f8543fd2110 CR3: 000406668006 CR4: 
001606e0
  [  

[Kernel-packages] [Bug 1802050] Re: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10

2019-02-11 Thread kenorb
Also, worth to note, I had similar crashes before upgrading to nVidia,
by using Nouveau driver. See:
https://bugs.freedesktop.org/show_bug.cgi?id=100567

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-390 in Ubuntu.
https://bugs.launchpad.net/bugs/1802050

Title:
  Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10

Status in nvidia-graphics-drivers-390 package in Ubuntu:
  Confirmed

Bug description:
  [   56.996976] [ cut here ]
  [   56.996978] Bad or missing usercopy whitelist? Kernel memory exposure 
attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
  [   56.996985] WARNING: CPU: 3 PID: 2547 at mm/usercopy.c:81 
usercopy_warn+0x81/0xa0
  [   56.996985] Modules linked in: ebtable_filter ebtables ip6table_filter 
ip6_tables iptable_filter devlink bpfilter pci_stub vboxpci(OE) vboxnetadp(OE) 
vboxnetflt(OE) vboxdrv(OE) nvidia_uvm(POE) eeepc_wmi asus_wmi 
snd_hda_codec_hdmi mxm_wmi wmi_bmof sparse_keymap nvidia_drm(POE) 
nvidia_modeset(POE) snd_hda_codec_realtek intel_rapl nvidia(POE) 
snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp 
snd_hda_intel kvm_intel kvm snd_hda_codec joydev snd_hda_core snd_usb_audio 
snd_usbmidi_lib irqbypass snd_hwdep input_leds crct10dif_pclmul crc32_pclmul 
ghash_clmulni_intel snd_pcm pcbc aesni_intel snd_seq_midi snd_seq_midi_event 
aes_x86_64 snd_rawmidi crypto_simd cryptd snd_seq glue_helper drm_kms_helper 
intel_cstate drm ipmi_devintf ipmi_msghandler intel_rapl_perf snd_seq_device 
snd_timer fb_sys_fops
  [   56.997003]  syscopyarea sysfillrect sysimgblt snd mac_hid soundcore wmi 
video mei_me mei acpi_pad sch_fq_codel parport_pc ppdev lp parport ip_tables 
x_tables autofs4 raid10 multipath linear raid456 async_raid6_recov async_memcpy 
async_pq async_xor async_tx xor raid6_pq raid0 libcrc32c raid1 hid_generic 
usbhid hid gpio_ich r8169 i2c_i801 mii lpc_ich ahci libahci
  [   56.997016] CPU: 3 PID: 2547 Comm: Xorg Tainted: P   OE 
4.18.0-10-generic #11-Ubuntu
  [   56.997017] Hardware name: ASUS All Series/Z97-K, BIOS 2902 03/31/2016
  [   56.997018] RIP: 0010:usercopy_warn+0x81/0xa0
  [   56.997018] Code: b0 aa 41 51 4d 89 d8 48 c7 c0 89 8d af aa 49 89 f1 48 89 
f9 48 0f 45 c2 48 c7 c7 f0 a1 b0 aa 4c 89 d2 48 89 c6 e8 f1 cf df ff <0f> 0b 48 
83 c4 18 c9 c3 48 c7 c6 b2 8a b2 aa 49 89 f1 49 89 f3 eb 
  [   56.997030] RSP: 0018:b9550248bb08 EFLAGS: 00010286
  [   56.997031] RAX:  RBX: 90fdc8535cb0 RCX: 
0006
  [   56.997032] RDX: 0007 RSI: 0096 RDI: 
90fdded964b0
  [   56.997032] RBP: b9550248bb20 R08: 0001 R09: 
038f
  [   56.997032] R10: 0004 R11:  R12: 
0003
  [   56.997033] R13: 0001 R14: 90fdc8535cb3 R15: 
90fdc8535cf8
  [   56.997033] FS:  7f854792ba80() GS:90fdded8() 
knlGS:
  [   56.997034] CS:  0010 DS:  ES:  CR0: 80050033
  [   56.997034] CR2: 7f8543fd2110 CR3: 000406668006 CR4: 
001606e0
  [   56.997035] Call Trace:
  [   56.997038]  __check_heap_object+0xc2/0x110
  [   56.997039]  __check_object_size+0x14c/0x178
  [   56.997148]  os_memcpy_to_user+0x26/0x50 [nvidia]
  [   56.997254]  _nv009384rm+0xbf/0xe0 [nvidia]
  [   56.997255] WARNING: kernel stack frame pointer at 8fe294b8 in 
Xorg:2547 has bad value 5fea09aa
  [   56.997256] unwind stack type:0 next_sp:  (null) mask:0x2 
graph_idx:0
  [   56.997257] 1bbff756: b9550248bb30 (0xb9550248bb30)
  [   56.997258] 21ffd201: a9c69a82 
(__check_heap_object+0xc2/0x110)
  [   56.997258] 2ec89026: b9550248bb60 (0xb9550248bb60)
  [   56.997259] 297b0120: a9c91abc 
(__check_object_size+0x14c/0x178)
  [   56.997259] 8569feab: 0003 (0x3)
  [   56.997260] b2feea66: 90fdc8535cb0 (0x90fdc8535cb0)
  [   56.997260] a5345c20: 55924226a63e (0x55924226a63e)
  [   56.997260] 798de38a:  ...
  [   56.997261] 871cf253: b9550248bb88 (0xb9550248bb88)
  [   56.997311] 5645fcd2: c0b305b6 
(os_memcpy_to_user+0x26/0x50 [nvidia])
  [   56.997311] 929e7969: 0003 (0x3)
  [   56.997312] cbb9721c: 55924226a63e (0x55924226a63e)
  [   56.997312] cf624d15: 90fdc8535cb0 (0x90fdc8535cb0)
  [   56.997312] 8fe294b8: 90fdc8535ca8 (0x90fdc8535ca8)
  [   56.997417] 7dbcdd8d: c110087f (_nv009384rm+0xbf/0xe0 
[nvidia])
  [   56.997418] c0dd2ba2: 90fdc8535c88 (0x90fdc8535c88)
  [   56.997418] a931d446:  ...
  [   56.997418] 28151017: 90fda8ed62e8 (0x90fda8ed62e8)
  [   56.997419] f1c853ed: 90fdc8535db8 (0x90fdc8535db8)
  [   5

[Kernel-packages] [Bug 1802050] Re: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10

2019-02-11 Thread kenorb
Related: https://forums.geforce.com/default/topic/1098094/geforce-
drivers/kernel-crash-at-_nv009384rm-os_memcpy_to_user-bad-or-missing-
usercopy-whitelist-/

Here are the full logs:
- 1st crash: https://gist.github.com/kenorb/4b7c333e648498f3d31c3ed8dc6625f4
- 2nd crash (posted above): 
https://gist.github.com/kenorb/8ff664d2cb439ad624b438a954b11c62

Logs:
caller os_map_kernel_space.part.9+0xdb/0x120 [nvidia] mapping multiple BARs
Bad or missing usercopy whitelist? Kernel memory exposure attempt detected from 
SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
WARNING: CPU: 11 PID: 7002 at mm/usercopy.c:81 usercopy_warn+0x81/0xa0
CPU: 11 PID: 7002 Comm: Xorg Tainted: P   O  4.18.0-15-generic 
#16-Ubuntu
Hardware name: Punch technology 646-716-44/ROG RAMPAGE VI EXTREME, BIOS 1401 
05/10/2018
RIP: 0010:usercopy_warn+0x81/0xa0
RSP: 0018:b1a54aa5fb08 EFLAGS: 00010286
RAX:  RBX: 936a9e355cb0 RCX: 0006
RDX: 0007 RSI: 0096 RDI: 936b1f2d64b0
RBP: b1a54aa5fb20 R08: 0001 R09: 0692
R10: 0004 R11:  R12: 0003
R13: 0001 R14: 936a9e355cb3 R15: 936a9e355cf8
FS:  7f7c7ce35a80() GS:936b1f2c() knlGS:
CS:  0010 DS:  ES:  CR0: 80050033
CR2: 7f7c794d0110 CR3: 001f8b2d0006 CR4: 003606e0
DR0:  DR1:  DR2: 
DR3:  DR6: fffe0ff0 DR7: 0400
Call Trace:
 __check_heap_object+0xc2/0x110
 __check_object_size+0x14c/0x178
 os_memcpy_to_user+0x26/0x50 [nvidia]
 _nv009384rm+0xbf/0xe0 [nvidia]
WARNING: kernel stack frame pointer at 9e6b26f6 in Xorg:7002 has bad 
value d8fadd4b
unwind stack type:0 next_sp:  (null) mask:0x2 graph_idx:0
3b11c2d2: b1a54aa5fb30 (0xb1a54aa5fb30)
764c1375: ad469e02 (__check_heap_object+0xc2/0x110)
...
 ? _nv028097rm+0x79/0x90 [nvidia]
 ? _nv028097rm+0x55/0x90 [nvidia]
 ? _nv013699rm+0xee/0x100 [nvidia]
 ? _nv015347rm+0x154/0x270 [nvidia]
 ? _nv008317rm+0x134/0x1a0 [nvidia]
 ? _nv008296rm+0x29c/0x2b0 [nvidia]
 ? _nv001072rm+0xe/0x20 [nvidia]
 ? _nv007324rm+0xd8/0x100 [nvidia]
 ? _nv001171rm+0x627/0x830 [nvidia]
 ? rm_ioctl+0x73/0x100 [nvidia]
 ? __check_object_size+0x91/0x178
 ? nvidia_ioctl+0x56a/0x720 [nvidia]
 ? nvidia_frontend_unlocked_ioctl+0x3e/0x50 [nvidia]
 ? do_vfs_ioctl+0xa8/0x620
 ? handle_mm_fault+0xe1/0x210
 ? ksys_ioctl+0x67/0x90
 ? __x64_sys_ioctl+0x1a/0x20
 ? do_syscall_64+0x5a/0x110
 ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
---[ end trace e0cf3a831124a7e9 ]---

** Bug watch added: freedesktop.org Bugzilla #100567
   https://bugs.freedesktop.org/show_bug.cgi?id=100567

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to nvidia-graphics-drivers-390 in Ubuntu.
https://bugs.launchpad.net/bugs/1802050

Title:
  Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10

Status in nvidia-graphics-drivers-390 package in Ubuntu:
  Confirmed

Bug description:
  [   56.996976] [ cut here ]
  [   56.996978] Bad or missing usercopy whitelist? Kernel memory exposure 
attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
  [   56.996985] WARNING: CPU: 3 PID: 2547 at mm/usercopy.c:81 
usercopy_warn+0x81/0xa0
  [   56.996985] Modules linked in: ebtable_filter ebtables ip6table_filter 
ip6_tables iptable_filter devlink bpfilter pci_stub vboxpci(OE) vboxnetadp(OE) 
vboxnetflt(OE) vboxdrv(OE) nvidia_uvm(POE) eeepc_wmi asus_wmi 
snd_hda_codec_hdmi mxm_wmi wmi_bmof sparse_keymap nvidia_drm(POE) 
nvidia_modeset(POE) snd_hda_codec_realtek intel_rapl nvidia(POE) 
snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp 
snd_hda_intel kvm_intel kvm snd_hda_codec joydev snd_hda_core snd_usb_audio 
snd_usbmidi_lib irqbypass snd_hwdep input_leds crct10dif_pclmul crc32_pclmul 
ghash_clmulni_intel snd_pcm pcbc aesni_intel snd_seq_midi snd_seq_midi_event 
aes_x86_64 snd_rawmidi crypto_simd cryptd snd_seq glue_helper drm_kms_helper 
intel_cstate drm ipmi_devintf ipmi_msghandler intel_rapl_perf snd_seq_device 
snd_timer fb_sys_fops
  [   56.997003]  syscopyarea sysfillrect sysimgblt snd mac_hid soundcore wmi 
video mei_me mei acpi_pad sch_fq_codel parport_pc ppdev lp parport ip_tables 
x_tables autofs4 raid10 multipath linear raid456 async_raid6_recov async_memcpy 
async_pq async_xor async_tx xor raid6_pq raid0 libcrc32c raid1 hid_generic 
usbhid hid gpio_ich r8169 i2c_i801 mii lpc_ich ahci libahci
  [   56.997016] CPU: 3 PID: 2547 Comm: Xorg Tainted: P   OE 
4.18.0-10-generic #11-Ubuntu
  [   56.997017] Hardware name: ASUS All Series/Z97-K, BIOS 2902 03/31/2016
  [   56.997018] RIP: 0010:usercopy_warn+0x81/0xa0
  [   56.997018] Code: b0 aa 41 51 4d 89 d8 48 c7 c0 89 8d af aa 49 89 f1 48 89 
f9 48 0f 45 c2 48