[Kernel-packages] [Bug 1802050] Re: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10
My OS details: $ uname -a Linux Ubuntu-PC 4.18.0-15-generic #16-Ubuntu SMP Thu Feb 7 10:56:39 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux $ lsb_release -a No LSB modules are available. Distributor ID: Ubuntu Description:Ubuntu 18.10 Release:18.10 Codename: cosmic $ lsmod | grep nvidia nvidia_uvm786432 0 nvidia_drm 40960 8 nvidia_modeset 1110016 12 nvidia_drm nvidia 14368768 696 nvidia_uvm,nvidia_modeset drm_kms_helper172032 1 nvidia_drm drm 458752 11 drm_kms_helper,nvidia_drm ipmi_msghandler 102400 2 ipmi_devintf,nvidia $ modinfo nvidia filename: /lib/modules/4.18.0-15-generic/updates/dkms/nvidia.ko alias: char-major-195-* version:390.87 supported: external license:NVIDIA srcversion: 9680830EEE0BAB392576E99 alias: pci:v10DEd0E00sv*sd*bc04sc80i00* alias: pci:v10DEd*sv*sd*bc03sc02i00* alias: pci:v10DEd*sv*sd*bc03sc00i00* depends:ipmi_msghandler retpoline: Y name: nvidia vermagic: 4.18.0-15-generic SMP mod_unload $ lspci -v | grep NVIDIA 17:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 Ti] (rev a1) (prog-if 00 [VGA controller]) 17:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1) 18:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 Ti] (rev a1) (prog-if 00 [VGA controller]) 18:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1) 65:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 Ti] (rev a1) (prog-if 00 [VGA controller]) 65:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1) b4:00.0 VGA compatible controller: NVIDIA Corporation GP102 [GeForce GTX 1080 Ti] (rev a1) (prog-if 00 [VGA controller]) b4:00.1 Audio device: NVIDIA Corporation GP102 HDMI Audio Controller (rev a1) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-390 in Ubuntu. https://bugs.launchpad.net/bugs/1802050 Title: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10 Status in nvidia-graphics-drivers-390 package in Ubuntu: Confirmed Bug description: [ 56.996976] [ cut here ] [ 56.996978] Bad or missing usercopy whitelist? Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)! [ 56.996985] WARNING: CPU: 3 PID: 2547 at mm/usercopy.c:81 usercopy_warn+0x81/0xa0 [ 56.996985] Modules linked in: ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter devlink bpfilter pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) nvidia_uvm(POE) eeepc_wmi asus_wmi snd_hda_codec_hdmi mxm_wmi wmi_bmof sparse_keymap nvidia_drm(POE) nvidia_modeset(POE) snd_hda_codec_realtek intel_rapl nvidia(POE) snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_intel kvm_intel kvm snd_hda_codec joydev snd_hda_core snd_usb_audio snd_usbmidi_lib irqbypass snd_hwdep input_leds crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_pcm pcbc aesni_intel snd_seq_midi snd_seq_midi_event aes_x86_64 snd_rawmidi crypto_simd cryptd snd_seq glue_helper drm_kms_helper intel_cstate drm ipmi_devintf ipmi_msghandler intel_rapl_perf snd_seq_device snd_timer fb_sys_fops [ 56.997003] syscopyarea sysfillrect sysimgblt snd mac_hid soundcore wmi video mei_me mei acpi_pad sch_fq_codel parport_pc ppdev lp parport ip_tables x_tables autofs4 raid10 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid0 libcrc32c raid1 hid_generic usbhid hid gpio_ich r8169 i2c_i801 mii lpc_ich ahci libahci [ 56.997016] CPU: 3 PID: 2547 Comm: Xorg Tainted: P OE 4.18.0-10-generic #11-Ubuntu [ 56.997017] Hardware name: ASUS All Series/Z97-K, BIOS 2902 03/31/2016 [ 56.997018] RIP: 0010:usercopy_warn+0x81/0xa0 [ 56.997018] Code: b0 aa 41 51 4d 89 d8 48 c7 c0 89 8d af aa 49 89 f1 48 89 f9 48 0f 45 c2 48 c7 c7 f0 a1 b0 aa 4c 89 d2 48 89 c6 e8 f1 cf df ff <0f> 0b 48 83 c4 18 c9 c3 48 c7 c6 b2 8a b2 aa 49 89 f1 49 89 f3 eb [ 56.997030] RSP: 0018:b9550248bb08 EFLAGS: 00010286 [ 56.997031] RAX: RBX: 90fdc8535cb0 RCX: 0006 [ 56.997032] RDX: 0007 RSI: 0096 RDI: 90fdded964b0 [ 56.997032] RBP: b9550248bb20 R08: 0001 R09: 038f [ 56.997032] R10: 0004 R11: R12: 0003 [ 56.997033] R13: 0001 R14: 90fdc8535cb3 R15: 90fdc8535cf8 [ 56.997033] FS: 7f854792ba80() GS:90fdded8() knlGS: [ 56.997034] CS: 0010 DS: ES: CR0: 80050033 [ 56.997034] CR2: 7f8543fd2110 CR3: 000406668006 CR4: 001606e0 [
[Kernel-packages] [Bug 1802050] Re: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10
Also, worth to note, I had similar crashes before upgrading to nVidia, by using Nouveau driver. See: https://bugs.freedesktop.org/show_bug.cgi?id=100567 -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-390 in Ubuntu. https://bugs.launchpad.net/bugs/1802050 Title: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10 Status in nvidia-graphics-drivers-390 package in Ubuntu: Confirmed Bug description: [ 56.996976] [ cut here ] [ 56.996978] Bad or missing usercopy whitelist? Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)! [ 56.996985] WARNING: CPU: 3 PID: 2547 at mm/usercopy.c:81 usercopy_warn+0x81/0xa0 [ 56.996985] Modules linked in: ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter devlink bpfilter pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) nvidia_uvm(POE) eeepc_wmi asus_wmi snd_hda_codec_hdmi mxm_wmi wmi_bmof sparse_keymap nvidia_drm(POE) nvidia_modeset(POE) snd_hda_codec_realtek intel_rapl nvidia(POE) snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_intel kvm_intel kvm snd_hda_codec joydev snd_hda_core snd_usb_audio snd_usbmidi_lib irqbypass snd_hwdep input_leds crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_pcm pcbc aesni_intel snd_seq_midi snd_seq_midi_event aes_x86_64 snd_rawmidi crypto_simd cryptd snd_seq glue_helper drm_kms_helper intel_cstate drm ipmi_devintf ipmi_msghandler intel_rapl_perf snd_seq_device snd_timer fb_sys_fops [ 56.997003] syscopyarea sysfillrect sysimgblt snd mac_hid soundcore wmi video mei_me mei acpi_pad sch_fq_codel parport_pc ppdev lp parport ip_tables x_tables autofs4 raid10 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid0 libcrc32c raid1 hid_generic usbhid hid gpio_ich r8169 i2c_i801 mii lpc_ich ahci libahci [ 56.997016] CPU: 3 PID: 2547 Comm: Xorg Tainted: P OE 4.18.0-10-generic #11-Ubuntu [ 56.997017] Hardware name: ASUS All Series/Z97-K, BIOS 2902 03/31/2016 [ 56.997018] RIP: 0010:usercopy_warn+0x81/0xa0 [ 56.997018] Code: b0 aa 41 51 4d 89 d8 48 c7 c0 89 8d af aa 49 89 f1 48 89 f9 48 0f 45 c2 48 c7 c7 f0 a1 b0 aa 4c 89 d2 48 89 c6 e8 f1 cf df ff <0f> 0b 48 83 c4 18 c9 c3 48 c7 c6 b2 8a b2 aa 49 89 f1 49 89 f3 eb [ 56.997030] RSP: 0018:b9550248bb08 EFLAGS: 00010286 [ 56.997031] RAX: RBX: 90fdc8535cb0 RCX: 0006 [ 56.997032] RDX: 0007 RSI: 0096 RDI: 90fdded964b0 [ 56.997032] RBP: b9550248bb20 R08: 0001 R09: 038f [ 56.997032] R10: 0004 R11: R12: 0003 [ 56.997033] R13: 0001 R14: 90fdc8535cb3 R15: 90fdc8535cf8 [ 56.997033] FS: 7f854792ba80() GS:90fdded8() knlGS: [ 56.997034] CS: 0010 DS: ES: CR0: 80050033 [ 56.997034] CR2: 7f8543fd2110 CR3: 000406668006 CR4: 001606e0 [ 56.997035] Call Trace: [ 56.997038] __check_heap_object+0xc2/0x110 [ 56.997039] __check_object_size+0x14c/0x178 [ 56.997148] os_memcpy_to_user+0x26/0x50 [nvidia] [ 56.997254] _nv009384rm+0xbf/0xe0 [nvidia] [ 56.997255] WARNING: kernel stack frame pointer at 8fe294b8 in Xorg:2547 has bad value 5fea09aa [ 56.997256] unwind stack type:0 next_sp: (null) mask:0x2 graph_idx:0 [ 56.997257] 1bbff756: b9550248bb30 (0xb9550248bb30) [ 56.997258] 21ffd201: a9c69a82 (__check_heap_object+0xc2/0x110) [ 56.997258] 2ec89026: b9550248bb60 (0xb9550248bb60) [ 56.997259] 297b0120: a9c91abc (__check_object_size+0x14c/0x178) [ 56.997259] 8569feab: 0003 (0x3) [ 56.997260] b2feea66: 90fdc8535cb0 (0x90fdc8535cb0) [ 56.997260] a5345c20: 55924226a63e (0x55924226a63e) [ 56.997260] 798de38a: ... [ 56.997261] 871cf253: b9550248bb88 (0xb9550248bb88) [ 56.997311] 5645fcd2: c0b305b6 (os_memcpy_to_user+0x26/0x50 [nvidia]) [ 56.997311] 929e7969: 0003 (0x3) [ 56.997312] cbb9721c: 55924226a63e (0x55924226a63e) [ 56.997312] cf624d15: 90fdc8535cb0 (0x90fdc8535cb0) [ 56.997312] 8fe294b8: 90fdc8535ca8 (0x90fdc8535ca8) [ 56.997417] 7dbcdd8d: c110087f (_nv009384rm+0xbf/0xe0 [nvidia]) [ 56.997418] c0dd2ba2: 90fdc8535c88 (0x90fdc8535c88) [ 56.997418] a931d446: ... [ 56.997418] 28151017: 90fda8ed62e8 (0x90fda8ed62e8) [ 56.997419] f1c853ed: 90fdc8535db8 (0x90fdc8535db8) [ 5
[Kernel-packages] [Bug 1802050] Re: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10
Related: https://forums.geforce.com/default/topic/1098094/geforce- drivers/kernel-crash-at-_nv009384rm-os_memcpy_to_user-bad-or-missing- usercopy-whitelist-/ Here are the full logs: - 1st crash: https://gist.github.com/kenorb/4b7c333e648498f3d31c3ed8dc6625f4 - 2nd crash (posted above): https://gist.github.com/kenorb/8ff664d2cb439ad624b438a954b11c62 Logs: caller os_map_kernel_space.part.9+0xdb/0x120 [nvidia] mapping multiple BARs Bad or missing usercopy whitelist? Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)! WARNING: CPU: 11 PID: 7002 at mm/usercopy.c:81 usercopy_warn+0x81/0xa0 CPU: 11 PID: 7002 Comm: Xorg Tainted: P O 4.18.0-15-generic #16-Ubuntu Hardware name: Punch technology 646-716-44/ROG RAMPAGE VI EXTREME, BIOS 1401 05/10/2018 RIP: 0010:usercopy_warn+0x81/0xa0 RSP: 0018:b1a54aa5fb08 EFLAGS: 00010286 RAX: RBX: 936a9e355cb0 RCX: 0006 RDX: 0007 RSI: 0096 RDI: 936b1f2d64b0 RBP: b1a54aa5fb20 R08: 0001 R09: 0692 R10: 0004 R11: R12: 0003 R13: 0001 R14: 936a9e355cb3 R15: 936a9e355cf8 FS: 7f7c7ce35a80() GS:936b1f2c() knlGS: CS: 0010 DS: ES: CR0: 80050033 CR2: 7f7c794d0110 CR3: 001f8b2d0006 CR4: 003606e0 DR0: DR1: DR2: DR3: DR6: fffe0ff0 DR7: 0400 Call Trace: __check_heap_object+0xc2/0x110 __check_object_size+0x14c/0x178 os_memcpy_to_user+0x26/0x50 [nvidia] _nv009384rm+0xbf/0xe0 [nvidia] WARNING: kernel stack frame pointer at 9e6b26f6 in Xorg:7002 has bad value d8fadd4b unwind stack type:0 next_sp: (null) mask:0x2 graph_idx:0 3b11c2d2: b1a54aa5fb30 (0xb1a54aa5fb30) 764c1375: ad469e02 (__check_heap_object+0xc2/0x110) ... ? _nv028097rm+0x79/0x90 [nvidia] ? _nv028097rm+0x55/0x90 [nvidia] ? _nv013699rm+0xee/0x100 [nvidia] ? _nv015347rm+0x154/0x270 [nvidia] ? _nv008317rm+0x134/0x1a0 [nvidia] ? _nv008296rm+0x29c/0x2b0 [nvidia] ? _nv001072rm+0xe/0x20 [nvidia] ? _nv007324rm+0xd8/0x100 [nvidia] ? _nv001171rm+0x627/0x830 [nvidia] ? rm_ioctl+0x73/0x100 [nvidia] ? __check_object_size+0x91/0x178 ? nvidia_ioctl+0x56a/0x720 [nvidia] ? nvidia_frontend_unlocked_ioctl+0x3e/0x50 [nvidia] ? do_vfs_ioctl+0xa8/0x620 ? handle_mm_fault+0xe1/0x210 ? ksys_ioctl+0x67/0x90 ? __x64_sys_ioctl+0x1a/0x20 ? do_syscall_64+0x5a/0x110 ? entry_SYSCALL_64_after_hwframe+0x44/0xa9 ---[ end trace e0cf3a831124a7e9 ]--- ** Bug watch added: freedesktop.org Bugzilla #100567 https://bugs.freedesktop.org/show_bug.cgi?id=100567 -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to nvidia-graphics-drivers-390 in Ubuntu. https://bugs.launchpad.net/bugs/1802050 Title: Kernel module crash 4.18.0-10 after upgrade from 18.04 to 18.10 Status in nvidia-graphics-drivers-390 package in Ubuntu: Confirmed Bug description: [ 56.996976] [ cut here ] [ 56.996978] Bad or missing usercopy whitelist? Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)! [ 56.996985] WARNING: CPU: 3 PID: 2547 at mm/usercopy.c:81 usercopy_warn+0x81/0xa0 [ 56.996985] Modules linked in: ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter devlink bpfilter pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) nvidia_uvm(POE) eeepc_wmi asus_wmi snd_hda_codec_hdmi mxm_wmi wmi_bmof sparse_keymap nvidia_drm(POE) nvidia_modeset(POE) snd_hda_codec_realtek intel_rapl nvidia(POE) snd_hda_codec_generic x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_intel kvm_intel kvm snd_hda_codec joydev snd_hda_core snd_usb_audio snd_usbmidi_lib irqbypass snd_hwdep input_leds crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_pcm pcbc aesni_intel snd_seq_midi snd_seq_midi_event aes_x86_64 snd_rawmidi crypto_simd cryptd snd_seq glue_helper drm_kms_helper intel_cstate drm ipmi_devintf ipmi_msghandler intel_rapl_perf snd_seq_device snd_timer fb_sys_fops [ 56.997003] syscopyarea sysfillrect sysimgblt snd mac_hid soundcore wmi video mei_me mei acpi_pad sch_fq_codel parport_pc ppdev lp parport ip_tables x_tables autofs4 raid10 multipath linear raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid0 libcrc32c raid1 hid_generic usbhid hid gpio_ich r8169 i2c_i801 mii lpc_ich ahci libahci [ 56.997016] CPU: 3 PID: 2547 Comm: Xorg Tainted: P OE 4.18.0-10-generic #11-Ubuntu [ 56.997017] Hardware name: ASUS All Series/Z97-K, BIOS 2902 03/31/2016 [ 56.997018] RIP: 0010:usercopy_warn+0x81/0xa0 [ 56.997018] Code: b0 aa 41 51 4d 89 d8 48 c7 c0 89 8d af aa 49 89 f1 48 89 f9 48 0f 45 c2 48