[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
I wrote: "PS: it's possible that that commit doesn't actually fix the underlying kernel crash, it just means that rr isn't triggering it any more, and that if you modified EFLAGS via the ptrace interface rather than r11 you'd get the crash back again." but looking at the kernel I think that is impossible, since the set_flags() function in arch/x86/kernel/ptrace.c sanitizes eflags if set via ptrace. The issue fixed by 29722cd4ef66 is that setting r11 in some situations would bypass the sanitizing. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Triaged Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
I've now completed the kernel git bisect. git bisect says the commit which fixed this issue is 29722cd4ef666705b2eda1c3ba44435488e509eb ("x86/asm/entry/64: Save R11 into pt_regs->flags on SYSCALL64 fastpath"). This fits in with the discovery on the rr side that the rr commit which started causing the test suite to deadlock the kernel is https://github.com/mozilla/rr/commit/36ac389649e3e84caa46d64487813c018857f831 which is an R11-related change. rr upstream have now committed https://github.com/mozilla/rr/commit/11e67a66a2218effb9381b9ac86a2085b0f5d5d1 which is essentially working around this kernel bug. However, since "unprivileged user can deadlock the kernel" is a local-user DoS exploit I think the fix is still worth backporting. I haven't attempted to analyze the kernel change to determine why the kernel was crashing before or if you could do something more interesting than just crash it by setting r11 to something other than 0 in the userspace code. ** Changed in: linux (Ubuntu) Status: Incomplete => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Confirmed Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
PS: it's possible that that commit doesn't actually fix the underlying kernel crash, it just means that rr isn't triggering it any more, and that if you modified EFLAGS via the ptrace interface rather than r11 you'd get the crash back again. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Confirmed Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
** Information type changed from Public to Public Security -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Triaged Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:33 e104462 kernel: [ 392.175054] CPU: 1 PID: 3380 Comm: condvar_stress-
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
** Tags removed: needs-reverse-bisect ** Tags added: cherry-pick reverse-bisect-done ** Changed in: linux (Ubuntu) Importance: Medium => High ** Changed in: linux (Ubuntu) Status: Confirmed => Triaged -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Triaged Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
Peter Maydell, the next step is to fully reverse commit bisect from kernel 3.13 to 4.4 in order to identify the last bad commit, followed immediately by the first good one. Once this good commit has been identified, it may be reviewed for backporting. Could you please do this following https://wiki.ubuntu.com/Kernel/KernelBisection#How_do_I_reverse_bisect_the_upstream_kernel.3F ? Please note, finding adjacent kernel versions is not fully commit bisecting. After the fix commit (not kernel version) has been identified, then please mark this report Status Confirmed. Thank you for your understanding. Helpful bug reporting tips: https://wiki.ubuntu.com/ReportingBugs ** Tags added: bios-outdated-9skt91a kernel-fixed-upstream-4.5-rc1 needs-reverse-bisect ** Changed in: linux (Ubuntu) Status: Confirmed => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Incomplete Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
I retried with a workaround for the rr bug I described in "caveat (2)" and the kernel still does not lockup, so I am now confident that this bug is not present in the upstream kernel. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Confirmed Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
Did this issue start happening after an update/upgrade? Was there a prior kernel version where you were not having this particular problem? Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.4 kernel[0]. If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'. If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'. Once testing of the upstream kernel is complete, please mark this bug as "Confirmed". Thanks in advance. [0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.4-wily ** Changed in: linux (Ubuntu) Importance: Undecided => Medium ** Changed in: linux (Ubuntu) Status: Confirmed => Incomplete ** Tags added: kernel-da-key -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Incomplete Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
No, I don't think this was associated with a kernel upgrade, though it's hard to say for certain as I hadn't run the test suite in some months. I suspect it's more that rr's test suite got more complex and included some stress tests that reveal pre-existing kernel bugs. I will test the upstream kernel. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Incomplete Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x 9c/0xd0() Jan 18 12:33:12 e104462 kernel: [ 367.805334] Watchdog detected hard LOCKUP on cpu 2 Jan 18 12:33:12 e104462 kernel: [ 367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi Jan 18 12:33:12 e104462 kernel: [ 367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G OX 3.13.0-74-generic #118-Ubuntu Jan 18 12:33:12 e104462 kernel: [ 367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 18 12:33:12 e104462 kernel: [ 367.805367] 0009 8804033abc78 81724b70 8804033abcc0 Jan 18 12:33:12 e104462 kernel: [ 367.805369] 8804033abcb0 810678bd 880407798000 Jan 18 12:33:12 e104462 kernel: [ 367.805370] 8804033abde0 8804033abf58 8804033abd10 Jan 18 12:33:12 e104462 kernel: [ 367.805372] Call Trace: Jan 18 12:33:12 e104462 kernel: [ 367.805376] [] dump_stack+0x45/0x56 Jan 18 12:33:12 e104462 kernel: [ 367.805379] [] warn_slowpath_common+0x7d/0xa0 Jan 18 12:33:12 e104462 kernel: [ 367.805380] [] warn_slowpath_fmt+0x4c/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805382] [] ? restart_watchdog_hrtimer+0x50/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805384] [] watchdog_overflow_callback+0x9c/0xd0 Jan 18 12:33:12 e104462 kernel: [ 367.805386] [] __perf_event_overflow+0x8e/0x250 Jan 18 12:33:12 e104462 kernel: [ 367.805389] [] ? x86_perf_event_set_period+0xe8/0x150 Jan 18 12:33:12 e104462 kernel: [ 367.805390] [] perf_event_overflow+0x14/0x20 Jan 18 12:33:12 e104462 kernel: [ 367.805392] [] intel_pmu_handle_irq+0x1ed/0x3f0 Jan 18 12:33:12 e104462 kernel: [ 367.805395] [] perf_event_nmi_handler+0x2b/0x50 Jan 18 12:33:12 e104462 kernel: [ 367.805397] [] nmi_handle.isra.2+0x88/0x180 Jan 18 12:33:12 e104462 kernel: [ 367.805399] [] do_nmi+0x1c9/0x3e0 Jan 18 12:33:12 e104462 kernel: [ 367.805400] [] nmi+0x5a/0xbf Jan 18 12:33:12 e104462 kernel: [ 367.805402] ---[ end trace 1511edbd209fafac ]--- followed a little later by Jan 18 12:33:33 e104462 kernel: [ 392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380] Jan 18 12:33:33 e104462 kernel: [ 392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc
[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"
I tested with linux-headers-4.4.0-040400 / linux- headers-4.4.0-040400-generic / linux-image-4.4.0-040400-generic 4.4.0-040400.201601101930. The kernel lockups did *not* reproduce. Two caveats: (1) I did get this kernel warning in the log: Jan 21 18:00:08 e104462 kernel: [ 171.577000] [ cut here ] Jan 21 18:00:08 e104462 kernel: [ 171.577005] WARNING: CPU: 6 PID: 14136 at /home/kernel/COD/linux/kernel/signal.c:2072 get_signal+0x597/0x600() Jan 21 18:00:08 e104462 kernel: [ 171.577006] Modules linked in: rfcomm bnep bluetooth nfsd auth_rpcgss nfs_acl nfs binfmt_misc lockd grace sunrpc fscache drbg ansi_cprng snd_hda_codec_hdmi intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm dm_crypt irqbypass crct10dif_pclmul crc32_pclmul aesni_intel aes_x86_64 lrw snd_hda_codec_realtek snd_hda_codec_generic gf128mul glue_helper ablk_helper snd_hda_intel cryptd snd_hda_codec snd_hda_core snd_hwdep input_leds pl2303 snd_pcm usbserial snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device snd_timer snd 8250_fintek soundcore mei_me mei shpchp ie31200_edac lpc_ich ppdev edac_core serio_raw mac_hid parport_pc lp parport hid_generic usbhid hid uas usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops e1000e drm ahci psmouse libahci ptp pps_core wmi fjes video Jan 21 18:00:08 e104462 kernel: [ 171.577037] CPU: 6 PID: 14136 Comm: block-z8iSdZiRK Not tainted 4.4.0-040400-generic #201601101930 Jan 21 18:00:08 e104462 kernel: [ 171.577038] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013 Jan 21 18:00:08 e104462 kernel: [ 171.577039] a495bf0f 88040afdbd20 813c8d94 Jan 21 18:00:08 e104462 kernel: [ 171.577040] 88040afdbd58 8107dba2 Jan 21 18:00:08 e104462 kernel: [ 171.577042] 88040afdbe38 8000 88040afdbf58 880405d58000 Jan 21 18:00:08 e104462 kernel: [ 171.577043] Call Trace: Jan 21 18:00:08 e104462 kernel: [ 171.577046] [] dump_stack+0x44/0x60 Jan 21 18:00:08 e104462 kernel: [ 171.577049] [] warn_slowpath_common+0x82/0xc0 Jan 21 18:00:08 e104462 kernel: [ 171.577050] [] warn_slowpath_null+0x1a/0x20 Jan 21 18:00:08 e104462 kernel: [ 171.577051] [] get_signal+0x597/0x600 Jan 21 18:00:08 e104462 kernel: [ 171.577053] [] do_signal+0x37/0x6f0 Jan 21 18:00:08 e104462 kernel: [ 171.577055] [] ? schedule+0x35/0x80 Jan 21 18:00:08 e104462 kernel: [ 171.577057] [] ? do_exit+0x736/0xab0 Jan 21 18:00:08 e104462 kernel: [ 171.577058] [] ? __switch_to_xtra+0xe2/0x1b0 Jan 21 18:00:08 e104462 kernel: [ 171.577061] [] exit_to_usermode_loop+0x8c/0xd0 Jan 21 18:00:08 e104462 kernel: [ 171.577062] [] syscall_return_slowpath+0x4e/0x60 Jan 21 18:00:08 e104462 kernel: [ 171.577064] [] int_ret_from_sys_call+0x25/0x8f Jan 21 18:00:08 e104462 kernel: [ 171.577065] ---[ end trace 1c1af43c0c2249f3 ]--- but it seems to be a completely different thing from the lockup back traces seen with the 3.13 kernel. (2) rr complains: 1134: [FATAL /home/petmay01/src/rr/src/Monkeypatcher.cc:660:patch_after_exec_arch() errno: 0 'Success'] 1134: -> Failed to monkeypatch vdso: your __kernel_vsyscall() wasn't recognized. 1134: Syscall buffering is now effectively disabled. If you're OK with 1134: running rr without syscallbuf, then run the recorder passing the 1134: --no-syscall-buffer arg. 1134: If you're *not* OK with that, file an issue. which indicates that it's not happy with running under a 4.4 kernel in some way, and the code being run won't be the same as it was under the 3.13 kernel. (I filed https://github.com/mozilla/rr/issues/1627 against rr about that.) ** Tags added: kernel-fixed-upstream ** Changed in: linux (Ubuntu) Status: Incomplete => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1535307 Title: running 'rr' make check causes kernel "hard LOCKUP" Status in linux package in Ubuntu: Confirmed Bug description: Running the 'rr' make check on Ubuntu Trusty causes the machine to become unusable because the kernel crashes. (rr build-and-make-check instructions: https://github.com/mozilla/rr/wiki/Building-And- Installing) This is 3.13.0-74-generic #118-Ubuntu for x86_64. This is rr bug https://github.com/mozilla/rr/issues/1614 (which includes information about a Debian Jessie kernel crash initially and also the Ubuntu crash later on in the same report), but since crashing the kernel is a kernel bug I'm reporting that part here. kern.log reports: Jan 18 12:33:12 e104462 kernel: [ 367.805326] [ cut here ] Jan 18 12:33:12 e104462 kernel: [ 367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245