[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-29 Thread Peter Maydell
I wrote: "PS: it's possible that that commit doesn't actually fix the
underlying kernel crash, it just means that rr isn't triggering it any more,
and that if you modified EFLAGS via the ptrace interface rather than r11
you'd get the crash back again."

But looking at the kernel, I think that is impossible, since the set_flags()
function in arch/x86/kernel/ptrace.c sanitizes EFLAGS when it is set via
ptrace. The issue fixed by 29722cd4ef66 is that setting r11 could, in some
situations, bypass that sanitizing.
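
To illustrate what "sanitizing" means here, the following is a minimal Python sketch of the general idea: only an allow-listed set of EFLAGS bits is taken from the tracer-supplied value, and every other bit keeps its current value. The bit values are the architectural x86 ones, but USER_MASK and sanitize_flags() are hypothetical names chosen for this sketch, and the mask is an assumption rather than a copy of the kernel's definition.

```python
# x86 EFLAGS bit values (architectural constants).
CF, PF, AF, ZF, SF = 0x1, 0x4, 0x10, 0x40, 0x80
TF, IF, DF, OF = 0x100, 0x200, 0x400, 0x800
IOPL, NT, RF, AC = 0x3000, 0x4000, 0x10000, 0x40000

# Hypothetical allow-list of bits a tracer may change (an assumption for
# illustration). Privileged bits such as IF and IOPL are deliberately absent.
USER_MASK = CF | PF | AF | ZF | SF | TF | DF | OF | NT | RF | AC


def sanitize_flags(current, requested):
    """Merge a requested flags value into the current one.

    Bits inside USER_MASK come from `requested`; all other bits are
    preserved from `current`, so a tracer cannot alter them.
    """
    return (current & ~USER_MASK) | (requested & USER_MASK)


# The tracee currently has interrupts enabled (IF) plus the always-set bit 1.
current = IF | 0x2
# The tracer asks for IOPL=3 with IF cleared, and also sets CF.
requested = IOPL | CF
result = sanitize_flags(current, requested)
```

In this sketch the request to raise IOPL and clear IF is silently dropped, while the harmless CF change goes through. A path that writes the saved flags without passing through such a merge is the kind of bypass described above.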

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1535307

Title:
  running 'rr' make check causes kernel "hard LOCKUP"

Status in linux package in Ubuntu:
  Triaged

Bug description:
  Running the 'rr' make check on Ubuntu Trusty causes the machine to
  become unusable because the kernel crashes. (rr build-and-make-check
  instructions:
  https://github.com/mozilla/rr/wiki/Building-And-Installing)

  This is 3.13.0-74-generic #118-Ubuntu for x86_64.

  This is rr bug https://github.com/mozilla/rr/issues/1614 (which
  includes information about a Debian Jessie kernel crash initially and
  also the Ubuntu crash later on in the same report), but since crashing
  the kernel is a kernel bug I'm reporting that part here.

  kern.log reports:

  Jan 18 12:33:12 e104462 kernel: [  367.805326] ------------[ cut here ]------------
  Jan 18 12:33:12 e104462 kernel: [  367.805333] WARNING: CPU: 2 PID: 4294 at /build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245 watchdog_overflow_callback+0x9c/0xd0()
  Jan 18 12:33:12 e104462 kernel: [  367.805334] Watchdog detected hard LOCKUP on cpu 2
  Jan 18 12:33:12 e104462 kernel: [  367.805335] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi
  Jan 18 12:33:12 e104462 kernel: [  367.805365] CPU: 2 PID: 4294 Comm: rr:async_signal Tainted: G   OX 3.13.0-74-generic #118-Ubuntu
  Jan 18 12:33:12 e104462 kernel: [  367.805366] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013
  Jan 18 12:33:12 e104462 kernel: [  367.805367]  0009 8804033abc78 81724b70 8804033abcc0
  Jan 18 12:33:12 e104462 kernel: [  367.805369]  8804033abcb0 810678bd 880407798000
  Jan 18 12:33:12 e104462 kernel: [  367.805370]  8804033abde0  8804033abf58 8804033abd10
  Jan 18 12:33:12 e104462 kernel: [  367.805372] Call Trace:
  Jan 18 12:33:12 e104462 kernel: [  367.805376]  [] dump_stack+0x45/0x56
  Jan 18 12:33:12 e104462 kernel: [  367.805379]  [] warn_slowpath_common+0x7d/0xa0
  Jan 18 12:33:12 e104462 kernel: [  367.805380]  [] warn_slowpath_fmt+0x4c/0x50
  Jan 18 12:33:12 e104462 kernel: [  367.805382]  [] ? restart_watchdog_hrtimer+0x50/0x50
  Jan 18 12:33:12 e104462 kernel: [  367.805384]  [] watchdog_overflow_callback+0x9c/0xd0
  Jan 18 12:33:12 e104462 kernel: [  367.805386]  [] __perf_event_overflow+0x8e/0x250
  Jan 18 12:33:12 e104462 kernel: [  367.805389]  [] ? x86_perf_event_set_period+0xe8/0x150
  Jan 18 12:33:12 e104462 kernel: [  367.805390]  [] perf_event_overflow+0x14/0x20
  Jan 18 12:33:12 e104462 kernel: [  367.805392]  [] intel_pmu_handle_irq+0x1ed/0x3f0
  Jan 18 12:33:12 e104462 kernel: [  367.805395]  [] perf_event_nmi_handler+0x2b/0x50
  Jan 18 12:33:12 e104462 kernel: [  367.805397]  [] nmi_handle.isra.2+0x88/0x180
  Jan 18 12:33:12 e104462 kernel: [  367.805399]  [] do_nmi+0x1c9/0x3e0
  Jan 18 12:33:12 e104462 kernel: [  367.805400]  [] nmi+0x5a/0xbf
  Jan 18 12:33:12 e104462 kernel: [  367.805402] ---[ end trace 1511edbd209fafac ]---

  followed a little later by

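  Jan 18 12:33:33 e104462 kernel: [  392.175012] BUG: soft lockup - CPU#1 stuck for 23s! [condvar_stress-:3380]
  Jan 18 12:33:33 e104462 kernel: [  392.175014] Modules linked in: pci_stub vboxpci(OX) vboxnetadp(OX) vboxnetflt(OX) vboxdrv(OX) bnep rfcomm bluetooth nfsd auth_rpcgss nfs_acl binfmt_misc nfs lockd sunrpc fscache dm_crypt snd_hda_codec_hdmi intel_rapl snd_hda_codec_realtek x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_intel snd_hda_codec kvm pl2303 usbserial snd_hwdep snd_pcm crct10dif_pclmul crc32_pclmul snd_page_alloc snd_seq_midi snd_seq_midi_event snd_rawmidi aesni_intel aes_x86_64 lrw snd_seq gf128mul glue_helper ablk_helper cryptd snd_seq_device serio_raw snd_timer lpc_ich snd soundcore mei_me mei shpchp mac_hid parport_pc ppdev lp parport hid_generic usbhid hid usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper e1000e psmouse ahci drm ptp libahci pps_core video wmi
  Jan 18 12:33:33 e104462 kernel: [  392.175054] CPU: 1 PID: 3380 Comm: condvar_stress-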

[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-28 Thread Peter Maydell
I've now completed the kernel git bisect. git bisect says the commit
which fixed this issue is 29722cd4ef666705b2eda1c3ba44435488e509eb
("x86/asm/entry/64: Save R11 into pt_regs->flags on SYSCALL64
fastpath").

This fits in with the discovery on the rr side that the rr commit which
started causing the test suite to deadlock the kernel is
https://github.com/mozilla/rr/commit/36ac389649e3e84caa46d64487813c018857f831
which is an R11-related change.

rr upstream have now committed
https://github.com/mozilla/rr/commit/11e67a66a2218effb9381b9ac86a2085b0f5d5d1
which is essentially working around this kernel bug. However, since
"unprivileged user can deadlock the kernel" is a local-user DoS exploit
I think the fix is still worth backporting.

I haven't attempted to analyze the kernel change to determine why the
kernel was crashing before or if you could do something more interesting
than just crash it by setting r11 to something other than 0 in the
userspace code.


** Changed in: linux (Ubuntu)
   Status: Incomplete => Confirmed


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-28 Thread Peter Maydell
PS: it's possible that that commit doesn't actually fix the underlying
kernel crash, it just means that rr isn't triggering it any more, and
that if you modified EFLAGS via the ptrace interface rather than r11
you'd get the crash back again.


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-28 Thread Seth Arnold
** Information type changed from Public to Public Security


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-28 Thread Christopher M. Penalver
** Tags removed: needs-reverse-bisect
** Tags added: cherry-pick reverse-bisect-done

** Changed in: linux (Ubuntu)
   Importance: Medium => High

** Changed in: linux (Ubuntu)
   Status: Confirmed => Triaged


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-25 Thread Christopher M. Penalver
Peter Maydell, the next step is to perform a full reverse commit bisect
between kernels 3.13 and 4.4 in order to identify the last bad commit and
the first good commit that immediately follows it. Once this good commit
has been identified, it may be reviewed for backporting. Could you please
do this following
https://wiki.ubuntu.com/Kernel/KernelBisection#How_do_I_reverse_bisect_the_upstream_kernel.3F
?

Please note, finding adjacent kernel versions is not fully commit
bisecting.
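
For readers unfamiliar with the term, a reverse bisect is a binary search for the first commit at which the bug no longer reproduces, as opposed to the first commit at which it appears. A minimal Python sketch of that search follows. It assumes a linear, oldest-to-newest list of commits, whereas real kernel history is a graph that git walks for you, and both first_fixed() and the is_fixed predicate are hypothetical names used only for illustration.

```python
def first_fixed(commits, is_fixed):
    """Return (last_bad, first_good) from a linear commit list.

    Assumes commits[0] is known to show the bug, commits[-1] is known to
    be fixed, and that once the fix lands it stays in place.
    """
    lo, hi = 0, len(commits) - 1  # lo: last known bad, hi: first known good
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_fixed(commits[mid]):
            hi = mid  # fix is at mid or earlier
        else:
            lo = mid  # bug still present, fix is later
    return commits[lo], commits[hi]


# Ten made-up commit labels; the (pretend) fix lands in "c6".
commits = ["c%d" % i for i in range(10)]
last_bad, first_good = first_fixed(commits, lambda c: int(c[1:]) >= 6)
```

Each call to is_fixed stands in for building and booting that kernel and running the reproducer, which is why the search halves the range instead of testing every commit. Stopping at a pair of adjacent release versions leaves many commits between lo and hi, which is the distinction drawn above.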

After the fix commit (not kernel version) has been identified, then
please mark this report Status Confirmed.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

** Tags added: bios-outdated-9skt91a kernel-fixed-upstream-4.5-rc1
needs-reverse-bisect

** Changed in: linux (Ubuntu)
   Status: Confirmed => Incomplete


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-22 Thread Peter Maydell
I retried with a workaround for the rr bug I described in "caveat (2)"
and the kernel still does not lock up, so I am now confident that this
bug is not present in the upstream kernel.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1535307

Title:
  running 'rr' make check causes kernel "hard LOCKUP"

Status in linux package in Ubuntu:
  Confirmed


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-21 Thread Joseph Salisbury
Did this issue start happening after an update/upgrade?  Was there a
prior kernel version where you were not having this particular problem?

Would it be possible for you to test the latest upstream kernel? Refer
to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest
v4.4 kernel[0].

If this bug is fixed in the mainline kernel, please add the following
tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag:
'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as
"Confirmed".


Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.4-wily

** Changed in: linux (Ubuntu)
   Importance: Undecided => Medium

** Changed in: linux (Ubuntu)
   Status: Confirmed => Incomplete

** Tags added: kernel-da-key


Status in linux package in Ubuntu:
  Incomplete


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-21 Thread Peter Maydell
No, I don't think this was associated with a kernel upgrade, though it's
hard to say for certain as I hadn't run the test suite in some months. I
suspect it's more that rr's test suite got more complex and included
some stress tests that reveal pre-existing kernel bugs.

I will test the upstream kernel.


Status in linux package in Ubuntu:
  Incomplete


[Kernel-packages] [Bug 1535307] Re: running 'rr' make check causes kernel "hard LOCKUP"

2016-01-21 Thread Peter Maydell
I tested with linux-headers-4.4.0-040400 /
linux-headers-4.4.0-040400-generic / linux-image-4.4.0-040400-generic
4.4.0-040400.201601101930. The kernel lockups did *not* reproduce.
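
Before trusting a "did not reproduce" result, it is worth confirming that the machine really booted the mainline build and not the stock 3.13 kernel. A minimal sketch of such a check follows. It assumes that Ubuntu mainline builds carry a six-digit ABI field (as in "4.4.0-040400-generic") while distro kernels use a short one (as in "3.13.0-74-generic"). The helper name `is_mainline_build` is hypothetical and not part of any Ubuntu tool.

```python
import platform
import re

# Assumption: mainline builds look like "4.4.0-040400-generic" (six-digit
# ABI field, optionally followed by an rc suffix); distro kernels look
# like "3.13.0-74-generic".
MAINLINE_RE = re.compile(r"^\d+\.\d+\.\d+-\d{6}(rc\d+)?-\w+$")


def is_mainline_build(release):
    """Return True if a `uname -r` style string looks like a mainline build."""
    return MAINLINE_RE.match(release) is not None


# Report the kernel that is actually running on this machine.
running = platform.release()
print(running, "mainline" if is_mainline_build(running) else "not mainline")
```

This is only a heuristic on the version string; it says nothing about which patches the build contains.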

Two caveats:

(1) I did get this kernel warning in the log:
Jan 21 18:00:08 e104462 kernel: [  171.577000] ------------[ cut here ]------------
Jan 21 18:00:08 e104462 kernel: [  171.577005] WARNING: CPU: 6 PID: 14136 at /home/kernel/COD/linux/kernel/signal.c:2072 get_signal+0x597/0x600()
Jan 21 18:00:08 e104462 kernel: [  171.577006] Modules linked in: rfcomm bnep bluetooth nfsd auth_rpcgss nfs_acl nfs binfmt_misc lockd grace sunrpc fscache drbg ansi_cprng snd_hda_codec_hdmi intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm dm_crypt irqbypass crct10dif_pclmul crc32_pclmul aesni_intel aes_x86_64 lrw snd_hda_codec_realtek snd_hda_codec_generic gf128mul glue_helper ablk_helper snd_hda_intel cryptd snd_hda_codec snd_hda_core snd_hwdep input_leds pl2303 snd_pcm usbserial snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device snd_timer snd 8250_fintek soundcore mei_me mei shpchp ie31200_edac lpc_ich ppdev edac_core serio_raw mac_hid parport_pc lp parport hid_generic usbhid hid uas usb_storage nouveau mxm_wmi i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops e1000e drm ahci psmouse libahci ptp pps_core wmi fjes video
Jan 21 18:00:08 e104462 kernel: [  171.577037] CPU: 6 PID: 14136 Comm: block-z8iSdZiRK Not tainted 4.4.0-040400-generic #201601101930
Jan 21 18:00:08 e104462 kernel: [  171.577038] Hardware name: LENOVO 32281S1/MAHOBAY, BIOS 9SKT71AUS 07/02/2013
Jan 21 18:00:08 e104462 kernel: [  171.577039]   a495bf0f 88040afdbd20 813c8d94
Jan 21 18:00:08 e104462 kernel: [  171.577040]   88040afdbd58 8107dba2
Jan 21 18:00:08 e104462 kernel: [  171.577042]  88040afdbe38 8000 88040afdbf58 880405d58000
Jan 21 18:00:08 e104462 kernel: [  171.577043] Call Trace:
Jan 21 18:00:08 e104462 kernel: [  171.577046]  [] dump_stack+0x44/0x60
Jan 21 18:00:08 e104462 kernel: [  171.577049]  [] warn_slowpath_common+0x82/0xc0
Jan 21 18:00:08 e104462 kernel: [  171.577050]  [] warn_slowpath_null+0x1a/0x20
Jan 21 18:00:08 e104462 kernel: [  171.577051]  [] get_signal+0x597/0x600
Jan 21 18:00:08 e104462 kernel: [  171.577053]  [] do_signal+0x37/0x6f0
Jan 21 18:00:08 e104462 kernel: [  171.577055]  [] ? schedule+0x35/0x80
Jan 21 18:00:08 e104462 kernel: [  171.577057]  [] ? do_exit+0x736/0xab0
Jan 21 18:00:08 e104462 kernel: [  171.577058]  [] ? __switch_to_xtra+0xe2/0x1b0
Jan 21 18:00:08 e104462 kernel: [  171.577061]  [] exit_to_usermode_loop+0x8c/0xd0
Jan 21 18:00:08 e104462 kernel: [  171.577062]  [] syscall_return_slowpath+0x4e/0x60
Jan 21 18:00:08 e104462 kernel: [  171.577064]  [] int_ret_from_sys_call+0x25/0x8f
Jan 21 18:00:08 e104462 kernel: [  171.577065] ---[ end trace 1c1af43c0c2249f3 ]---

but it seems to be a completely different thing from the lockup
backtraces seen with the 3.13 kernel.

(2) rr complains:
1134: [FATAL /home/petmay01/src/rr/src/Monkeypatcher.cc:660:patch_after_exec_arch() errno: 0 'Success']
1134:  -> Failed to monkeypatch vdso: your __kernel_vsyscall() wasn't recognized.
1134: Syscall buffering is now effectively disabled.  If you're OK with
1134: running rr without syscallbuf, then run the recorder passing the
1134: --no-syscall-buffer arg.
1134: If you're *not* OK with that, file an issue.

which indicates that it's not happy with running under a 4.4 kernel in
some way, and the code being run won't be the same as it was under the
3.13 kernel. (I filed https://github.com/mozilla/rr/issues/1627 against
rr about that.)
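
On caveat (1), one rough way to back up the claim that the 4.4 warning is unrelated to the 3.13 lockup is to compare the function names in the two call traces. The sketch below is illustrative only: the excerpts are copied from the traces in this bug with the addresses omitted, and the helper `trace_symbols` and the `generic` set are hypothetical names introduced here, not part of any kernel tooling.

```python
import re

# Matches "symbol+0xOFFSET/0xSIZE" as printed in kernel call traces.
FRAME_RE = re.compile(r"([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+")


def trace_symbols(log_text):
    """Return the function names found in a kernel log excerpt, in order."""
    return FRAME_RE.findall(log_text)


# Frames from the 3.13 hard LOCKUP warning (NMI watchdog path).
trace_3_13 = """
dump_stack+0x45/0x56
warn_slowpath_common+0x7d/0xa0
warn_slowpath_fmt+0x4c/0x50
watchdog_overflow_callback+0x9c/0xd0
__perf_event_overflow+0x8e/0x250
perf_event_nmi_handler+0x2b/0x50
do_nmi+0x1c9/0x3e0
"""

# Frames from the 4.4 warning in kernel/signal.c (signal delivery path).
trace_4_4 = """
dump_stack+0x44/0x60
warn_slowpath_common+0x82/0xc0
warn_slowpath_null+0x1a/0x20
get_signal+0x597/0x600
do_signal+0x37/0x6f0
exit_to_usermode_loop+0x8c/0xd0
syscall_return_slowpath+0x4e/0x60
"""

# Frames printed by the warning machinery itself, which say nothing
# about where the problem is.
generic = {"dump_stack", "warn_slowpath_common"}

shared = (set(trace_symbols(trace_3_13)) & set(trace_symbols(trace_4_4))) - generic
print(sorted(shared))  # an empty list means no code path in common
```

An empty intersection is consistent with the two warnings coming from unrelated paths, though it does not prove they have different root causes.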


** Tags added: kernel-fixed-upstream

** Changed in: linux (Ubuntu)
   Status: Incomplete => Confirmed


Status in linux package in Ubuntu:
  Confirmed

Bug description:
  Running the 'rr' make check on Ubuntu Trusty causes the machine to
  become unusable because the kernel crashes. (rr build-and-make-check
  instructions: https://github.com/mozilla/rr/wiki/Building-And-
  Installing)

  This is 3.13.0-74-generic #118-Ubuntu for x86_64.

  This is rr bug https://github.com/mozilla/rr/issues/1614 (which
  includes information about a Debian Jessie kernel crash initially and
  also the Ubuntu crash later on in the same report), but since crashing
  the kernel is a kernel bug I'm reporting that part here.

  kern.log reports:

  Jan 18 12:33:12 e104462 kernel: [  367.805326] [ cut here 
]
  Jan 18 12:33:12 e104462 kernel: [  367.805333] WARNING: CPU: 2 PID: 4294 at 
/build/linux-_xRakU/linux-3.13.0/kernel/watchdog.c:245