Re: another perf_fuzzer soft lockup
* Vince Weaver wrote: > After some idle time I've started running the perf_fuzzer again. > > It quickly found this lockup on a core2 system running 3.13 (yes, I > should probably be running 3.14-git but I'd rather wait for rc1 > first). JFYI, if -git looks too dangerous to you (and I cannot blame you for that view in the middle of a merge window) then tip:master is usually a pretty safe substitute for testing, it's the v3.14 perf code on a v3.13 base. > It looks like it's stuck in a lock in generic_exec_single() while > calling perf_event_task_output()? > > I'll try to see if I can come up with a test case. Thanks, those testcases are always very useful! Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
another perf_fuzzer soft lockup
After some idle time I've started running the perf_fuzzer again. It quickly found this lockup on a core2 system running 3.13 (yes, I should probably be running 3.14-git but I'd rather wait for rc1 first). It looks like it's stuck in a lock in generic_exec_single() while calling perf_event_task_output()? I'll try to see if I can come up with a test case. Vince [347336.228001] BUG: soft lockup - CPU#1 stuck for 23s! [perf_fuzzer:13175] [347336.228001] Modules linked in: cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_conservative f71882fg acpi_cpufreq mcs7830 usbnet evdev pcspkr video processor thermal_sys coretemp wmi psmouse serio_raw button ohci_pci ohci_hcd i2c_nforce2 sg ehci_pci ehci_hcd sd_mod usbcore usb_common [347336.228001] CPU: 1 PID: 13175 Comm: perf_fuzzer Not tainted 3.13.0 #126 [347336.228001] Hardware name: AOpen DE7000/nMCP7ALPx-DE R1.06 Oct.19.2012, BIOS 080015 10/19/2012 [347336.228001] task: 88086ac0 ti: 8800cb2ea000 task.ti: 8800cb2ea000 [347336.228001] RIP: 0010:[] [] generic_exec_single+0x7b/0x89 [347336.228001] RSP: 0018:8800cb2ebdc8 EFLAGS: 0202 [347336.228001] RAX: 08fb RBX: 0001 RCX: 88086ac0 [347336.228001] RDX: 0206 RSI: 00fb RDI: 0001 [347336.228001] RBP: R08: 0206 R09: 7fa82042e120 [347336.228001] R10: 7fa82042e100 R11: 0246 R12: 8800cb2ebdc8 [347336.228001] R13: 8800a5e00800 R14: 810b70e6 R15: 006a7380 [347336.228001] FS: 7fa820646700() GS:88011fc8() knlGS: [347336.228001] CS: 0010 DS: ES: CR0: 8005003b [347336.228001] CR2: 006139c0 CR3: c2ca2000 CR4: 000407e0 [347336.228001] DR0: DR1: DR2: [347336.228001] DR3: DR6: 0ff0 DR7: 0600 [347336.228001] Stack: [347336.228001] 88086ac0 810b744d 0001 [347336.228001] 8800cb2ebe68 0001 001e 81086b55 [347336.228001] 88011fc13440 88011fc13440 810b744d 8800cb2ebe68 [347336.228001] Call Trace: [347336.228001] [] ? perf_event_task_output+0x117/0x117 [347336.228001] [] ? smp_call_function_single+0xdc/0xf2 [347336.228001] [] ? perf_event_task_output+0x117/0x117 [347336.228001] [] ? task_function_call+0x42/0x4c [347336.228001] [] ? __perf_event_read+0xd8/0xd8 [347336.228001] [] ? perf_event_disable+0x6d/0xb3 [347336.228001] [] ? mutex_lock+0xd/0x2b [347336.228001] [] ? perf_event_refresh+0x2e/0x2e [347336.228001] [] ? perf_event_for_each_child+0x64/0x88 [347336.228001] [] ? perf_event_task_disable+0x3f/0x6c [347336.228001] [] ? SyS_prctl+0x140/0x339 [347336.228001] [] ? syscall_trace_leave+0x44/0xa4 [347336.228001] [] ? tracesys+0xd4/0xd9 [347336.228001] Code: 89 f7 4c 89 23 48 89 53 08 48 89 1a e8 6c cd 45 00 4d 39 e7 75 08 89 ef ff 15 31 77 99 00 45 85 ed 75 04 eb 08 f3 90 f6 43 20 01 <75> f8 5f 5b 5d 41 5c 41 5d 41 5e 41 5f c3 41 54 55 89 fd 53 48 -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
another perf_fuzzer soft lockup
After some idle time I've started running the perf_fuzzer again. It quickly found this lockup on a core2 system running 3.13 (yes, I should probably be running 3.14-git but I'd rather wait for rc1 first). It looks like it's stuck in a lock in generic_exec_single() while calling perf_event_task_output()? I'll try to see if I can come up with a test case. Vince [347336.228001] BUG: soft lockup - CPU#1 stuck for 23s! [perf_fuzzer:13175] [347336.228001] Modules linked in: cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_conservative f71882fg acpi_cpufreq mcs7830 usbnet evdev pcspkr video processor thermal_sys coretemp wmi psmouse serio_raw button ohci_pci ohci_hcd i2c_nforce2 sg ehci_pci ehci_hcd sd_mod usbcore usb_common [347336.228001] CPU: 1 PID: 13175 Comm: perf_fuzzer Not tainted 3.13.0 #126 [347336.228001] Hardware name: AOpen DE7000/nMCP7ALPx-DE R1.06 Oct.19.2012, BIOS 080015 10/19/2012 [347336.228001] task: 88086ac0 ti: 8800cb2ea000 task.ti: 8800cb2ea000 [347336.228001] RIP: 0010:[810869cc] [810869cc] generic_exec_single+0x7b/0x89 [347336.228001] RSP: 0018:8800cb2ebdc8 EFLAGS: 0202 [347336.228001] RAX: 08fb RBX: 0001 RCX: 88086ac0 [347336.228001] RDX: 0206 RSI: 00fb RDI: 0001 [347336.228001] RBP: R08: 0206 R09: 7fa82042e120 [347336.228001] R10: 7fa82042e100 R11: 0246 R12: 8800cb2ebdc8 [347336.228001] R13: 8800a5e00800 R14: 810b70e6 R15: 006a7380 [347336.228001] FS: 7fa820646700() GS:88011fc8() knlGS: [347336.228001] CS: 0010 DS: ES: CR0: 8005003b [347336.228001] CR2: 006139c0 CR3: c2ca2000 CR4: 000407e0 [347336.228001] DR0: DR1: DR2: [347336.228001] DR3: DR6: 0ff0 DR7: 0600 [347336.228001] Stack: [347336.228001] 88086ac0 810b744d 0001 [347336.228001] 8800cb2ebe68 0001 001e 81086b55 [347336.228001] 88011fc13440 88011fc13440 810b744d 8800cb2ebe68 [347336.228001] Call Trace: [347336.228001] [810b744d] ? perf_event_task_output+0x117/0x117 [347336.228001] [81086b55] ? smp_call_function_single+0xdc/0xf2 [347336.228001] [810b744d] ? perf_event_task_output+0x117/0x117 [347336.228001] [810b74c8] ? task_function_call+0x42/0x4c [347336.228001] [810b55f6] ? __perf_event_read+0xd8/0xd8 [347336.228001] [810b780e] ? perf_event_disable+0x6d/0xb3 [347336.228001] [814e1d52] ? mutex_lock+0xd/0x2b [347336.228001] [810b77a1] ? perf_event_refresh+0x2e/0x2e [347336.228001] [810b5148] ? perf_event_for_each_child+0x64/0x88 [347336.228001] [810b51ab] ? perf_event_task_disable+0x3f/0x6c [347336.228001] [81048b4b] ? SyS_prctl+0x140/0x339 [347336.228001] [8100c48d] ? syscall_trace_leave+0x44/0xa4 [347336.228001] [814e9f61] ? tracesys+0xd4/0xd9 [347336.228001] Code: 89 f7 4c 89 23 48 89 53 08 48 89 1a e8 6c cd 45 00 4d 39 e7 75 08 89 ef ff 15 31 77 99 00 45 85 ed 75 04 eb 08 f3 90 f6 43 20 01 75 f8 5f 5b 5d 41 5c 41 5d 41 5e 41 5f c3 41 54 55 89 fd 53 48 -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: another perf_fuzzer soft lockup
* Vince Weaver vincent.wea...@maine.edu wrote: After some idle time I've started running the perf_fuzzer again. It quickly found this lockup on a core2 system running 3.13 (yes, I should probably be running 3.14-git but I'd rather wait for rc1 first). JFYI, if -git looks too dangerous to you (and I cannot blame you for that view in the middle of a merge window) then tip:master is usually a pretty safe substitute for testing, it's the v3.14 perf code on a v3.13 base. It looks like it's stuck in a lock in generic_exec_single() while calling perf_event_task_output()? I'll try to see if I can come up with a test case. Thanks, those testcases are always very useful! Ingo -- To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/