https://bugs.freedesktop.org/show_bug.cgi?id=91413
--- Comment #7 from [email protected] ---
On kernel 4.2.6, I got a similar crash. The CPU locked up. The machine was doing some heavy computation on many processors. After removing the nouveau module, the problem went away.

Dec 06 18:51:50 bullseye kernel: WARNING: CPU: 0 PID: 19974 at kernel/watchdog.c:338 watchdog_overflow_callback+0x79/0xa0()
Dec 06 18:51:50 bullseye kernel: Watchdog detected hard LOCKUP on cpu 0
Dec 06 18:51:50 bullseye kernel: Modules linked in:
Dec 06 18:51:50 bullseye kernel: ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4
Dec 06 18:51:50 bullseye kernel: crc32_pclmul crc32c_intel drm serio_raw uas usb_storage hpsa ata_generic pata_acpi wmi
Dec 06 18:51:50 bullseye kernel: CPU: 0 PID: 19974 Comm: redacted Not tainted 4.2.6-201.fc22.x86_64 #1
Dec 06 18:51:50 bullseye kernel: Hardware name: HP ProLiant DL360p Gen8, BIOS P71 07/01/2015
Dec 06 18:51:50 bullseye kernel: 0000000000000000 000000009a975639 ffff880fff605aa0 ffffffff817729ea
Dec 06 18:51:50 bullseye kernel: 0000000000000000 ffff880fff605af8 ffff880fff605ae0 ffffffff8109e4b6
Dec 06 18:51:50 bullseye kernel: 0000000000000000 ffff881ff92ae000 0000000000000000 ffff880fff605c00
Dec 06 18:51:50 bullseye kernel: Call Trace:
Dec 06 18:51:50 bullseye kernel: <NMI> [<ffffffff817729ea>] dump_stack+0x45/0x57
Dec 06 18:51:50 bullseye kernel: [<ffffffff8109e4b6>] warn_slowpath_common+0x86/0xc0
Dec 06 18:51:50 bullseye kernel: [<ffffffff8109e545>] warn_slowpath_fmt+0x55/0x70
Dec 06 18:51:50 bullseye kernel: [<ffffffff81153029>] watchdog_overflow_callback+0x79/0xa0
Dec 06 18:51:50 bullseye kernel: [<ffffffff81197f50>] __perf_event_overflow+0x90/0x1c0
Dec 06 18:51:50 bullseye kernel: [<ffffffff81198b54>] perf_event_overflow+0x14/0x20
Dec 06 18:51:50 bullseye kernel: [<ffffffff81033af7>] intel_pmu_handle_irq+0x1e7/0x470
Dec 06 18:51:50 bullseye kernel: [<ffffffff8102a3e6>] perf_event_nmi_handler+0x26/0x40
Dec 06 18:51:50 bullseye kernel: [<ffffffff810188b3>] nmi_handle+0x83/0x120
Dec 06 18:51:50 bullseye kernel: [<ffffffff81018df2>] default_do_nmi+0x42/0xf0
Dec 06 18:51:50 bullseye kernel: [<ffffffff81018f8a>] do_nmi+0xea/0x140
Dec 06 18:51:50 bullseye kernel: [<ffffffff8177b701>] end_repeat_nmi+0x1a/0x1e
Dec 06 18:51:50 bullseye kernel: [<ffffffff810e677c>] ? queued_spin_lock_slowpath+0x15c/0x170
Dec 06 18:51:50 bullseye kernel: [<ffffffff810e677c>] ? queued_spin_lock_slowpath+0x15c/0x170
Dec 06 18:51:50 bullseye kernel: [<ffffffff810e677c>] ? queued_spin_lock_slowpath+0x15c/0x170
Dec 06 18:51:50 bullseye kernel: <<EOE>> <IRQ> [<ffffffff817791df>] _raw_spin_lock_irqsave+0x3f/0x50
Dec 06 18:51:50 bullseye kernel: [<ffffffffa0218cbe>] nvkm_fantog_update+0x4e/0x120 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa0218de5>] nvkm_fantog_set+0x35/0x40 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa02182cc>] nvkm_fan_update+0xec/0x1e0 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa02183f9>] nvkm_therm_fan_set+0x19/0x20 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa0217bfc>] nvkm_therm_update+0x11c/0x2d0 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa0217dca>] nvkm_therm_alarm+0x1a/0x20 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa021b472>] nv04_timer_alarm_trigger+0x122/0x170 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa021b521>] nv04_timer_alarm+0x61/0xc0 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa0218d82>] nvkm_fantog_update+0x112/0x120 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa0218daa>] nvkm_fantog_alarm+0x1a/0x20 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa021b472>] nv04_timer_alarm_trigger+0x122/0x170 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa021b5e2>] nv04_timer_intr+0x62/0x80 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffffa021211b>] nvkm_mc_intr+0xfb/0x170 [nouveau]
Dec 06 18:51:50 bullseye kernel: [<ffffffff810f5ff4>] handle_irq_event_percpu+0x74/0x180
Dec 06 18:51:50 bullseye kernel: [<ffffffff810f6130>] handle_irq_event+0x30/0x60
Dec 06 18:51:50 bullseye kernel: [<ffffffff810f944f>] handle_edge_irq+0x6f/0x130
Dec 06 18:51:50 bullseye kernel: [<ffffffff81016e62>] handle_irq+0x72/0x120
Dec 06 18:51:50 bullseye kernel: [<ffffffff8177c01f>] do_IRQ+0x4f/0xe0
Dec 06 18:51:50 bullseye kernel: [<ffffffff81779f2b>] common_interrupt+0x6b/0x6b
Dec 06 18:51:50 bullseye kernel: <EOI>
Dec 06 18:51:50 bullseye kernel: ---[ end trace c72347df4d25d0c7 ]---
Dec 06 18:51:50 bullseye kernel: INFO: rcu_sched detected stalls on CPUs/tasks: { 0} (detected by 17, t=60002 jiffies, g=184581, c=184580, q=0)
Dec 06 18:51:50 bullseye kernel: Task dump for CPU 0:
Dec 06 18:51:50 bullseye kernel: redacted R running task 0 19974 3145 0x00000008
Dec 06 18:51:50 bullseye kernel: ffff8819589e7b98 ffff881ff7e69800 ffffffffffffff74 0000000000000028
Dec 06 18:51:50 bullseye kernel: 0000000000000000 0000000000000020 ffff8819589e7b98 ffffffff8164869c
Dec 06 18:51:50 bullseye kernel: 0000000000000000 ffff881ff7e69800 ffff8819589e7bc8 ffffffff816c1c80
Dec 06 18:51:50 bullseye kernel: Call Trace:
Dec 06 18:51:50 bullseye kernel: [<ffffffff8164869c>] ? sk_reset_timer+0x1c/0x30
Dec 06 18:51:50 bullseye kernel: [<ffffffff816c1c80>] tcp_send_delayed_ack+0x100/0x130
Dec 06 18:51:50 bullseye kernel: [<ffffffff816b36cd>] __tcp_ack_snd_check+0x6d/0x90
Dec 06 18:51:50 bullseye kernel: [<ffffffff816bb4fc>] tcp_rcv_established+0x4cc/0x780
Dec 06 18:51:50 bullseye kernel: [<ffffffff813aeac9>] ? copy_to_iter+0x79/0x260
Dec 06 18:51:50 bullseye kernel: [<ffffffff81778f2a>] ? _raw_write_unlock_bh+0x1a/0x20
Dec 06 18:51:50 bullseye kernel: [<ffffffff81778f3e>] ? _raw_spin_unlock_bh+0xe/0x10
Dec 06 18:51:50 bullseye kernel: [<ffffffff81648f66>] ? release_sock+0x106/0x150
Dec 06 18:51:50 bullseye kernel: [<ffffffff810d28af>] ? numa_migrate_preferred+0x2f/0x90
Dec 06 18:51:50 bullseye kernel: [<ffffffff810d686d>] ? task_numa_fault+0x7bd/0xae0
Dec 06 18:51:50 bullseye kernel: [<ffffffff811d5f81>] ? handle_mm_fault+0xb81/0x17d0
Dec 06 18:51:50 bullseye kernel: [<ffffffff816447cb>] ? sock_recvmsg+0x3b/0x50
Dec 06 18:51:50 bullseye kernel: [<ffffffff81644a16>] ? SYSC_recvfrom+0xd6/0x150
Dec 06 18:51:50 bullseye kernel: [<ffffffff81065454>] ? __do_page_fault+0x1b4/0x400
Dec 06 18:51:50 bullseye kernel: [<ffffffff81774d11>] ? __schedule+0x371/0x950
Dec 06 18:51:50 bullseye kernel: [<ffffffff810656cf>] ? do_page_fault+0x2f/0x80
Dec 06 18:51:50 bullseye kernel: [<ffffffff8177b378>] ? page_fault+0x28/0x30

--
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Nouveau mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/nouveau
