I found something interesting on my previous boot logs, since I had to force shutdown.

The error is follows (from journalctl -b -1):

```

Jul 01 23:10:48 matrix-central kernel: nvidia-modeset: WARNING: GPU:0: Lost display notification; continuing. Jul 01 23:12:32 matrix-central kernel: nvidia-modeset: ERROR: GPU:0: Idling EVO timed out: 0x0000957d:0:0:0x00000040 Jul 01 23:12:55 matrix-central kernel: INFO: rcu_sched self-detected stall on CPU Jul 01 23:12:55 matrix-central kernel: 7: (21000 ticks this GP) idle=be3/140000000000001/0 softirq=4499/4499 fqs=7000 Jul 01 23:12:55 matrix-central kernel: (t=21000 jiffies g=12925 c=12924 q=135)
Jul 01 23:12:55 matrix-central kernel: Task dump for CPU 7:
Jul 01 23:12:55 matrix-central kernel: X R running task 0 2997 2969 0x00400088 Jul 01 23:12:55 matrix-central kernel: ffffffff8184e340 ffff88081edc3d68 ffffffff8109892f 0000000000000007 Jul 01 23:12:55 matrix-central kernel: ffffffff8184e340 ffff88081edc3d88 ffffffff8109be6d ffff88081edc3dc8 Jul 01 23:12:55 matrix-central kernel: 0000000000000008 ffff88081edc3db8 ffffffff810c4180 ffff88081edd6a40
Jul 01 23:12:55 matrix-central kernel: Call Trace:
Jul 01 23:12:55 matrix-central kernel: <IRQ> [<ffffffff8109892f>] sched_show_task+0xaf/0x120 Jul 01 23:12:55 matrix-central kernel: [<ffffffff8109be6d>] dump_cpu_task+0x3d/0x50 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810c4180>] rcu_dump_cpu_stacks+0x90/0xe0 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810c794e>] rcu_check_callbacks+0x53e/0x830 Jul 01 23:12:55 matrix-central kernel: [<ffffffff8104ae5a>] ? native_smp_send_reschedule+0x4a/0x70 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810cc629>] update_process_times+0x39/0x60 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810dc396>] tick_sched_handle.isra.15+0x36/0x50 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810dc3f4>] tick_sched_timer+0x44/0x80 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810cd285>] __run_hrtimer+0x85/0x240 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810dc3b0>] ? tick_sched_handle.isra.15+0x50/0x50 Jul 01 23:12:55 matrix-central kernel: [<ffffffff810cd7c3>] hrtimer_interrupt+0xf3/0x210 Jul 01 23:12:55 matrix-central kernel: [<ffffffff8104d51c>] local_apic_timer_interrupt+0x3c/0x70 Jul 01 23:12:55 matrix-central kernel: [<ffffffff81502b01>] smp_apic_timer_interrupt+0x41/0x60 Jul 01 23:12:55 matrix-central kernel: [<ffffffff81500b7e>] apic_timer_interrupt+0x6e/0x80 Jul 01 23:12:55 matrix-central kernel: <EOI> [<ffffffff8101dd99>] ? read_tsc+0x9/0x10 Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e90600>] ? _nv001840kms+0xd0/0x120 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e905e2>] ? _nv001840kms+0xb2/0x120 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e86b83>] ? _nv001737kms+0xa3/0x110 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1ea238e>] ? _nv001964kms+0x145e/0x2440 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa109d08d>] ? _nv009724rm+0xd/0x40 [nvidia] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e704bd>] ? nvkms_alloc+0x5d/0x70 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e71400>] ? _nv000171kms+0x40/0x40 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e7142e>] ? _nv000327kms+0x2e/0x40 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e727f1>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e70cd5>] ? nvkms_ioctl_common+0x45/0x90 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e70d82>] ? nvkms_ioctl+0x62/0x90 [nvidia_modeset] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa0bf3080>] ? nvidia_frontend_compat_ioctl+0x40/0x50 [nvidia] Jul 01 23:12:55 matrix-central kernel: [<ffffffffa0bf309e>] ? nvidia_frontend_unlocked_ioctl+0xe/0x10 [nvidia] Jul 01 23:12:55 matrix-central kernel: [<ffffffff811d4586>] ? do_vfs_ioctl+0x2a6/0x4b0 Jul 01 23:12:55 matrix-central kernel: [<ffffffff811c2425>] ? __sb_end_write+0x35/0x70 Jul 01 23:12:55 matrix-central kernel: [<ffffffff8110344c>] ? __audit_syscall_entry+0xac/0x100 Jul 01 23:12:55 matrix-central kernel: [<ffffffff81022c95>] ? do_audit_syscall_entry+0x55/0x80 Jul 01 23:12:55 matrix-central kernel: [<ffffffff811d4811>] ? SyS_ioctl+0x81/0xa0 Jul 01 23:12:55 matrix-central kernel: [<ffffffff8102422e>] ? syscall_trace_leave+0x9e/0x120 Jul 01 23:12:55 matrix-central kernel: [<ffffffff814ffd2e>] ? system_call_fastpath+0x12/0x71 Jul 01 23:13:58 matrix-central kernel: INFO: rcu_sched self-detected stall on CPU Jul 01 23:13:58 matrix-central kernel: 7: (84004 ticks this GP) idle=be3/140000000000001/0 softirq=4499/4499 fqs=27997 Jul 01 23:13:58 matrix-central kernel: (t=84004 jiffies g=12925 c=12924 q=594)
Jul 01 23:13:58 matrix-central kernel: Task dump for CPU 7:
Jul 01 23:13:58 matrix-central kernel: X R running task 0 2997 2969 0x00400088 Jul 01 23:13:58 matrix-central kernel: ffffffff8184e340 ffff88081edc3d68 ffffffff8109892f 0000000000000007 Jul 01 23:13:58 matrix-central kernel: ffffffff8184e340 ffff88081edc3d88 ffffffff8109be6d ffff88081edc3dc8 Jul 01 23:13:58 matrix-central kernel: 0000000000000008 ffff88081edc3db8 ffffffff810c4180 ffff88081edd6a40
Jul 01 23:13:58 matrix-central kernel: Call Trace:
Jul 01 23:13:58 matrix-central kernel: <IRQ> [<ffffffff8109892f>] sched_show_task+0xaf/0x120 Jul 01 23:13:58 matrix-central kernel: [<ffffffff8109be6d>] dump_cpu_task+0x3d/0x50 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810c4180>] rcu_dump_cpu_stacks+0x90/0xe0 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810c794e>] rcu_check_callbacks+0x53e/0x830 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810cc629>] update_process_times+0x39/0x60 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810dc396>] tick_sched_handle.isra.15+0x36/0x50 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810dc3f4>] tick_sched_timer+0x44/0x80 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810cd285>] __run_hrtimer+0x85/0x240 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810dc3b0>] ? tick_sched_handle.isra.15+0x50/0x50 Jul 01 23:13:58 matrix-central kernel: [<ffffffff810cd7c3>] hrtimer_interrupt+0xf3/0x210 Jul 01 23:13:58 matrix-central kernel: [<ffffffff8104d51c>] local_apic_timer_interrupt+0x3c/0x70 Jul 01 23:13:58 matrix-central kernel: [<ffffffff81502b01>] smp_apic_timer_interrupt+0x41/0x60 Jul 01 23:13:58 matrix-central kernel: [<ffffffff81500b7e>] apic_timer_interrupt+0x6e/0x80 Jul 01 23:13:58 matrix-central kernel: <EOI> [<ffffffff8101dd99>] ? read_tsc+0x9/0x10 Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e90600>] ? _nv001840kms+0xd0/0x120 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e905e2>] ? _nv001840kms+0xb2/0x120 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e86b83>] ? _nv001737kms+0xa3/0x110 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1ea238e>] ? _nv001964kms+0x145e/0x2440 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa109d08d>] ? _nv009724rm+0xd/0x40 [nvidia] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e704bd>] ? nvkms_alloc+0x5d/0x70 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e71400>] ? _nv000171kms+0x40/0x40 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e7142e>] ? _nv000327kms+0x2e/0x40 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e727f1>] ? nvKmsIoctl+0x161/0x1e0 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e70cd5>] ? nvkms_ioctl_common+0x45/0x90 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e70d82>] ? nvkms_ioctl+0x62/0x90 [nvidia_modeset] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa0bf3080>] ? nvidia_frontend_compat_ioctl+0x40/0x50 [nvidia] Jul 01 23:13:58 matrix-central kernel: [<ffffffffa0bf309e>] ? nvidia_frontend_unlocked_ioctl+0xe/0x10 [nvidia] Jul 01 23:13:58 matrix-central kernel: [<ffffffff811d4586>] ? do_vfs_ioctl+0x2a6/0x4b0 Jul 01 23:13:58 matrix-central kernel: [<ffffffff811c2425>] ? __sb_end_write+0x35/0x70 Jul 01 23:13:58 matrix-central kernel: [<ffffffff8110344c>] ? __audit_syscall_entry+0xac/0x100 Jul 01 23:13:58 matrix-central kernel: [<ffffffff81022c95>] ? do_audit_syscall_entry+0x55/0x80 Jul 01 23:13:58 matrix-central kernel: [<ffffffff811d4811>] ? SyS_ioctl+0x81/0xa0 Jul 01 23:13:58 matrix-central kernel: [<ffffffff8102422e>] ? syscall_trace_leave+0x9e/0x120 Jul 01 23:13:58 matrix-central kernel: [<ffffffff814ffd2e>] ? system_call_fastpath+0x12/0x71

```

Looks like something to do with stalling and CPU 7 and NVIDIA?

Thanks,
Roger


On 1/07/2016 11:15 PM, Bjørn Forsman wrote:
On 1 July 2016 at 15:10, Roger Qiu <roger....@matrix.ai> wrote:
Hi Nixers,

I discovered this problem a while ago, basically upon closing the lid and
reopening it after 10 min, the screen stays blank, and I can't do anything.
Even switching to tty1 doesn't work.

This doesn't always happen. And when I close the lid and reopen after a few
seconds, it usually works and displays something.

What are the steps that I can take to debug this issue.

It was happening back on NixOS 15.09. and is continuing on 16.03.

I'm running NVIDIA and using NVIDIA video card driver.
Did you check the system log?

- Bjørn

--
Founder of Matrix AI
https://matrix.ai/
+61420925975

_______________________________________________
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev

Reply via email to