I found something interesting in the logs from my previous boot, which
ended with a forced shutdown.
The error is as follows (from journalctl -b -1):
```
Jul 01 23:10:48 matrix-central kernel: nvidia-modeset: WARNING: GPU:0:
Lost display notification; continuing.
Jul 01 23:12:32 matrix-central kernel: nvidia-modeset: ERROR: GPU:0:
Idling EVO timed out: 0x0000957d:0:0:0x00000040
Jul 01 23:12:55 matrix-central kernel: INFO: rcu_sched self-detected
stall on CPU
Jul 01 23:12:55 matrix-central kernel: 7: (21000 ticks this GP)
idle=be3/140000000000001/0 softirq=4499/4499 fqs=7000
Jul 01 23:12:55 matrix-central kernel: (t=21000 jiffies g=12925
c=12924 q=135)
Jul 01 23:12:55 matrix-central kernel: Task dump for CPU 7:
Jul 01 23:12:55 matrix-central kernel: X R running
task 0 2997 2969 0x00400088
Jul 01 23:12:55 matrix-central kernel: ffffffff8184e340
ffff88081edc3d68 ffffffff8109892f 0000000000000007
Jul 01 23:12:55 matrix-central kernel: ffffffff8184e340
ffff88081edc3d88 ffffffff8109be6d ffff88081edc3dc8
Jul 01 23:12:55 matrix-central kernel: 0000000000000008
ffff88081edc3db8 ffffffff810c4180 ffff88081edd6a40
Jul 01 23:12:55 matrix-central kernel: Call Trace:
Jul 01 23:12:55 matrix-central kernel: <IRQ> [<ffffffff8109892f>]
sched_show_task+0xaf/0x120
Jul 01 23:12:55 matrix-central kernel: [<ffffffff8109be6d>]
dump_cpu_task+0x3d/0x50
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810c4180>]
rcu_dump_cpu_stacks+0x90/0xe0
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810c794e>]
rcu_check_callbacks+0x53e/0x830
Jul 01 23:12:55 matrix-central kernel: [<ffffffff8104ae5a>] ?
native_smp_send_reschedule+0x4a/0x70
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810cc629>]
update_process_times+0x39/0x60
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810dc396>]
tick_sched_handle.isra.15+0x36/0x50
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810dc3f4>]
tick_sched_timer+0x44/0x80
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810cd285>]
__run_hrtimer+0x85/0x240
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810dc3b0>] ?
tick_sched_handle.isra.15+0x50/0x50
Jul 01 23:12:55 matrix-central kernel: [<ffffffff810cd7c3>]
hrtimer_interrupt+0xf3/0x210
Jul 01 23:12:55 matrix-central kernel: [<ffffffff8104d51c>]
local_apic_timer_interrupt+0x3c/0x70
Jul 01 23:12:55 matrix-central kernel: [<ffffffff81502b01>]
smp_apic_timer_interrupt+0x41/0x60
Jul 01 23:12:55 matrix-central kernel: [<ffffffff81500b7e>]
apic_timer_interrupt+0x6e/0x80
Jul 01 23:12:55 matrix-central kernel: <EOI> [<ffffffff8101dd99>] ?
read_tsc+0x9/0x10
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e90600>] ?
_nv001840kms+0xd0/0x120 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e905e2>] ?
_nv001840kms+0xb2/0x120 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e86b83>] ?
_nv001737kms+0xa3/0x110 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1ea238e>] ?
_nv001964kms+0x145e/0x2440 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa109d08d>] ?
_nv009724rm+0xd/0x40 [nvidia]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e704bd>] ?
nvkms_alloc+0x5d/0x70 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e71400>] ?
_nv000171kms+0x40/0x40 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e7142e>] ?
_nv000327kms+0x2e/0x40 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e727f1>] ?
nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e70cd5>] ?
nvkms_ioctl_common+0x45/0x90 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e70d82>] ?
nvkms_ioctl+0x62/0x90 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa0bf3080>] ?
nvidia_frontend_compat_ioctl+0x40/0x50 [nvidia]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa0bf309e>] ?
nvidia_frontend_unlocked_ioctl+0xe/0x10 [nvidia]
Jul 01 23:12:55 matrix-central kernel: [<ffffffff811d4586>] ?
do_vfs_ioctl+0x2a6/0x4b0
Jul 01 23:12:55 matrix-central kernel: [<ffffffff811c2425>] ?
__sb_end_write+0x35/0x70
Jul 01 23:12:55 matrix-central kernel: [<ffffffff8110344c>] ?
__audit_syscall_entry+0xac/0x100
Jul 01 23:12:55 matrix-central kernel: [<ffffffff81022c95>] ?
do_audit_syscall_entry+0x55/0x80
Jul 01 23:12:55 matrix-central kernel: [<ffffffff811d4811>] ?
SyS_ioctl+0x81/0xa0
Jul 01 23:12:55 matrix-central kernel: [<ffffffff8102422e>] ?
syscall_trace_leave+0x9e/0x120
Jul 01 23:12:55 matrix-central kernel: [<ffffffff814ffd2e>] ?
system_call_fastpath+0x12/0x71
Jul 01 23:13:58 matrix-central kernel: INFO: rcu_sched self-detected
stall on CPU
Jul 01 23:13:58 matrix-central kernel: 7: (84004 ticks this GP)
idle=be3/140000000000001/0 softirq=4499/4499 fqs=27997
Jul 01 23:13:58 matrix-central kernel: (t=84004 jiffies g=12925
c=12924 q=594)
Jul 01 23:13:58 matrix-central kernel: Task dump for CPU 7:
Jul 01 23:13:58 matrix-central kernel: X R running
task 0 2997 2969 0x00400088
Jul 01 23:13:58 matrix-central kernel: ffffffff8184e340
ffff88081edc3d68 ffffffff8109892f 0000000000000007
Jul 01 23:13:58 matrix-central kernel: ffffffff8184e340
ffff88081edc3d88 ffffffff8109be6d ffff88081edc3dc8
Jul 01 23:13:58 matrix-central kernel: 0000000000000008
ffff88081edc3db8 ffffffff810c4180 ffff88081edd6a40
Jul 01 23:13:58 matrix-central kernel: Call Trace:
Jul 01 23:13:58 matrix-central kernel: <IRQ> [<ffffffff8109892f>]
sched_show_task+0xaf/0x120
Jul 01 23:13:58 matrix-central kernel: [<ffffffff8109be6d>]
dump_cpu_task+0x3d/0x50
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810c4180>]
rcu_dump_cpu_stacks+0x90/0xe0
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810c794e>]
rcu_check_callbacks+0x53e/0x830
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810cc629>]
update_process_times+0x39/0x60
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810dc396>]
tick_sched_handle.isra.15+0x36/0x50
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810dc3f4>]
tick_sched_timer+0x44/0x80
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810cd285>]
__run_hrtimer+0x85/0x240
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810dc3b0>] ?
tick_sched_handle.isra.15+0x50/0x50
Jul 01 23:13:58 matrix-central kernel: [<ffffffff810cd7c3>]
hrtimer_interrupt+0xf3/0x210
Jul 01 23:13:58 matrix-central kernel: [<ffffffff8104d51c>]
local_apic_timer_interrupt+0x3c/0x70
Jul 01 23:13:58 matrix-central kernel: [<ffffffff81502b01>]
smp_apic_timer_interrupt+0x41/0x60
Jul 01 23:13:58 matrix-central kernel: [<ffffffff81500b7e>]
apic_timer_interrupt+0x6e/0x80
Jul 01 23:13:58 matrix-central kernel: <EOI> [<ffffffff8101dd99>] ?
read_tsc+0x9/0x10
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e90600>] ?
_nv001840kms+0xd0/0x120 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e905e2>] ?
_nv001840kms+0xb2/0x120 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e86b83>] ?
_nv001737kms+0xa3/0x110 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1ea238e>] ?
_nv001964kms+0x145e/0x2440 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa109d08d>] ?
_nv009724rm+0xd/0x40 [nvidia]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e704bd>] ?
nvkms_alloc+0x5d/0x70 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e71400>] ?
_nv000171kms+0x40/0x40 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e7142e>] ?
_nv000327kms+0x2e/0x40 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e727f1>] ?
nvKmsIoctl+0x161/0x1e0 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e70cd5>] ?
nvkms_ioctl_common+0x45/0x90 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa1e70d82>] ?
nvkms_ioctl+0x62/0x90 [nvidia_modeset]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa0bf3080>] ?
nvidia_frontend_compat_ioctl+0x40/0x50 [nvidia]
Jul 01 23:13:58 matrix-central kernel: [<ffffffffa0bf309e>] ?
nvidia_frontend_unlocked_ioctl+0xe/0x10 [nvidia]
Jul 01 23:13:58 matrix-central kernel: [<ffffffff811d4586>] ?
do_vfs_ioctl+0x2a6/0x4b0
Jul 01 23:13:58 matrix-central kernel: [<ffffffff811c2425>] ?
__sb_end_write+0x35/0x70
Jul 01 23:13:58 matrix-central kernel: [<ffffffff8110344c>] ?
__audit_syscall_entry+0xac/0x100
Jul 01 23:13:58 matrix-central kernel: [<ffffffff81022c95>] ?
do_audit_syscall_entry+0x55/0x80
Jul 01 23:13:58 matrix-central kernel: [<ffffffff811d4811>] ?
SyS_ioctl+0x81/0xa0
Jul 01 23:13:58 matrix-central kernel: [<ffffffff8102422e>] ?
syscall_trace_leave+0x9e/0x120
Jul 01 23:13:58 matrix-central kernel: [<ffffffff814ffd2e>] ?
system_call_fastpath+0x12/0x71
```
It looks like an RCU stall on CPU 7 while the X process (PID 2997) was
inside an nvidia_modeset ioctl, so the NVIDIA driver seems to be involved?
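In case it helps with triaging logs like this, here is a minimal Python sketch that counts the RCU stall reports, the CPUs named in the task dumps, and how often each NVIDIA module shows up in the traces. The names summarize and SAMPLE are my own, purely for illustration; SAMPLE is a handful of lines copied from the log above, and the patterns are only assumed to match this particular kernel's message format.

```python
import re
from collections import Counter

# Patterns are based only on the messages seen in the log above.
STALL_RE = re.compile(r"rcu_sched self-detected\s+stall on CPU")
TASK_DUMP_RE = re.compile(r"Task dump for CPU (\d+):")
MODULE_RE = re.compile(r"\[(nvidia\w*)\]")

# A few lines copied from the journal excerpt (wrapped as in the mail).
SAMPLE = """\
Jul 01 23:12:55 matrix-central kernel: INFO: rcu_sched self-detected
stall on CPU
Jul 01 23:12:55 matrix-central kernel: Task dump for CPU 7:
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa1e90600>] ?
_nv001840kms+0xd0/0x120 [nvidia_modeset]
Jul 01 23:12:55 matrix-central kernel: [<ffffffffa109d08d>] ?
_nv009724rm+0xd/0x40 [nvidia]
"""


def summarize(log_text):
    """Count stall reports, stalled CPUs, and NVIDIA module frames."""
    # Mail clients wrap long journal lines, so collapse all whitespace
    # before matching; otherwise a pattern can be split across two lines.
    flat = " ".join(log_text.split())
    return {
        "stalls": len(STALL_RE.findall(flat)),
        "cpus": sorted({int(cpu) for cpu in TASK_DUMP_RE.findall(flat)}),
        "modules": dict(Counter(MODULE_RE.findall(flat))),
    }


print(summarize(SAMPLE))
```

Feeding it the full output of journalctl -b -1 instead of SAMPLE should give the same kind of summary for the whole boot. Frames marked with "?" in the trace are not reliable, so the module counts are only a rough hint at where the time was spent.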
Thanks,
Roger
On 1/07/2016 11:15 PM, Bjørn Forsman wrote:
> On 1 July 2016 at 15:10, Roger Qiu <roger....@matrix.ai> wrote:
>> Hi Nixers,
>> I discovered this problem a while ago: when I close the lid and
>> reopen it after about 10 minutes, the screen stays blank and I can't do
>> anything. Even switching to tty1 doesn't work.
>> This doesn't always happen. When I close the lid and reopen it after a
>> few seconds, it usually works and displays something.
>> What steps can I take to debug this issue?
>> It was happening back on NixOS 15.09 and continues on 16.03.
>> I have an NVIDIA video card and am using the NVIDIA driver.
> Did you check the system log?
> - Bjørn
--
Founder of Matrix AI
https://matrix.ai/
+61420925975