Okay. On a different machine with different load, we've now got the same problem:
KERNEL: /usr/lib/debug/boot/vmlinux-4.4.0-78-generic DUMPFILE: dump.201705300948 [PARTIAL DUMP] CPUS: 4 DATE: Tue May 30 09:48:05 2017 UPTIME: 01:47:38 LOAD AVERAGE: 0.05, 0.06, 0.04 TASKS: 292 NODENAME: ossoio-docker1-tcn RELEASE: 4.4.0-78-generic VERSION: #99-Ubuntu SMP Thu Apr 27 15:29:09 UTC 2017 MACHINE: x86_64 (2199 Mhz) MEMORY: 4 GB PANIC: "BUG: unable to handle kernel paging request at ffff88013a404000" PID: 0 COMMAND: "swapper/3" TASK: ffff88013abf1980 (1 of 4) [THREAD_INFO: ffff88013a400000] CPU: 3 STATE: TASK_RUNNING (PANIC) #9 [ffff88013a403e20] async_page_fault at ffffffff81842be8 #10 [ffff88013a403e38] tick_nohz_idle_exit at ffffffff810ff75e #11 [ffff88013a403ed8] cpu_startup_entry at ffffffff810c4736 #12 [ffff88013a403f30] start_secondary at ffffffff810517c4 Differences: - this machine does not use zfs, the other one does - this machine runs docker instances, the other one mainly mysqld - this machines has x2apic enabled according to dmesg (no idea what that is) Similarities: - both are KVM guests (same KVM cluster, different nodes) - the two KVM nodes have the same hardware, same kernel and same KVM host software - 4 cpus, 2200MHz, flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx lm constant_tsc nopl xtopology pni cx16 x2apic hypervisor lahf_lm - 512MB swap Modules loaded: - both: 8250_fintek autofs4 drm drm_kms_helper fb_sys_fops floppy i2c_piix4 input_leds mac_hid parport parport_pc pata_acpi ppdev psmouse serio_raw shpchp syscopyarea sysfillrect sysimgblt ttm - this: aufs bridge br_netfilter ip6table_filter ip6_tables iptable_filter iptable_nat ip_tables ipt_MASQUERADE llc nf_conntrack nf_conntrack_ipv4 nf_conntrack_netlink nf_defrag_ipv4 nf_nat nf_nat_ipv4 nf_nat_masquerade_ipv4 nfnetlink stp veth xfrm_algo xfrm_user x_tables xt_addrtype xt_conntrack xt_nat xt_tcpudp - other: spl(O) zavl(PO) zcommon(PO) zfs(PO) znvpair(PO) zunicode(PO) We should be able to run this node on the vanilla kernel and see how that goes. Will report back in a bit. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1691741 Title: Execute NX-protected page - 4.4.0-78-generic - kernel panic Status in linux package in Ubuntu: Confirmed Bug description: After upgrading from 4.4.0-77 to 4.4.0-78 I started getting kernel panics. The crashes do not happen immediately, but have happened generally after a couple of minutes, sometimes more. After enabling linux-crashdump stuff, I managed to extract this dmesg. [ 995.103846] kernel tried to execute NX-protected page - exploit attempt? (uid: 0) [ 995.104141] BUG: unable to handle kernel paging request at ffff88042a284000 [ 995.104407] IP: [<ffff88042a284000>] 0xffff88042a284000 [ 995.104594] PGD 43f20b067 PUD 43f20e067 PMD 42a3da063 PTE 800000042a284163 [ 995.104946] Oops: 0011 [#1] SMP [ 995.105143] Modules linked in: zfs(PO) zunicode(PO) zcommon(PO) znvpair(PO) spl(O) zavl(PO) ppdev input_leds shpchp serio_raw i2c_piix4 mac_hid parport_pc parport 8250_fintek autofs4 ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm psmouse floppy pata_acpi [ 995.107081] CPU: 1 PID: 0 Comm: swapper/1 Tainted: P O 4.4.0-78-generic #99-Ubuntu [ 995.107299] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.9.3-0-ge2fc41e-prebuilt.qemu-project.org 04/01/2014 [ 995.107573] task: ffff88042a278000 ti: ffff88042a280000 task.ti: ffff88042a280000 [ 995.108070] RIP: 0010:[<ffff88042a284000>] [<ffff88042a284000>] 0xffff88042a284000 [ 995.108637] RSP: 0018:ffff88042a283ed0 EFLAGS: 00010082 [ 995.109116] RAX: 0000000000000001 RBX: 000000e797438af0 RCX: 0000000000000000 [ 995.109638] RDX: 0000000000000001 RSI: 0000000000000083 RDI: 0000000000000083 [ 995.110143] RBP: ffffffff81f38d40 R08: 000000000000000a R09: 0000000000000000 [ 995.110665] R10: 000000010002a665 R11: 0000000000004c00 R12: ffff88042a283ed0 [ 995.111182] R13: ffffffff810ff75e R14: 0000000000000000 R15: ffff88042a280000 [ 995.111733] FS: 0000000000000000(0000) GS:ffff88043fc80000(0000) knlGS:0000000000000000 [ 995.112486] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [ 995.112978] CR2: ffff88042a284000 CR3: 000000043d246000 CR4: 00000000000006e0 [ 995.113497] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 995.114085] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [ 995.114612] Stack: [ 995.114965] ffff88042a283f28 ffffffff810c4736 ffff88042a280000 ffff88042a284000 [ 995.116204] ee041b0196f77cc4 a1abbcd2b8b123ce 0000000000000000 0000000000000000 [ 995.117389] 0000000000000000 0000000000000000 0000000000000000 ffff88042a283f48 [ 995.118425] Call Trace: [ 995.118811] [<ffffffff810c4736>] ? cpu_startup_entry+0x176/0x350 [ 995.119293] [<ffffffff810517c4>] ? start_secondary+0x154/0x190 [ 995.119775] Code: ff ff ff 00 00 00 00 00 00 00 00 10 00 00 00 00 00 00 00 02 02 00 00 00 00 00 00 58 3f 28 2a 04 88 ff ff 18 00 00 00 00 00 00 00 <c0> 8c 27 2a 04 88 ff ff 00 00 00 00 00 00 00 00 02 00 00 00 00 [ 995.125554] RIP [<ffff88042a284000>] 0xffff88042a284000 [ 995.126088] RSP <ffff88042a283ed0> [ 995.126453] CR2: ffff88042a284000 I've upgraded other machines as well, and only this particular VM shows this behaviour. I have a crash dump, but I haven't looked into the contents yet. Getting the dmesg was already a pain in the behind. The VM this happens on is: - a KVM guest - x86_64, 4 cores - 16gb ram lsb_release: Distributor ID: Ubuntu Description: Ubuntu 16.04.2 LTS Release: 16.04 Codename: xenial lspci says: 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] (rev 02) 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 00:01.1 IDE interface: Intel Corporation 82371SB PIIX3 IDE [Natoma/Triton II] 00:01.2 USB controller: Intel Corporation 82371SB PIIX3 USB [Natoma/Triton II] (rev 01) 00:01.3 Bridge: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 03) 00:02.0 VGA compatible controller: VMware SVGA II Adapter 00:03.0 Unclassified device [00ff]: Red Hat, Inc Virtio memory balloon 00:0a.0 SCSI storage controller: Red Hat, Inc Virtio block device 00:0b.0 SCSI storage controller: Red Hat, Inc Virtio block device 00:12.0 Ethernet controller: Red Hat, Inc Virtio network device 00:1e.0 PCI bridge: Red Hat, Inc. QEMU PCI-PCI bridge 00:1f.0 PCI bridge: Red Hat, Inc. QEMU PCI-PCI bridge Let me know if there are other helpful details I can provide. If I find out more, I'll update this ticket. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1691741/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp