------- Comment From [email protected] 2017-09-13 13:50 EDT-------
(In reply to comment #36)
> Hi
>
> Today I tested  kdump with 16.10 on talclp3
> Access info :
> HMC: hmc-lte2.isst.aus.stglabs.ibm.com   (hscroot/abc123)
>
> Console Access: rmvterm -m talc -p talclp3;mkvterm -m talc -p talclp3;
>
> Logs:
>
> root@talclp3:~# echo c > /proc/sysrq-trigger
> [  424.180480] sysrq: SysRq : Trigger a crash
> [  424.180497] Unable to handle kernel paging request for data at address
> 0x00000000
> [  424.180500] Faulting instruction address: 0xc0000000006a2428
> [  424.180504] Oops: Kernel access of bad area, sig: 11 [#1]
> [  424.180506] SMP NR_CPUS=2048 NUMA pSeries
> [  424.180509] Modules linked in: nfsv3 nfs_acl rpcsec_gss_krb5 auth_rpcgss
> nfsv4 nfs lockd grace fscache rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE)
> configfs ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_ib(OE)
> mlx5_core(OE) mlx4_ib(OE) pseries_rng ib_core(OE) vmx_crypto binfmt_misc
> dm_round_robin sunrpc dm_multipath knem(OE) ip_tables x_tables autofs4 btrfs
> xor raid6_pq mlx4_en(OE) ibmvfc scsi_transport_fc ibmvscsi bnx2x
> mlx4_core(OE) devlink mlx_compat(OE)
> mdio libcrc32c be2net crc32c_vpmsum
> [  424.180541] CPU: 0 PID: 2733 Comm: bash Tainted: G           OE
> 4.8.0-59-generic #64-Ubuntu
> [  424.180545] task: c0000000b3d78600 task.stack: c0000000a2104000
> [  424.180547] NIP: c0000000006a2428 LR: c0000000006a3478 CTR:
> c0000000006a2400
> [  424.180550] REGS: c0000000a21079f0 TRAP: 0300   Tainted: G           OE
> (4.8.0-59-generic)
> [  424.180553] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 28222222
> XER: 00000001
> [  424.180560] CFAR: c000000000008750 DAR: 0000000000000000 DSISR: 42000000
> SOFTE: 1
> GPR00: c0000000006a3478 c0000000a2107c70 c000000001467500 0000000000000063
> GPR04: c0000000bd00aca0 c0000000bd01fb40 c00000017fd2e300 000000000000b240
> GPR08: 0000000000000007 0000000000000001 0000000000000000 0000000000000001
> GPR12: c0000000006a2400 c000000007b30000 0000000000000000 0000000022000000
> GPR16: 0000000010170dc8 000001000df90258 0000000010140528 00000000100c6f60
> GPR20: 0000000000000000 000000001017dd58 0000000010152bf0 000000001017b608
> GPR24: 00003ffff97be144 00003ffff97be140 c00000000137e6e0 0000000000000004
> GPR28: c00000000137eaa0 0000000000000063 c000000001332590 0000000000000000
> [  424.180599] NIP [c0000000006a2428] sysrq_handle_crash+0x28/0x30
> [  424.180602] LR [c0000000006a3478] __handle_sysrq+0xe8/0x280
> [  424.180604] Call Trace:
> [  424.180606] [c0000000a2107c70] [c0000000006a3458]
> __handle_sysrq+0xc8/0x280 (unreliable)
> [  424.180610] [c0000000a2107d10] [c0000000006a3bcc]
> write_sysrq_trigger+0x6c/0x90
> [  424.180614] [c0000000a2107d40] [c0000000003adb48] proc_reg_write+0x88/0xd0
> [  424.180619] [c0000000a2107d70] [c0000000003105ac] __vfs_write+0x3c/0x70
> [  424.180622] [c0000000a2107d90] [c000000000311814] vfs_write+0xd4/0x240
> [  424.180625] [c0000000a2107de0] [c000000000313368] SyS_write+0x68/0x110
> [  424.180629] [c0000000a2107e30] [c000000000009584] system_call+0x38/0xec
> [  424.180631] Instruction dump:
> [  424.180633] 60000000 60000000 3c4c00dc 38425100 7c0802a6 60000000
> 3d22001a 3949bc60
> [  424.180639] 39200001 912a0000 7c0004ac 39400000 <992a0000> 4e800020
> 3c4c00dc 384250d0
> [  424.180645] ---[ end trace 8fd1cd00c31ebdd4 ]---
> [  424.183431]
> [  424.183450] Sending IPI to other CPUs
> [  424.183452] IPI complete
> I'm in purgatory
>  -> smp_release_cpus()
> spinning_secondaries = 47
>  <- smp_release_cpus()
> [    0.184530] pci 002b:50:00.0: of_irq_parse_pci() failed with rc=-22
> [    0.569039] Kernel panic - not syncing: Out of memory and no killable
> processes...
> [    0.569039]
> [    0.569066] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 4.8.0-59-generic
> #64-Ubuntu
> [    0.569069] Call Trace:
> [    0.569071] [c00000000d10b220] [c000000008b0fe4c] dump_stack+0xb0/0xf0
> (unreliable)
> [    0.569075] [c00000000d10b260] [c000000008b0bf58] panic+0x144/0x308
> [    0.569078] [c00000000d10b2f0] [c000000008249c2c]
> out_of_memory+0x48c/0x570
> [    0.569082] [c00000000d10b3a0] [c000000008250ad8]
> __alloc_pages_nodemask+0xdf8/0xe20
> [    0.569086] [c00000000d10b560] [c0000000082c6da8]
> alloc_page_interleave+0x58/0xc0
> [    0.569089] [c00000000d10b5a0] [c0000000082c7678]
> alloc_pages_current+0x168/0x1d0
> [    0.569093] [c00000000d10b600] [c0000000082435e8]
> __page_cache_alloc+0x118/0x160
> [    0.569096] [c00000000d10b640] [c0000000082437b4]
> pagecache_get_page+0x184/0x3c0
> [    0.569100] [c00000000d10b6b0] [c000000008243a34]
> grab_cache_page_write_begin+0x44/0x70
> [    0.569103] [c00000000d10b6e0] [c00000000834bf6c]
> simple_write_begin+0x4c/0x1b0
> [    0.569107] [c00000000d10b730] [c000000008243264]
> generic_perform_write+0x104/0x280
> [    0.569111] [c00000000d10b7d0] [c000000008245540]
> __generic_file_write_iter+0x1e0/0x230
> [    0.569114] [c00000000d10b830] [c00000000824567c]
> generic_file_write_iter+0xec/0x250
> [    0.569118] [c00000000d10b870] [c00000000831050c]
> new_sync_write+0xec/0x150
> [    0.569121] [c00000000d10b900] [c000000008311814] vfs_write+0xd4/0x240
> [    0.569124] [c00000000d10b950] [c000000008313368] SyS_write+0x68/0x110
> [    0.569127] [c00000000d10b9a0] [c000000008ea5d0c] xwrite+0x4c/0xb0
> [    0.569130] [c00000000d10b9e0] [c000000008ea5e60] do_copy+0xf0/0x170
> [    0.569133] [c00000000d10ba10] [c000000008ea59c4] write_buffer+0x5c/0x88
> [    0.569136] [c00000000d10ba40] [c000000008ea5a50] flush_buffer+0x60/0xec
> [    0.569140] [c00000000d10ba90] [c000000008eec4c8] __gunzip+0x378/0x47c
> [    0.569142] [c00000000d10bb10] [c000000008ea650c]
> unpack_to_rootfs+0x1c8/0x338
> [    0.569146] [c00000000d10bbc0] [c000000008ea688c]
> populate_rootfs+0x94/0x17c
> [    0.569149] [c00000000d10bc40] [c00000000800b948]
> do_one_initcall+0x68/0x1d0
> [    0.569152] [c00000000d10bd00] [c000000008ea42e8]
> kernel_init_freeable+0x278/0x360
> [    0.569156] [c00000000d10bdc0] [c00000000800c1b4] kernel_init+0x24/0x170
> [    0.569159] [c00000000d10be30] [c0000000080098f0]
> ret_from_kernel_thread+0x5c/0x6c
> [    0.571060] ---[ end Kernel panic - not syncing: Out of memory and no
> killable processes...
> [    0.571060]

Memory reserved for kdump was not good enough. Please spike the memory
reserved for kdump (crashkernel=) till OOM issues are not seen anymore and
report your observations:

https://wiki.ubuntu.com/ppc64el/Recommendations#Crash_Kernel_recommendations

Thanks
Hari

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1635597

Title:
  Ubuntu16.10:talclp1: Kdump failed with multipath disk

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1635597/+subscriptions

-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to