------- Comment From [email protected] 2018-01-24 15:10 EDT-------
(In reply to comment #6)
> I built a test kernel with commit 2621e945fbf1d6df5f3f0ba7be5bae3d2cf9b6a5.
> The test kernel can be downloaded from:
> http://kernel.ubuntu.com/~jsalisbury/lp1745104
>
> Can you test this kernel and see if it resolves this bug?
>
> Note, to test this kernel, you need to install both the linux-image and
> linux-image-extra .deb packages.
>
> Thanks in advance!

Thanks. It works, but I now see another error: "makedumpfile Failed".

root@boslcp4g5:~# echo c > /proc/sysrq-trigger
[ 80.255377] sysrq: SysRq : Trigger a crash
[ 80.255490] Unable to handle kernel paging request for data at address 0x00000000
[ 80.255597] Faulting instruction address: 0xc00000000078f608
[ 80.255708] Oops: Kernel access of bad area, sig: 11 [#1]
[ 80.255781] SMP NR_CPUS=2048
[ 80.255782] NUMA
[ 80.255837] pSeries
[ 80.255930] Modules linked in: rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache sctp_diag sctp libcrc32c dccp_diag dccp tcp_diag udp_diag raw_diag inet_diag unix_diag af_packet_diag netlink_diag binfmt_misc vmx_crypto crct10dif_vpmsum dm_service_time dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua sch_fq_codel sunrpc ip_tables x_tables autofs4 btrfs xor raid6_pq crc32c_vpmsum virtio_net virtio_scsi
[ 80.256798] CPU: 12 PID: 1928 Comm: bash Not tainted 4.13.0-17-generic #20~lp1745104
[ 80.256907] task: c0000000053c8a00 task.stack: c0000001ec3dc000
[ 80.257010] NIP: c00000000078f608 LR: c000000000790538 CTR: c00000000078f5e0
[ 80.257116] REGS: c0000001ec3df9f0 TRAP: 0300 Not tainted (4.13.0-17-generic)
[ 80.257221] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>
[ 80.257232] CR: 28422222 XER: 20040000
[ 80.257355] CFAR: c000000000790534 DAR: 0000000000000000 DSISR: 42000000 SOFTE: 1
[ 80.257355] GPR00: c000000000790538 c0000001ec3dfc70 c000000001606000 0000000000000063
[ 80.257355] GPR04: c0000001feeeade8 c0000001fef02068 6967676572206120 63726173680d0a72
[ 80.257355] GPR08: 0000000000000007 0000000000000001 0000000000000000 0000000000000000
[ 80.257355] GPR12: c00000000078f5e0 c000000007ac7e00 0000000010180df8 0000000010189e30
[ 80.257355] GPR16: 0000000010189ea8 0000000010151210 000000001018bd58 000000001018de48
[ 80.257355] GPR20: 00000000121f0248 0000000000000001 0000000010164590 0000000010163bb0
[ 80.257355] GPR24: 00007fffd06a0724 00007fffd06a0720 c00000000150a570 0000000000000002
[ 80.257355] GPR28: 0000000000000063 0000000000000004 c0000000014922f4 c00000000150a910
[ 80.258340] NIP [c00000000078f608] sysrq_handle_crash+0x28/0x30
[ 80.258433] LR [c000000000790538] __handle_sysrq+0xf8/0x2b0
[ 80.258504] Call Trace:
[ 80.258546] [c0000001ec3dfc70] [c000000000790518] __handle_sysrq+0xd8/0x2b0 (unreliable)
[ 80.258657] [c0000001ec3dfd10] [c000000000790d34] write_sysrq_trigger+0x64/0x90
[ 80.258789] [c0000001ec3dfd40] [c00000000044c0c8] proc_reg_write+0x88/0xd0
[ 80.258883] [c0000001ec3dfd70] [c00000000039db8c] __vfs_write+0x3c/0x70
[ 80.258975] [c0000001ec3dfd90] [c00000000039f7c8] vfs_write+0xd8/0x220
[ 80.259067] [c0000001ec3dfde0] [c0000000003a1648] SyS_write+0x68/0x110
[ 80.259159] [c0000001ec3dfe30] [c00000000000b184] system_call+0x58/0x6c
[ 80.259248] Instruction dump:
[ 80.259306] 4bfff9f1 4bfffe50 3c4c00e7 38426a20 7c0802a6 60000000 39200001 3d42001d
[ 80.259420] 394ad788 912a0000 7c0004ac 39400000 <992a0000> 4e800020 3c4c00e7 384269f0
[ 80.259538] ---[ end trace 1a6f8bd87bc98f1a ]---
[ 80.266608]
[ 80.266754] Sending IPI to other CPUs
[ 80.332950] IPI complete
[ 80.343090] kexec: Starting switchover sequence.
I'm in purgatory
-> smp_release_cpus()
spinning_secondaries = 15
<- smp_release_cpus()
Linux ppc64le #20~lp1745104 SM[ 0.964806] Unable to open file: /etc/keys/x509_ima.der (-2)
[ 0.964813] Unable to open file: /etc/keys/x509_evm.der (-2)
[ 1.129495] vio vio: uevent: failed to send synthetic uevent
/dev/sde2: recovering journal
/dev/sde2: clean, 87024/3932160 files, 1989241/15726336 blocks
[ 2.090741] nvram: Failed to find partition of-config, err 0
[ 3.200128] vio vio: uevent: failed to send synthetic uevent
[ OK ] Started Show Plymouth Boot Screen.
plymouth-start.service
[ OK ] Reached target Local Encrypted Volumes.
[ OK ] Started Forward Password Requests to Plymouth Directory Watch.
systemd-tmpfiles-setup.service
[ OK ] Started Create Volatile Files and Directories.
Starting Update UTMP about System Boot/Shutdown...
Starting Network Time Synchronization...
[ OK ] Started Update UTMP about System Boot/Shutdown.
systemd-update-utmp.service
[ OK ] Started Network Time Synchronization.
[ OK ] Reached target System Time Synchronized.
systemd-timesyncd.service
apparmor.service
[ OK ] Started AppArmor initialization.
[ OK ] Reached target System Initialization.
[ OK ] Listening on Load/Save RF Kill Switch Status /dev/rfkill Watch.
[ OK ] Started Wait for Network to be Configured.
systemd-networkd-wait-online.service
[ OK ] Reached target Network is Online.
Starting Kernel crash dump capture service...
Starting RMC-Resource Monitioring and Control StartOnly...
[ 8.478318] kdump-tools[868]: Starting kdump-tools: * running makedumpfile -c -d 31 /proc/vmcore /var/crash/201801240706/dump-incomplete
Checking for memory holes : [100.0 %] | readmem: Can't convert a virtual address(c00a000000800000) to physical address.
[ 8.524077] kdump-tools[868]: readmem: type_addr: 0, addr:c00a000000800000, size:16384
[ 8.524533] kdump-tools[868]: __exclude_unnecessary_pages: Can't read the buffer of struct page.
[ 8.524978] kdump-tools[868]: create_2nd_bitmap: Can't exclude unnecessary pages.
[ 8.525465] kdump-tools[868]: The kernel version is not supported.
[ 8.525582] kdump-tools[868]: The makedumpfile operation may be incomplete.
[ 8.525684] kdump-tools[868]: makedumpfile Failed.
[ 8.526933] kdump-tools[868]: * kdump-tools: makedumpfile failed, falling back to 'cp'
[ 15.367526] kdump-tools[868]: * kdump-tools: saved vmcore in /var/crash/201801240706

--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1745104

Title:
  ISST-LTE: KVM:UBUNTU1804: kdump is not working on UbuntuKVM guest

Status in The Ubuntu-power-systems project:
  In Progress
Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Bionic:
  In Progress

Bug description:
== Comment: #0 - Chanh H. Nguyen <[email protected]> - 2018-01-23 14:55:05 ==

root@boslcp4g5:~# cat /etc/os-release
NAME="Ubuntu"
VERSION="18.04 LTS (Bionic Beaver)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu Bionic Beaver (development branch)"
VERSION_ID="18.04"
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
VERSION_CODENAME=bionic
UBUNTU_CODENAME=bionic

root@boslcp4g5:~# uname -r
4.13.0-25-generic

root@boslcp4g5:~# kdump-config show
DUMP_MODE: kdump
USE_KDUMP: 1
KDUMP_SYSCTL: kernel.panic_on_oops=1
KDUMP_COREDIR: /var/crash
crashkernel addr:
/var/lib/kdump/vmlinuz: symbolic link to /boot/vmlinux-4.13.0-25-generic
kdump initrd:
/var/lib/kdump/initrd.img: symbolic link to /var/lib/kdump/initrd.img-4.13.0-25-generic
current state: ready to kdump
kexec command:
/sbin/kexec -p --command-line="BOOT_IMAGE=/boot/vmlinux-4.13.0-25-generic root=UUID=8a776bc5-d9e0-4a1d-9218-135f9c702e11 ro splash quiet nr_cpus=1 systemd.unit=kdump-tools.service irqpoll noirqdistrib nousb" --initrd=/var/lib/kdump/initrd.img /var/lib/kdump/vmlinuz

root@boslcp4g5:~# kdump-config status
current state : ready to kdump

root@boslcp4g5:~# echo c > /proc/sysrq-trigger
[ 176.911191] sysrq: SysRq : This sysrq operation is disabled.
root@boslcp4g5:~# sysctl -w kernel.sysrq=1
kernel.sysrq = 1
root@boslcp4g5:~# echo c > /proc/sysrq-trigger
[ 240.304466] sysrq: SysRq : Trigger a crash
[ 240.304545] Unable to handle kernel paging request for data at address 0x00000000
[ 240.304656] Faulting instruction address: 0xc000000000792f88
[ 240.304771] Oops: Kernel access of bad area, sig: 11 [#1]
[ 240.304846] SMP NR_CPUS=2048
[ 240.304848] NUMA
[ 240.304903] pSeries
[ 240.305000] Modules linked in: sctp_diag sctp libcrc32c dccp_diag dccp tcp_diag udp_diag raw_diag inet_diag unix_diag af_packet_diag netlink_diag rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache dm_service_time vmx_crypto crct10dif_vpmsum binfmt_misc dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua sunrpc sch_fq_codel ip_tables x_tables autofs4 btrfs xor raid6_pq crc32c_vpmsum virtio_net virtio_scsi
[ 240.305776] CPU: 12 PID: 1860 Comm: bash Not tainted 4.13.0-25-generic #29-Ubuntu
[ 240.305886] task: c0000000ff904500 task.stack: c0000001f2d7c000
[ 240.305979] NIP: c000000000792f88 LR: c000000000793eb8 CTR: c000000000792f60
[ 240.306087] REGS: c0000001f2d7f9f0 TRAP: 0300 Not tainted (4.13.0-25-generic)
[ 240.306195] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>
[ 240.306207] CR: 28422222 XER: 20040000
[ 240.306338] CFAR: c000000000793eb4 DAR: 0000000000000000 DSISR: 42000000 SOFTE: 1
[ 240.306338] GPR00: c000000000793eb8 c0000001f2d7fc70 c0000000015f6200 0000000000000063
[ 240.306338] GPR04: c0000001feeeade8 c0000001fef02068 6967676572206120 63726173680d0a72
[ 240.306338] GPR08: 0000000000000007 0000000000000001 0000000000000000 0000000000000000
[ 240.306338] GPR12: c000000000792f60 c000000007ac7e00 0000000010180df8 0000000010189e30
[ 240.306338] GPR16: 0000000010189ea8 0000000010151210 000000001018bd58 000000001018de48
[ 240.306338] GPR20: 0000000028bc0268 0000000000000001 0000000010164590 0000000010163bb0
[ 240.306338] GPR24: 00007fffdcb37d34 00007fffdcb37d30 c0000000014fa770 0000000000000002
[ 240.306338] GPR28: 0000000000000063 0000000000000007 c0000000014824f4 c0000000014fab10
[ 240.307346] NIP [c000000000792f88] sysrq_handle_crash+0x28/0x30
[ 240.307474] LR [c000000000793eb8] __handle_sysrq+0xf8/0x2b0
[ 240.307553] Call Trace:
[ 240.307594] [c0000001f2d7fc70] [c000000000793e98] __handle_sysrq+0xd8/0x2b0 (unreliable)
[ 240.307715] [c0000001f2d7fd10] [c0000000007946b4] write_sysrq_trigger+0x64/0x90
[ 240.307850] [c0000001f2d7fd40] [c00000000044fb28] proc_reg_write+0x88/0xd0
[ 240.307951] [c0000001f2d7fd70] [c0000000003a160c] __vfs_write+0x3c/0x70
[ 240.308049] [c0000001f2d7fd90] [c0000000003a3248] vfs_write+0xd8/0x220
[ 240.308149] [c0000001f2d7fde0] [c0000000003a50c8] SyS_write+0x68/0x110
[ 240.308248] [c0000001f2d7fe30] [c00000000000b184] system_call+0x58/0x6c
[ 240.308340] Instruction dump:
[ 240.308400] 4bfff9f1 4bfffe50 3c4c00e6 384232a0 7c0802a6 60000000 39200001 3d42001d
[ 240.308522] 394adab0 912a0000 7c0004ac 39400000 <992a0000> 4e800020 3c4c00e6 38423270
[ 240.308644] ---[ end trace 97aaa45518689ad0 ]---
[ 240.314197]
[ 240.314408] Sending IPI to other CPUs
[ 240.357424] IPI complete
[ 240.377038] kexec: Starting switchover sequence. <<<<<<<<<<<< it stops here.....

== Comment: #2 - MAMATHA INAMDAR <[email protected]> - 2018-01-24 01:07:12 ==

Hi Chanh,

We need the following patch, which will fix this issue. I think this patch is not integrated in the 18.04 kernel.

From 2621e945fbf1d6df5f3f0ba7be5bae3d2cf9b6a5 Mon Sep 17 00:00:00 2001
From: Michael Ellerman <[email protected]>
Date: Fri, 24 Nov 2017 14:51:02 +1100
Subject: [PATCH] powerpc/kexec: Fix kexec/kdump in P9 guest kernels

The code that cleans up the IAMR/AMOR before kexec'ing failed to
remember that when we're running as a guest AMOR is not writable, it's
hypervisor privileged.

The symptom is that the kexec stops before entering purgatory and
nothing else is seen on the console. If you examine the state of the
system all threads will be in the 0x700 program check handler.

Fix it by making the write to AMOR dependent on HV mode.

Fixes: 1e2a516e89fc ("powerpc/kexec: Fix radix to hash kexec due to IAMR/AMOR")
Cc: [email protected] # v4.10+
Reported-by: Yilin Zhang <[email protected]>
Debugged-by: David Gibson <[email protected]>
Signed-off-by: Michael Ellerman <[email protected]>
Acked-by: Balbir Singh <[email protected]>
Reviewed-by: David Gibson <[email protected]>
Tested-by: David Gibson <[email protected]>
Signed-off-by: Michael Ellerman <[email protected]>

diff --git a/arch/powerpc/kernel/misc_64.S b/arch/powerpc/kernel/misc_64.S
index 8ac0bd2..3280953 100644
--- a/arch/powerpc/kernel/misc_64.S
+++ b/arch/powerpc/kernel/misc_64.S
@@ -623,7 +623,9 @@ BEGIN_FTR_SECTION
 	 * NOTE, we rely on r0 being 0 from above.
 	 */
 	mtspr	SPRN_IAMR,r0
+BEGIN_FTR_SECTION_NESTED(42)
 	mtspr	SPRN_AMOR,r0
+END_FTR_SECTION_NESTED_IFSET(CPU_FTR_HVMODE, 42)
 END_FTR_SECTION_IFSET(CPU_FTR_ARCH_300)
 
 	/* save regs for local vars on new stack.

--
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1745104/+subscriptions

--
Mailing list: https://launchpad.net/~kernel-packages
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

