[Kernel-packages] [Bug 1398596] Re: [Revert] arm64: optimized copy_to_user and copy_from_user assembly code
** Tags added: hs-arm64 -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1398596 Title: [Revert] arm64: optimized copy_to_user and copy_from_user assembly code Status in linux package in Ubuntu: Triaged Status in linux source package in Trusty: Triaged Status in linux source package in Utopic: Triaged Bug description: [Impact] Fixes an issue where the kernel hits unhandled pagefaults when copy_to_user is called. http://www.spinics.net/lists/arm-kernel/msg381811.html [Test Case] Put the system under memory pressure. [Regression Potential] It will cause a performance regression on certain workloads on arm64 systems. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1398596/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1400349] Re: [SRU] new arm64 optimized copy_to_user and copy_from_user assembly code
** Tags added: hs-arm64 -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1400349 Title: [SRU] new arm64 optimized copy_to_user and copy_from_user assembly code Status in linux package in Ubuntu: New Bug description: [Impact] The copy_{to,from}_user implementations in the Ubuntu kernel are suboptimal. Optimized implementations have been submitted upstream and have shown a significant improvement in network performance. [Test Case] Generate traffic from one node to another using iperf. [Regression Risk] These functions are obviously used heavily throughout the kernel, so a defect here could have significant impact. This risk is mitigated by using an implementation heavily based on the linaro string libraries (which are used in other places already, e.g. glibc), and through active testing of this patch on real hardware using a trusty-kernel base. The new patch will also undergo regression testing to ensure the memory corruption issue of the previous patch does not exist. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1400349/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1386490] Re: HP ProLiant m400 nic doesn't work after trusty
** Tags added: arm-hs-vivid -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1386490 Title: HP ProLiant m400 nic doesn't work after trusty Status in linux package in Ubuntu: Confirmed Status in linux source package in Utopic: Fix Committed Status in linux source package in Vivid: Confirmed Bug description: Starting in 3.15, arm64 began defaulting to non-coherent dma_ops: commit c7a4a7658d689f664050c45493d79adf053f226e Author: Ritesh Harjani ritesh.harj...@gmail.com Date: Wed Apr 23 06:29:46 2014 +0100 arm64: Make default dma_ops to be noncoherent Firmware (dtb in the case of the m400) is responsible for telling the kernel when a device requires coherent dma_ops. However, as of utopic, this property is not being inherited by downstream devices. Specifically, the xgene-pcie device is marked as coherent, but the devices behind it (mellanox card) still get initialized with non- coherent ops. This results in the mlx4 driver bailing out with the following messages: [ 18.703635] mlx4_core :01:00.0: command 0x23 timed out (go bit not cleared) [ 18.710911] mlx4_core :01:00.0: Failed to initialize queue pair table, aborting There's an upstream discussion on the topic here: http://www.spinics.net/lists/arm-kernel/msg362320.html To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1386490/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1284433] Re: System can not reboot on APM X-Gene board
** Tags added: arm-hs-vivid -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1284433 Title: System can not reboot on APM X-Gene board Status in linux package in Ubuntu: Fix Released Status in linux source package in Trusty: Fix Released Status in linux source package in Utopic: Fix Released Bug description: * Stopping rsync daemon rsync [ OK ] * Asking all remaining processes to terminate...[ OK ] * All processes ended within 1 seconds... [ OK ] * Deactivating swap... [ OK ] * Unmounting weak filesystems...[ OK ] * Unmounting local filesystems... [ OK ] * Will now restart [ 151.242953] reboot: Restarting system System hangs when rebooting, need drivers from http://www.spinics.net/lists/arm-kernel/msg303898.html To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1284433/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1358949] Re: arm64 iperf performance suboptimal
** Tags added: arm-hs-vivid -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1358949 Title: arm64 iperf performance suboptimal Status in linux package in Ubuntu: Fix Released Status in linux source package in Trusty: Fix Released Status in linux source package in Utopic: Fix Released Bug description: [Impact] The copy_{to,from}_user implementations in the Ubuntu kernel are suboptimal. Optimized implementations have been submitted upstream and have shown a significant improvement in network performance. Iperf performance increase: -l (size) 1 core result Optimized 64B 44-51Mb/s 1500B 4.9Gb/s 3B 16.2Gb/s Original64B 34-50.7Mb/s 1500B 4.7Gb/s 3B 14.5Gb/s [Test Case] Generate traffic from one node to another using iperf (see above for config). [Regression Risk] These functions are obviously used heavily throughout the kernel, so a defect here could have significant impact. This risk is mitigated by using an implementation heavily based on the linaro string libraries (which are used in other places already, e.g. glibc), and through active testing of this patch on real hardware using a trusty-kernel base. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1358949/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1386490] Re: HP ProLiant m400 nic doesn't work after trusty
Quick history summary and status update: Ming originally proposed a fix for upstream, but this was rejected as upstream wanted a more complete, larger fix that included some PCI changes. Such a change has now been proposed to upstream, is being discussed and is nearing acceptance. Until the upstream fix is accepted and pulled into an Ubuntu release, we will continue to attempt to carry the reversion in the generic Ubuntu kernel. This bug is now tracking the propagation of the reversion, and the successful upstream submission of a fix for this issue. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1386490 Title: HP ProLiant m400 nic doesn't work after trusty Status in debian-installer package in Ubuntu: New Status in linux package in Ubuntu: Confirmed Status in debian-installer source package in Utopic: Fix Released Status in linux source package in Utopic: Fix Released Status in debian-installer source package in Vivid: New Status in linux source package in Vivid: Confirmed Bug description: Starting in 3.15, arm64 began defaulting to non-coherent dma_ops: commit c7a4a7658d689f664050c45493d79adf053f226e Author: Ritesh Harjani ritesh.harj...@gmail.com Date: Wed Apr 23 06:29:46 2014 +0100 arm64: Make default dma_ops to be noncoherent Firmware (dtb in the case of the m400) is responsible for telling the kernel when a device requires coherent dma_ops. However, as of utopic, this property is not being inherited by downstream devices. Specifically, the xgene-pcie device is marked as coherent, but the devices behind it (mellanox card) still get initialized with non- coherent ops. This results in the mlx4 driver bailing out with the following messages: [ 18.703635] mlx4_core :01:00.0: command 0x23 timed out (go bit not cleared) [ 18.710911] mlx4_core :01:00.0: Failed to initialize queue pair table, aborting There's an upstream discussion on the topic here: http://www.spinics.net/lists/arm-kernel/msg362320.html To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/debian-installer/+bug/1386490/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1460941] Re: arm64: crash: invalid/unsupported page size: 6144
From a private email thread, the next step is to merge this commit to the upstream Debian crash archive, and then re-sync the Ubuntu crash archive from Debian. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to crash in Ubuntu. https://bugs.launchpad.net/bugs/1460941 Title: arm64: crash: invalid/unsupported page size: 6144 Status in crash package in Ubuntu: New Bug description: After running crash from trusty, the failure log in [1] can be observed. Then I built crash from wily directly, the similar failure[2] can be observed too. [1] failure log ubuntu@am2:~/git/crash-wily$ sudo crash ~/vmlinux crash 7.0.3 Copyright (C) 2002-2013 Red Hat, Inc. Copyright (C) 2004, 2005, 2006, 2010 IBM Corporation Copyright (C) 1999-2006 Hewlett-Packard Co Copyright (C) 2005, 2006, 2011, 2012 Fujitsu Limited Copyright (C) 2006, 2007 VA Linux Systems Japan K.K. Copyright (C) 2005, 2011 NEC Corporation Copyright (C) 1999, 2002, 2007 Silicon Graphics, Inc. Copyright (C) 1999, 2000, 2001, 2002 Mission Critical Linux, Inc. This program is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Enter help copying to see the conditions. This program has absolutely no warranty. Enter help warranty for details. crash: invalid/unknown page size: 6144 [2] ubuntu@am2:~/git/crash-wily$ sudo ./crash ~/vmlinux [sudo] password for ubuntu: crash 7.0.8 Copyright (C) 2002-2014 Red Hat, Inc. Copyright (C) 2004, 2005, 2006, 2010 IBM Corporation Copyright (C) 1999-2006 Hewlett-Packard Co Copyright (C) 2005, 2006, 2011, 2012 Fujitsu Limited Copyright (C) 2006, 2007 VA Linux Systems Japan K.K. Copyright (C) 2005, 2011 NEC Corporation Copyright (C) 1999, 2002, 2007 Silicon Graphics, Inc. Copyright (C) 1999, 2000, 2001, 2002 Mission Critical Linux, Inc. This program is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Enter help copying to see the conditions. This program has absolutely no warranty. Enter help warranty for details. crash: invalid/unsupported page size: 6144 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/crash/+bug/1460941/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1460942] Re: crash: incompatible arguments: vmlinux is not SMP -- live system is SMP
From a private email thread, the next step is to merge this commit to the upstream Debian crash archive, and then re-sync the Ubuntu crash archive from Debian. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to crash in Ubuntu. https://bugs.launchpad.net/bugs/1460942 Title: crash: incompatible arguments: vmlinux is not SMP -- live system is SMP Status in crash package in Ubuntu: New Bug description: When I build crash from wily, the failure in [1] can be triggered if the kernel is 4.0+. [1] failure log ubuntu@am2:~/git/crash-wily$ sudo ./crash ~/vmlinux crash 7.0.8 Copyright (C) 2002-2014 Red Hat, Inc. Copyright (C) 2004, 2005, 2006, 2010 IBM Corporation Copyright (C) 1999-2006 Hewlett-Packard Co Copyright (C) 2005, 2006, 2011, 2012 Fujitsu Limited Copyright (C) 2006, 2007 VA Linux Systems Japan K.K. Copyright (C) 2005, 2011 NEC Corporation Copyright (C) 1999, 2002, 2007 Silicon Graphics, Inc. Copyright (C) 1999, 2000, 2001, 2002 Mission Critical Linux, Inc. This program is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Enter help copying to see the conditions. This program has absolutely no warranty. Enter help warranty for details. GNU gdb (GDB) 7.6 Copyright (C) 2013 Free Software Foundation, Inc. License GPLv3+: GNU GPL version 3 or later http://gnu.org/licenses/gpl.html This is free software: you are free to change and redistribute it. There is NO WARRANTY, to the extent permitted by law. Type show copying and show warranty for details. This GDB was configured as aarch64-unknown-linux-gnu... WARNING: kernels compiled by different gcc versions: /home/ubuntu/vmlinux: (unknown) live system kernel: 4.8.2 WARNING: kernel version inconsistency between vmlinux and live memory crash: incompatible arguments: /home/ubuntu/vmlinux is not SMP -- live system is SMP Usage: crash [OPTION]... NAMELIST MEMORY-IMAGE[@ADDRESS] (dumpfile form) crash [OPTION]... [NAMELIST](live system form) Enter crash -h for details. ubuntu@am2:~/git/crash-wily$ To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/crash/+bug/1460942/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault
** Also affects: irqbalance (Ubuntu) Importance: Undecided Status: New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1469214 Title: HP ProLiant m400 Server crashes with unhandled level 3 translation fault Status in irqbalance package in Ubuntu: New Status in linux package in Ubuntu: Triaged Bug description: Running stress-ng on a HP ProLiant m400 server can cause unhandled level 3 translations faults: use stress-ng from git://kernel.ubuntu.com/cking/stress-ng ./stress-ng --seq 0 -t 60 -v and after some time this trips the following: Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922560] systemd-timesyn[481]: unhandled level 3 translation fault (7) at 0x7fa8ea6008, esr 0x9207 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922561] pgd = ffcfb563f000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922563] [7fa8ea6008] *pgd=004fb4f28003, *pud=004fb4f28003, *pmd=004fb4f38003, *pte=1d151c00 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922566] Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922569] CPU: 6 PID: 481 Comm: systemd-timesyn Not tainted 3.19.0-21-generic #21-Ubuntu Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922571] Hardware name: HP ProLiant m400 Server Cartridge (DT) Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922573] task: ffcfb4e3b100 ti: ffcfb4d2c000 task.ti: ffcfb4d2c000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922588] PC is at 0x7fa8d81824 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922589] LR is at 0x7fa8e3b3e4 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922590] pc : [007fa8d81824] lr : [007fa8e3b3e4] pstate: 8000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922591] sp : 007ff120d660 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922592] x29: 007ff120d660 x28: 007fa8f1c000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922594] x27: 007fa8f32084 x26: 007fa8f32000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922595] x25: 007fa8f1d788 x24: 007fa8f1d888 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922597] x23: 0001 x22: 007fa8f1faa0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922599] x21: 007ff120d7f0 x20: 007ff120d7d0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922600] x19: 007fa8f31000 x18: 007fa8f1e000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922602] x17: 007fa8e3b3b8 x16: 007fa8ea6000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922603] x15: 003b9aca x14: 00219bbdd000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922605] x13: aa751223 x12: Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922607] x11: 0101010101010101 x10: 7f7f7f7f7f7f7f7f Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922609] x9 : 37333c43484f5e46 x8 : 007ff120d818 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922610] x7 : 007ff120d8f0 x6 : 007ff120d828 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922612] x5 : ff80ffd0 x4 : 007ff120d8c0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922613] x3 : 007ff120d7d0 x2 : 007fa8f1faa0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922615] x1 : 0001 x0 : 0064 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922616] To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1469214] Re: HP ProLiant m400 Server crashes with unhandled level 3 translation fault
Following Ming's identification of an irqbalance patch that fixes this issue, I'm marking the Affected status on linux (Ubuntu) as being invalid. ** Changed in: linux (Ubuntu Trusty) Status: New = Invalid ** Changed in: linux (Ubuntu Utopic) Status: New = Invalid ** Changed in: linux (Ubuntu Vivid) Status: New = Invalid ** Changed in: linux (Ubuntu Wily) Status: Triaged = Invalid ** Changed in: irqbalance (Ubuntu Vivid) Status: New = In Progress -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1469214 Title: HP ProLiant m400 Server crashes with unhandled level 3 translation fault Status in irqbalance package in Ubuntu: New Status in linux package in Ubuntu: Invalid Status in irqbalance source package in Trusty: New Status in linux source package in Trusty: Invalid Status in irqbalance source package in Utopic: New Status in linux source package in Utopic: Invalid Status in irqbalance source package in Vivid: In Progress Status in linux source package in Vivid: Invalid Status in irqbalance source package in Wily: New Status in linux source package in Wily: Invalid Bug description: Running stress-ng on a HP ProLiant m400 server can cause unhandled level 3 translations faults: use stress-ng from git://kernel.ubuntu.com/cking/stress-ng ./stress-ng --seq 0 -t 60 -v and after some time this trips the following: Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922560] systemd-timesyn[481]: unhandled level 3 translation fault (7) at 0x7fa8ea6008, esr 0x9207 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922561] pgd = ffcfb563f000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922563] [7fa8ea6008] *pgd=004fb4f28003, *pud=004fb4f28003, *pmd=004fb4f38003, *pte=1d151c00 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922566] Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922569] CPU: 6 PID: 481 Comm: systemd-timesyn Not tainted 3.19.0-21-generic #21-Ubuntu Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922571] Hardware name: HP ProLiant m400 Server Cartridge (DT) Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922573] task: ffcfb4e3b100 ti: ffcfb4d2c000 task.ti: ffcfb4d2c000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922588] PC is at 0x7fa8d81824 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922589] LR is at 0x7fa8e3b3e4 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922590] pc : [007fa8d81824] lr : [007fa8e3b3e4] pstate: 8000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922591] sp : 007ff120d660 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922592] x29: 007ff120d660 x28: 007fa8f1c000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922594] x27: 007fa8f32084 x26: 007fa8f32000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922595] x25: 007fa8f1d788 x24: 007fa8f1d888 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922597] x23: 0001 x22: 007fa8f1faa0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922599] x21: 007ff120d7f0 x20: 007ff120d7d0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922600] x19: 007fa8f31000 x18: 007fa8f1e000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922602] x17: 007fa8e3b3b8 x16: 007fa8ea6000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922603] x15: 003b9aca x14: 00219bbdd000 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922605] x13: aa751223 x12: Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922607] x11: 0101010101010101 x10: 7f7f7f7f7f7f7f7f Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922609] x9 : 37333c43484f5e46 x8 : 007ff120d818 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922610] x7 : 007ff120d8f0 x6 : 007ff120d828 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922612] x5 : ff80ffd0 x4 : 007ff120d8c0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922613] x3 : 007ff120d7d0 x2 : 007fa8f1faa0 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922615] x1 : 0001 x0 : 0064 Jun 26 14:01:54 ms10-34-proliant kernel: [150297.922616] To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/irqbalance/+bug/1469214/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1537666] Re: ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor
As per comment #4, marking this Fix Released. ** Changed in: linux (Ubuntu) Status: Triaged => Fix Released ** Changed in: linux (Ubuntu) Assignee: Taco Screen team (taco-screen-team) => (unassigned) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1537666 Title: ISST-LTE: Ubuntu 14.04.4 LPAR interrupts at check_and_cede_processor Status in linux package in Ubuntu: Fix Released Bug description: == Comment: #0 - YUECHANG E. MEI- 2015-12-11 17:19:07 == ---Problem Description--- We have an Ubuntu 14.04.4 LPAR, conelp2. It is running stress test: base, io, and tcp. When checking "dmesg", we see this interruption: [Fri Dec 11 13:58:50 2015] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28 [Fri Dec 11 13:58:50 2015] LR = check_and_cede_processor+0x34/0x50 In the previous test, conelp2 stopped all the stress tests by itself because it ran out of memory. Is the out of memory issue relating to the interruption? Contact Information = Yuechang (Erin) Mei /ye...@us.ibm.com, Raja Sunkari /rajas...@in.ibm.com ---uname output--- Linux conelp2 4.2.0-21-generic #25~14.04.1-Ubuntu SMP Thu Dec 3 13:55:42 UTC 2015 ppc64le ppc64le ppc64le GNU/Linux Machine Type = EUH Alpine 8408-E8E ---Debugger--- A debugger is not configured ---Steps to Reproduce--- 1. install Ubuntu 14.04.4 in a LPAR, then update to the latest 14.04.4 kernel by using this workaround: echo "deb http://software.linux.ibm.com/pub/ubuntu-ppc64el-repository/ trusty-proposed main restricted universe multiverse" >> /etc/apt/sources.list apt-get update apt-get install linux-image-generic-lts-wily 2. Setup the Stress test, and start base,io, tcp 3. After an hour, check dmesg, then you will see the message about the interruption Stack trace output: no Oops output: no System Dump Info: The system is not configured to capture a system dump. *Additional Instructions for Yuechang (Erin) Mei /ye...@us.ibm.com, Raja Sunkari /rajas...@in.ibm.com: -Post a private note with access information to the machine that the bug is occuring on. -Attach sysctl -a output output to the bug. == Comment: #1 - YUECHANG E. MEI - 2015-12-11 17:23:00 == == Comment: #3 - YUECHANG E. MEI - 2015-12-14 15:23:33 == == Comment: #4 - MAMATHA INAMDAR - 2015-12-15 03:56:14 == dmrsg show page allocation failure [Fri Dec 11 13:45:38 2015] swapper/127: page allocation failure: order:0, mode:0x120 [Fri Dec 11 13:45:38 2015] CPU: 127 PID: 0 Comm: swapper/127 Not tainted 4.2.0-21-generic #25~14.04.1-Ubuntu [Fri Dec 11 13:45:38 2015] Call Trace: [Fri Dec 11 13:45:38 2015] [c0027fbc3890] [c0a805ec] dump_stack+0x90/0xbc (unreliable) [Fri Dec 11 13:45:38 2015] [c0027fbc38c0] [c021c118] warn_alloc_failed+0x118/0x160 [Fri Dec 11 13:45:38 2015] [c0027fbc3960] [c0221114] __alloc_pages_nodemask+0x834/0xa60 [Fri Dec 11 13:45:38 2015] [c0027fbc3b10] [c0221404] __alloc_page_frag+0xc4/0x190 [Fri Dec 11 13:45:38 2015] [c0027fbc3b50] [c08f6d20] netdev_alloc_frag+0x50/0x80 [Fri Dec 11 13:45:38 2015] [c0027fbc3b80] [c0764e80] tg3_alloc_rx_data+0xa0/0x2c0 [Fri Dec 11 13:45:38 2015] [c0027fbc3be0] [c0767344] tg3_poll_work+0x484/0x1070 [Fri Dec 11 13:45:38 2015] [c0027fbc3ce0] [c0767f8c] tg3_poll_msix+0x5c/0x210 [Fri Dec 11 13:45:38 2015] [c0027fbc3d30] [c090ebb8] net_rx_action+0x2d8/0x430 [Fri Dec 11 13:45:38 2015] [c0027fbc3e40] [c00ba124] __do_softirq+0x174/0x390 [Fri Dec 11 13:45:38 2015] [c0027fbc3f40] [c00ba6c8] irq_exit+0xc8/0x100 [Fri Dec 11 13:45:38 2015] [c0027fbc3f60] [c00111ec] __do_irq+0x8c/0x190 [Fri Dec 11 13:45:38 2015] [c0027fbc3f90] [c0024278] call_do_irq+0x14/0x24 [Fri Dec 11 13:45:38 2015] [c002763a39b0] [c0011390] do_IRQ+0xa0/0x120 [Fri Dec 11 13:45:38 2015] [c002763a3a10] [c00099b0] restore_check_irq_replay+0x2c/0x70 [Fri Dec 11 13:45:38 2015] --- interrupt: 501 at plpar_hcall_norets+0x1c/0x28 [Fri Dec 11 13:45:38 2015] LR = check_and_cede_processor+0x34/0x50 [Fri Dec 11 13:45:38 2015] [c002763a3d00] [c08a8d90] check_and_cede_processor+0x20/0x50 (unreliable) [Fri Dec 11 13:45:38 2015] [c002763a3d60] [c08a8fb8] shared_cede_loop+0x68/0x170 [Fri Dec 11 13:45:38 2015] [c002763a3da0] [c08a615c] cpuidle_enter_state+0xbc/0x350 [Fri Dec 11 13:45:38 2015] [c002763a3e00] [c0110f3c] call_cpuidle+0x7c/0xd0 [Fri Dec 11 13:45:38 2015] [c002763a3e40] [c01112d0] cpu_startup_entry+0x340/0x450 [Fri Dec 11 13:45:38 2015] [c002763a3f10]
[Kernel-packages] [Bug 1505178] Re: MFG: Habanero: hxestorage exerciser logs task blocked messages in dmesg when running disks under PMC Sierra
Marking bug as incomplete, as per comment #5, no response was received. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1505178 Title: MFG: Habanero: hxestorage exerciser logs task blocked messages in dmesg when running disks under PMC Sierra Status in linux package in Ubuntu: Incomplete Bug description: == Comment: #0 == When running STX on Habanero systems with PMC Sierra, the following linux error messages are found when running "dmesg -T --level=alert,crit,err" after the run. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18049 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18177 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18181 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18185 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18189 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18194 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18200 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18205 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18213 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18221 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. We are running the following code levels. ver 1.5.4.3 - OS, HTX, Firmware and Machine details OS: GNU/Linux OS Version: Ubuntu 14.04.3 LTS \n \l Kernel Version: 3.19.0-25-generic HTX Version: htxubuntu-357 Host Name: rcx2c357 Machine Serial No: 1035C5A Machine Type/Model: 8348-21C We have a very limited number of PMC Sierra configs. I've seen this error on both EC3S and ECSY PMC adapter types. We've only run systems with 6TB drives or a mix of 6TB and 8TB disk drives so far. == Comment: #5 == Call Trace: dmesg -T --- [Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18049 blocked for more than 120 seconds. [Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu [Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [Fri Oct 2 12:36:52 2015] hxestorage D 3fff78c69a20 0 18049 451 0x0004 [Fri Oct 2 12:36:52 2015] Call Trace: [Fri Oct 2 12:36:52 2015] [c0791de17490] [c079111f8980] 0xc079111f8980 (unreliable) [Fri Oct 2 12:36:52 2015] [c0791de17660] [c0015934] __switch_to+0x204/0x350 [Fri Oct 2 12:36:52 2015] [c0791de176c0] [c0a11948]
[Kernel-packages] [Bug 1546343] Re: ISST-LTE: kernel BUG at /build/linux-lts-wily-W0lTWH/linux-lts-wily-4.2.0/fs/btrfs/extent-tree.c:6016!
Is this issue persisting with newer, supported kernels? ** Changed in: linux (Ubuntu) Status: New => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1546343 Title: ISST-LTE: kernel BUG at /build/linux-lts-wily-W0lTWH/linux-lts- wily-4.2.0/fs/btrfs/extent-tree.c:6016! Status in linux package in Ubuntu: Incomplete Bug description: Mirroring for awareness: Last replicated on the 4.2.0-27-generic #32~14.04.1 kernel We are encountering btrfs panics like the following while doing stress testing: 68:mon> e cpu 0x68: Vector: 700 (Program Check) at [c00029a971d0] pc: d691f7b4: unpin_extent_range+0xe4/0x320 [btrfs] lr: d691f7a4: unpin_extent_range+0xd4/0x320 [btrfs] sp: c00029a97450 msr: 800100029033 current = 0xc000fbdae590 paca= 0xc7b1dc00 softe: 0irq_happened: 0x01 pid = 28091, comm = aio-stress kernel BUG at /build/linux-lts-wily-YKsqw8/linux-lts-wily-4.2.0/fs/btrfs/extent-tree.c:6022! 68:mon> t [c00029a974e0] d6936a68 btrfs_destroy_pinned_extent+0xe8/0x120 [btrfs] [c00029a97530] d693b8e0 btrfs_cleanup_one_transaction+0x3a0/0x7c0 [btrfs] [c00029a97610] d69405a0 cleanup_transaction+0x150/0x3b0 [btrfs] [c00029a976f0] d694184c btrfs_commit_transaction+0x40c/0xd40 [btrfs] [c00029a977f0] d692910c flush_space+0x10c/0x5f0 [btrfs] [c00029a978b0] d6929b78 reserve_metadata_bytes+0x258/0x640 [btrfs] [c00029a97990] d692a7b0 btrfs_delalloc_reserve_metadata+0x190/0x560 [btrfs] [c00029a97a10] d69591d0 __btrfs_buffered_write+0x1a0/0x5c0 [btrfs] [c00029a97af0] d695c9a8 btrfs_file_write_iter+0x208/0x540 [btrfs] [c00029a97b90] c0327dc0 aio_run_iocb+0x330/0x3c0 [c00029a97ce0] c0329664 do_io_submit+0x314/0x670 [c00029a97e30] c0009204 system_call+0x38/0xb4 --- Exception: c01 (System Call) at 3fffa79e0768 SP (3fffa27ce550) is in userspace 68:mon> r R00 = d691f7a4 R16 = 0d0f R01 = c00029a97450 R17 = c00203badee8 R02 = d6a134d0 R18 = 0060 R03 = R19 = R04 = 0041f20d R20 = R05 = 0001 R21 = 0001 R06 = c001e6d9a409 R22 = c00204c00138 R07 = c001e6d9a400 R23 = c15342d8 R08 = 1500 R24 = 0041f20e R09 = 0001 R25 = 0041f20d R10 = 004200a8 R26 = c00204c0 R11 = d69def18 R27 = R12 = c0a717c0 R28 = c0022fc8e698 R13 = c7b1dc00 R29 = R14 = 10004820 R30 = 0041f20d R15 = c00203badec4 R31 = pc = d691f7b4 unpin_extent_range+0xe4/0x320 [btrfs] cfar= d69174b4 block_group_cache_tree_search+0x104/0x180 [btrfs] lr = d691f7a4 unpin_extent_range+0xd4/0x320 [btrfs] msr = 800100029033 cr = 28022842 ctr = c0a717c0 xer = 2001 trap = 700 Partial Output for dump log: [269404.043284] [ cut here ] [269404.043288] WARNING: at /build/linux-lts-wily-YKsqw8/linux-lts-wily-4.2.0/fs/btrfs/extent-tree.c:3581 [269404.043325] CPU: 104 PID: 28091 Comm: aio-stress Not tainted 4.2.0-27-generic #32~14.04.1-Ubuntu [269404.043329] task: c000fbdae590 ti: c00029a94000 task.ti: c00029a94000 [269404.043331] NIP: d6928fec LR: d6928fe8 CTR: c053f720 [269404.043334] REGS: c00029a97310 TRAP: 0700 Not tainted (4.2.0-27-generic) [269404.043336] MSR: 800100029033CR: 28044822 XER: 2000 [269404.043346] CFAR: c0a79ec4 SOFTE: 1 [269404.043407] NIP [d6928fec] btrfs_write_dirty_block_groups+0x30c/0x320 [btrfs] [269404.043425] LR [d6928fe8] btrfs_write_dirty_block_groups+0x308/0x320 [btrfs] [269404.043428] Call Trace: [269404.043441] [c00029a97590] [d6928fe8] btrfs_write_dirty_block_groups+0x308/0x320 [btrfs] (unreliable) [269404.043453] [c00029a97660] [d69db42c] commit_cowonly_roots+0x23c/0x324 [btrfs] [269404.043468] [c00029a976f0] [d6941c0c] btrfs_commit_transaction+0x7cc/0xd40 [btrfs] [269404.043483] [c00029a977f0] [d692910c] flush_space+0x10c/0x5f0 [btrfs] [269404.043497] [c00029a978b0] [d6929b78] reserve_metadata_bytes+0x258/0x640 [btrfs] [269404.043511] [c00029a97990] [d692a7b0] btrfs_delalloc_reserve_metadata+0x190/0x560 [btrfs] [269404.043527] [c00029a97a10] [d69591d0] __btrfs_buffered_write+0x1a0/0x5c0 [btrfs] [269404.043541] [c00029a97af0] [d695c9a8]
[Kernel-packages] [Bug 1658968] Re: ubuntu 16.04.2: crashed at deactivate_slab+0x18c/0x640 when testing dlpar
As this is "for our awareness", marking as incomplete. ** Changed in: linux (Ubuntu) Status: New => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1658968 Title: ubuntu 16.04.2: crashed at deactivate_slab+0x18c/0x640 when testing dlpar Status in linux package in Ubuntu: Incomplete Bug description: Problem Description === When testing cpu, memory and slot DLPAR on roselp4, the system crashed. ---uname output--- Linux roselp4 4.8.0-34-generic #36~16.04.1-Ubuntu SMP Wed Dec 21 18:53:20 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux Machine Type = lpar Stack trace output: [ 3289.065350] Unable to handle kernel paging request for data at address 0xc404565d6a00 [ 3289.065375] Faulting instruction address: 0xc02e6eec [ 3289.065379] Oops: Kernel access of bad area, sig: 11 [#1] [ 3289.065382] SMP NR_CPUS=2048 NUMA pSeries [ 3289.065386] Modules linked in: rpadlpar_io rpaphp dccp_diag dccp tcp_diag udp_diag inet_diag unix_diag af_packet_diag netlink_diag rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE) configfs ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_ib(OE) mlx5_core(OE) mlx4_ib(OE) mlx4_en(OE) ib_sa(OE) ib_mad(OE) ib_core(OE) ib_addr(OE) ib_netlink(OE) mlx4_core(OE) mlx_compat(OE) binfmt_misc pseries_rng vmx_crypto sunrpc knem(OE) autofs4 dm_round_robin btrfs xor raid6_pq lpfc crc32c_vpmsum ipr scsi_transport_fc devlink be2net scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath [last unloaded: mlx4_core] [ 3289.065424] CPU: 82 PID: 40197 Comm: drmgr Tainted: G OE 4.8.0-34-generic #36~16.04.1-Ubuntu [ 3289.065427] task: c0045081ce00 task.stack: c0044d414000 [ 3289.065430] NIP: c02e6eec LR: c02e7718 CTR: c02e7630 [ 3289.065433] REGS: c0044d417470 TRAP: 0300 Tainted: G OE (4.8.0-34-generic) [ 3289.065435] MSR: 80010280b033CR: 24082822 XER: 2000 [ 3289.065446] CFAR: c0008750 DAR: c404565d6a00 DSISR: 4000 SOFTE: 0 GPR00: c02e7718 c0044d4176f0 c14a6600 c0047e01f480 GPR04: 0010 8275 0075 0001 GPR08: 0200 8275 0009 GPR12: 84002828 c7b4e200 GPR16: c0d7a800 GPR20: 1050 c0fd4e6c c003e7933840 c14daae0 GPR24: c138dc48 0001 c0047e00fe80 GPR28: c404565d6a00 c0047e01f480 c004565de700 f1159740 [ 3289.065486] NIP [c02e6eec] deactivate_slab+0x18c/0x640 [ 3289.065489] LR [c02e7718] slab_cpuup_callback+0xe8/0x170 [ 3289.065491] Call Trace: [ 3289.065493] [c0044d4176f0] [c02e715c] deactivate_slab+0x3fc/0x640 (unreliable) [ 3289.065498] [c0044d417810] [c02e7718] slab_cpuup_callback+0xe8/0x170 [ 3289.065502] [c0044d417880] [c00f98c8] notifier_call_chain+0x98/0x110 [ 3289.065506] [c0044d4178d0] [c00ca564] __cpu_notify+0x54/0xa0 [ 3289.065509] [c0044d4178f0] [c00ca77c] cpu_notify_nofail+0x2c/0x40 [ 3289.065512] [c0044d417910] [c00ca7e4] notify_dead+0x54/0x170 [ 3289.065515] [c0044d4179b0] [c00c98c4] cpuhp_invoke_callback+0x84/0x250 [ 3289.065519] [c0044d417a10] [c00c9bfc] cpuhp_down_callbacks+0x8c/0x110 [ 3289.065523] [c0044d417a60] [c024e328] _cpu_down+0x168/0x2b0 [ 3289.065526] [c0044d417ac0] [c00cc068] do_cpu_down+0x68/0xb0 [ 3289.065530] [c0044d417b00] [c0738448] cpu_subsys_offline+0x28/0x40 [ 3289.065534] [c0044d417b20] [c072f9e4] device_offline+0x104/0x140 [ 3289.065538] [c0044d417b60] [c009a7bc] dlpar_cpu_remove+0x24c/0x350 [ 3289.065542] [c0044d417c40] [c009aa50] dlpar_cpu_release+0x70/0xe0 [ 3289.065545] [c0044d417c90] [c0021a04] arch_cpu_release+0x44/0x80 [ 3289.065548] [c0044d417cb0] [c0738c8c] cpu_release_store+0x4c/0x80 [ 3289.065552] [c0044d417ce0] [c072b7b0] dev_attr_store+0x40/0x70 [ 3289.06] [c0044d417d00] [c03e1e1c] sysfs_kf_write+0x6c/0xa0 [ 3289.065559] [c0044d417d20] [c03e0cdc] kernfs_fop_write+0x17c/0x250 [ 3289.065563] [c0044d417d70] [c0322b20] __vfs_write+0x40/0x80 [ 3289.065566] [c0044d417d90] [c0323ec4] vfs_write+0xd4/0x270 [ 3289.065571] [c0044d417de0] [c0325acc]
[Kernel-packages] [Bug 1653489] Re: [LTCTest][OPAL][FW860.20] Upgrade to Ubuntu 16.04.2 Alpha from Ubuntu 16.04.1 is dropping to (initramfs)
** Changed in: initramfs-tools (Ubuntu) Assignee: Taco Screen team (taco-screen-team) => (unassigned) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1653489 Title: [LTCTest][OPAL][FW860.20] Upgrade to Ubuntu 16.04.2 Alpha from Ubuntu 16.04.1 is dropping to (initramfs) Status in initramfs-tools package in Ubuntu: New Status in linux package in Ubuntu: Confirmed Bug description: @kernel-team Please move ipr module from image-extra to image package. ---Problem Description--- Upgrade to Ubuntu 16.04.2 Alpha from Ubuntu 16.04.1 is dropping to (initramfs) Contact Information = pavsu...@in.ibm.com ---uname output--- Linux (none) 4.8.0-27-generic #29~16.04.1-Ubuntu SMP Fri Nov 4 17:24:37 UTC 2016 ppc64le GNU/Linux ---Additional Hardware Info--- root@powerkvm3-lp1:~# lspci :00:00.0 PCI bridge: IBM Device 03dc :01:00.0 RAID bus controller: IBM Obsidian-E PCI-E SCSI controller (rev 01) 0001:00:00.0 PCI bridge: IBM Device 03dc 0001:01:00.0 PCI bridge: PLX Technology, Inc. PEX 8732 32-lane, 8-Port PCI Express Gen 3 (8.0 GT/s) Switch (rev ca) 0001:02:01.0 PCI bridge: PLX Technology, Inc. PEX 8732 32-lane, 8-Port PCI Express Gen 3 (8.0 GT/s) Switch (rev ca) 0001:02:08.0 PCI bridge: PLX Technology, Inc. PEX 8732 32-lane, 8-Port PCI Express Gen 3 (8.0 GT/s) Switch (rev ca) 0001:02:09.0 PCI bridge: PLX Technology, Inc. PEX 8732 32-lane, 8-Port PCI Express Gen 3 (8.0 GT/s) Switch (rev ca) 0001:03:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0001:03:00.1 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0001:03:00.2 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0001:03:00.3 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0001:04:00.0 RAID bus controller: IBM PCI-E IPR SAS Adapter (ASIC) (rev 01) 0001:05:00.0 RAID bus controller: IBM PCI-E IPR SAS Adapter (ASIC) (rev 01) 0004:00:00.0 PCI bridge: IBM Device 03dc 0004:01:00.0 Fibre Channel: Emulex Corporation Lancer-X: LightPulse Fibre Channel Host Adapter (rev 10) 0004:01:00.1 Fibre Channel: Emulex Corporation Lancer-X: LightPulse Fibre Channel Host Adapter (rev 10) 0005:00:00.0 PCI bridge: IBM Device 03dc 0005:01:00.0 PCI bridge: PLX Technology, Inc. Device 8748 (rev ca) 0005:02:01.0 PCI bridge: PLX Technology, Inc. Device 8748 (rev ca) 0005:02:08.0 PCI bridge: PLX Technology, Inc. Device 8748 (rev ca) 0005:02:09.0 PCI bridge: PLX Technology, Inc. Device 8748 (rev ca) 0005:02:10.0 PCI bridge: PLX Technology, Inc. Device 8748 (rev ca) 0005:02:11.0 PCI bridge: PLX Technology, Inc. Device 8748 (rev ca) 0005:03:00.0 USB controller: Texas Instruments TUSB73x0 SuperSpeed USB 3.0 xHCI Host Controller (rev 02) 0005:09:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0005:09:00.1 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0005:09:00.2 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0005:09:00.3 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0005:0f:00.0 Fibre Channel: Emulex Corporation Saturn-X: LightPulse Fibre Channel Host Adapter (rev 03) 0005:0f:00.1 Fibre Channel: Emulex Corporation Saturn-X: LightPulse Fibre Channel Host Adapter (rev 03) 0040:00:00.0 PCI bridge: IBM Device 03dc 0044:00:00.0 PCI bridge: IBM Device 03dc 0044:01:00.0 Ethernet controller: Emulex Corporation OneConnect NIC (Lancer) (rev 10) 0044:01:00.1 Ethernet controller: Emulex Corporation OneConnect NIC (Lancer) (rev 10) 0044:01:00.2 Ethernet controller: Emulex Corporation OneConnect NIC (Lancer) (rev 10) 0044:01:00.3 Ethernet controller: Emulex Corporation OneConnect NIC (Lancer) (rev 10) 0044:01:00.4 Fibre Channel: Emulex Corporation OneConnect FCoE Initiator (Lancer) (rev 10) 0044:01:00.5 Fibre Channel: Emulex Corporation OneConnect FCoE Initiator (Lancer) (rev 10) 0045:00:00.0 PCI bridge: IBM Device 03dc 0045:01:00.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0045:01:00.1 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0045:01:00.2 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) 0045:01:00.3 Ethernet controller: Broadcom Corporation NetXtreme BCM5719 Gigabit Ethernet PCIe (rev 01) Machine Type = P8 ---Debugger--- A debugger is not configured ---Steps to Reproduce--- Install Ubuntu 16.04.1 OS using netboot images. Then upgrade the kernel by Installing the kernel 4.8 on the same. After upgrading the kernel, we are booting to the new 4.8
[Kernel-packages] [Bug 1628520] Re: nvme: Missing patch in Ubuntu-4.4.0-41.61
Marking "root" bug as fix released, as the bug is specifically against the Xenial 4.4 kernel, and that is "Fix Released". ** Changed in: linux (Ubuntu) Assignee: Taco Screen team (taco-screen-team) => (unassigned) ** Changed in: linux (Ubuntu) Status: New => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1628520 Title: nvme: Missing patch in Ubuntu-4.4.0-41.61 Status in linux package in Ubuntu: Fix Released Status in linux source package in Xenial: Fix Released Bug description: == Comment: #0 - Gabriel Krisman Bertazi- 2016-09-27 22:59:44 == Ubuntu-4.4.0-40.60 included my backport of : b00a726a9fd ("NVMe: Don't unmap controller registers on reset") #upstream commit [ 30d6592fce71 on the Ubuntu -proposed branch] But missed the fix up that came later: 81e9a969c441 ("nvme: Call pci_disable_device on the error path.") # 4.4.y tree This means that we may hit an Oops if we need to go into the error path of the nvme probe. Please cherry-pick this fix to your kernel. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1628520/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1632458] Re: [Ubuntu 16.10] - System crashes and gives out call traces when libhugetlbfs test suite is run.
Question to IBM: have you made any progress towards identifying a patch to address this issue? ** Changed in: linux (Ubuntu) Status: New => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1632458 Title: [Ubuntu 16.10] - System crashes and gives out call traces when libhugetlbfs test suite is run. Status in linux package in Ubuntu: Incomplete Bug description: == Comment: #0 - Santhosh G- 2016-09-27 01:55:00 == Issue: Kernel unable to handle page request when heapshrink test case is run from libhugetlbfs suite. Environment: arch - ppc64le ubuntu kvm guest Host related Info: Kernel: - uname -a Linux ltc-haba1 4.8.0-17-generic #19-Ubuntu SMP Sun Sep 25 06:35:40 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux Memory: oot@ltc-haba1:~# free -h totalusedfree shared buff/cache available Mem: 255G 65G187G 22M1.9G 188G Swap: 225G 0B225G Hugepages configured: root@ltc-haba1:~# cat /proc/meminfo | grep -i Huge AnonHugePages: 81920 kB ShmemHugePages:0 kB HugePages_Total:4096 HugePages_Free: 3584 HugePages_Rsvd:0 HugePages_Surp:0 Hugepagesize: 16384 kB Guest Related Info: -- - Kernel: - root@ubuntu:~/libhugetlbfs# uname -a Linux ubuntu 4.8.0-17-generic #19-Ubuntu SMP Sun Sep 25 06:35:40 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux Memory: - root@ubuntu:~/libhugetlbfs# free -h totalusedfree shared buff/cache available Mem: 8.0G133M7.7G 15M132M 7.5G Swap: 3.3G 0B3.3G Hugepages configured: --- root@ubuntu:~/libhugetlbfs# cat /proc/meminfo | grep -i Huge AnonHugePages: 0 kB ShmemHugePages:0 kB HugePages_Total: 256 HugePages_Free: 256 HugePages_Rsvd:0 HugePages_Surp:0 Hugepagesize: 16384 kB Steps to reproduce: 1- Install a ubuntu kvm guest with hugepages memory Backing. 2 - git clone the latest libhugetlbfs from https://github.com/libhugetlbfs/libhugetlbfs.git 3 - configure huge[pages in guest and run make check. xmon is configured in the system . The system gets call traces and enters xmon console: HUGETLB_VERBOSE=1 HUGETLB_MORECORE=yes heap-overflow (16M: 64): [ 281.735713] Unable to handle kernel paging request for data at address 0x42328e38 [ 281.735804] Faulting instruction address: 0xc027b410 cpu 0x1: Vector: 300 (Data Access) at [c001fa8c3730] pc: c027b410: shrink_active_list+0x300/0x4d0 lr: c027b3f4: shrink_active_list+0x2e4/0x4d0 sp: c001fa8c39b0 msr: 80010280b033 dar: 42328e38 dsisr: 4200 current = 0xc001fa8adc00 paca= 0xcfb80900 softe: 0irq_happened: 0x01 pid = 50, comm = kswapd0 Linux version 4.8.0-17-generic (buildd@bos01-ppc64el-025) (gcc version 6.2.0 20160914 (Ubuntu 6.2.0-3ubuntu15) ) #19-Ubuntu SMP Sun Sep 25 06:35:40 UTC 2016 (Ubuntu 4.8.0-17.19-generic 4.8.0-rc7) enter ? for help [c001fa8c3aa0] c027bbdc shrink_node_memcg+0x5fc/0x800 [c001fa8c3bc0] c027bf0c shrink_node+0x12c/0x3f0 [c001fa8c3c80] c027d500 kswapd+0x460/0x990 [c001fa8c3d80] c00fd120 kthread+0x110/0x130 [c001fa8c3e30] c00098f0 ret_from_kernel_thread+0x5c/0x6c xmon logs: 1:mon> e cpu 0x1: Vector: 300 (Data Access) at [c001fa8e7730] pc: c027b410: shrink_active_list+0x300/0x4d0 lr: c027b3f4: shrink_active_list+0x2e4/0x4d0 sp: c001fa8e79b0 msr: 80010280b033 dar: 420c58d0 dsisr: 4200 current = 0xc001fa8a paca= 0xcfb80900 softe: 0irq_happened: 0x01 pid = 50, comm = kswapd0 Linux version 4.8.0-17-generic (buildd@bos01-ppc64el-025) (gcc version 6.2.0 20160914 (Ubuntu 6.2.0-3ubuntu15) ) #19-Ubuntu SMP Sun Sep 25 06:35:40 UTC 2016 (Ubuntu 4.8.0-17.19-generic 4.8.0-rc7) 1:mon> r R00 = c027b3f4 R16 = c001fffcfe00 R01 = c001fa8e79b0 R17 = 010a R02 = c14e5e00 R18 = 420cbdd0 R03 = 0001 R19 = c001fffc6300 R04 = 0005 R20 = c001fa8e79e0 R05 = R21 = c001fe144800 R06 = f03bc9a0 R22 = 0001 R07 = 0001fee3 R23 =
[Kernel-packages] [Bug 1628988] Re: ISST-LTE:Ubuntu1610: UbuntuKVM 16.10 guest crashed after 30 hours of stress testing
Revisiting this bug, has this issue been reproduced, and if so, is there any more information that we can use to attempt to reproduce or debug? ** Changed in: linux (Ubuntu) Status: New => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1628988 Title: ISST-LTE:Ubuntu1610: UbuntuKVM 16.10 guest crashed after 30 hours of stress testing Status in linux package in Ubuntu: Incomplete Bug description: After running Stress tests for 30 hours, Ubuntu16.10 KVM guest crashed and entered xmon. Guest Build: -- 4.8.0-16-generic Tests started on guest: -- BASE: LTP Base tests.. IO: admndisk, aio, fstest tests (on btrfs file system over 6 partitions of 2 disks). TCP: TCP commands: telnet, ssh, rlogin, ping etc.. XMON traces: -- 4:mon> t [c0017ffcf800] c024385c end_page_writeback+0x7c/0x120 [c0017ffcf830] d22d8298 ext4_finish_bio+0x1f0/0x2e0 [ext4] [c0017ffcf910] d22d8928 ext4_end_bio+0x70/0x170 [ext4] [c0017ffcf9a0] c04c96cc bio_endio+0xfc/0x120 [c0017ffcf9d0] c04d5f50 blk_update_request+0xf0/0x4d0 [c0017ffcfa60] c06df2dc scsi_end_request+0x6c/0x260 [c0017ffcfad0] c06e32a4 scsi_io_completion+0x2d4/0x740 [c0017ffcfba0] c06d6714 scsi_finish_command+0x144/0x200 [c0017ffcfc20] c06e25a8 scsi_softirq_done+0x198/0x200 [c0017ffcfca0] c04e2e98 __blk_mq_complete_request_remote+0x38/0x50 [c0017ffcfcd0] c0183e80 flush_smp_call_function_queue+0xd0/0x220 [c0017ffcfd50] c0047aac smp_ipi_demux+0xac/0x110 [c0017ffcfd90] c00738e4 icp_hv_ipi_action+0x64/0xd0 [c0017ffcfe00] c01466d0 __handle_irq_event_percpu+0x90/0x340 [c0017ffcfec0] c01469bc handle_irq_event_percpu+0x3c/0x90 [c0017ffcff00] c014ced4 handle_percpu_irq+0x84/0xd0 [c0017ffcff30] c0145664 generic_handle_irq+0x54/0x80 [c0017ffcff60] c0015b20 __do_irq+0x80/0x230 [c0017ffcff90] c002a2e0 call_do_irq+0x14/0x24 [c0013073b210] c0015d68 do_IRQ+0x98/0x140 [c0013073b260] c00026d8 hardware_interrupt_common+0x158/0x180 --- Exception: 501 (Hardware Interrupt) at c008fe4c plpar_hcall_norets+0x1c/0x28 [link register ] c006c094 __spin_yield+0xa4/0xb0 [c0013073b550] c0017fe28b00 (unreliable) [c0013073b5c0] c0949758 _raw_spin_lock_irqsave+0x128/0x130 [c0013073b600] d17222cc ibmvscsi_queuecommand+0x54/0x4b0 [ibmvscsi] [c0013073b6b0] c06dfc80 scsi_dispatch_cmd+0x140/0x370 [c0013073b730] c06e1ad0 scsi_queue_rq+0x770/0x920 [c0013073b800] c04e62f4 __blk_mq_run_hw_queue+0x2e4/0x570 [c0013073b910] c04e5fc8 blk_mq_run_hw_queue+0xf8/0x140 [c0013073b940] c04e8f90 blk_mq_flush_plug_list+0x160/0x1b0 [c0013073b9c0] c04d7fbc blk_flush_plug_list+0xfc/0x2b0 [c0013073ba30] c04d8708 blk_finish_plug+0x58/0x80 [c0013073ba60] d22d270c ext4_writepages+0x6c4/0xe60 [ext4] [c0013073bbf0] c025ae80 do_writepages+0x60/0xc0 [c0013073bc20] c0246c18 __filemap_fdatawrite_range+0x108/0x190 [c0013073bcc0] c0246f20 filemap_write_and_wait_range+0x70/0xf0 [c0013073bd00] d22c5944 ext4_sync_file+0x24c/0x5a0 [ext4] [c0013073bd60] c0365a28 vfs_fsync_range+0x78/0x130 [c0013073bdb0] c0365b90 do_fsync+0x60/0xb0 [c0013073be00] c0366000 SyS_fsync+0x30/0x50 [c0013073be30] c00095e0 system_call+0x38/0x108 --- Exception: c00 (System Call) at 3fff7b26cc98 SP (3fffc42b5280) is in userspace 4:mon> e cpu 0x4: Vector: 300 (Data Access) at [c0017ffcf520] pc: c025b4ec: test_clear_page_writeback+0x1ec/0x300 lr: c025b4c0: test_clear_page_writeback+0x1c0/0x300 sp: c0017ffcf7a0 msr: 80009033 dar: 2d0 dsisr: 4000 current = 0xc00036f59880 paca= 0xc7b82400 softe: 0irq_happened: 0x09 pid = 1102, comm = create_datafile Linux version 4.8.0-16-generic (buildd@bos01-ppc64el-007) (gcc version 6.2.0 20160914 (Ubuntu 6.2.0-3ubuntu15) ) #17-Ubuntu SMP Thu Sep 22 22:45:44 UTC 2016 (Ubuntu 4.8.0-16.17-generic 4.8.0-rc7) 4:mon> 4:mon> 4:mon> r R00 = c025b4c0 R16 = 0550 R01 = c0017ffcf7a0 R17 = 7fff R02 = c10af400 R18 = R03 = R19 = c00172e35b00 R04 = R20 = c00035f831b0 R05 = ffe0 R21 = 0001 R06 = fffe R22 = 0002 R07 = f800 R23 = 0400 R08 = 00017f18 R24 =
[Kernel-packages] [Bug 1630304] Re: Ubuntu 16.10 KVM: Issue doing hotplug detach to SRIOV VF
Following the comment "Mirroring bug to Launchpad for Canonical's awareness..." marking as "opinion". ** Changed in: linux (Ubuntu) Status: New => Opinion -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1630304 Title: Ubuntu 16.10 KVM: Issue doing hotplug detach to SRIOV VF Status in linux package in Ubuntu: Opinion Bug description: ---Problem Description--- I can not get hotplug attach to work in Ubuntu but if I try to detach a CX4 VF from a guest I am getting some issues: Like in this case: [ 474.393308] vfio-pci 0001:01:00.3: No device request channel registered, blocked until released by user [ 474.393543] pci 0001:01: 0.3: [PE# 006] Removing DMA window #0 [ 474.393553] pci 0001:01: 0.3: [PE# 006] Removing DMA window #1 [ 474.393906] mlx5_core 0001:01:00.3: enabling device ( -> 0002) [ 474.393939] mlx5_core 0001:01:00.3: Using 32-bit DMA via iommu [ 474.400360] pci 0001:01: 0.3: [PE# 006] Setting up window#0 0..7fff pg=1000 [ 474.400380] mlx5_core 0001:01:00.3: firmware version: 12.17.226 [ 474.401341] pci 0001:01: 0.3: [PE# 006] Enabling 64-bit DMA bypass [ 474.402284] EEH: Frozen PE#6 on PHB#1 detected [ 474.402475] EEH: PE location: Slot4, PHB location: N/A [ 474.403699] EEH: This PCI device has failed 1 times in the last hour [ 474.403700] EEH: Notify device drivers to shutdown [ 474.403707] mlx5_core 0001:01:00.3: mlx5_pci_err_detected was called [ 474.403711] mlx5_core 0001:01:00.3: 0001:01:00.3:mlx5_enter_error_state:115:(pid 779): start [ 474.403870] mlx5_core 0001:01:00.3: 0001:01:00.3:mlx5_enter_error_state:120:(pid 779): end One time I saw SSep 13 09:41:32 ltc-fire1 kernel: [70437.943722] vfio-pci 0001:01:00.3: No device request channel registered, blocked until released by user Sep 13 09:41:32 ltc-fire1 kernel: [70437.944076] mlx5_core 0001:01:00.3: enabling device ( -> 0002) Sep 13 09:41:32 ltc-fire1 kernel: [70437.944110] mlx5_core 0001:01:00.3: Using 32-bit DMA via iommu Sep 13 09:41:32 ltc-fire1 kernel: [70437.944145] pci 0001:01: 0.3: [PE# 006] Removing DMA window #0 Sep 13 09:41:32 ltc-fire1 kernel: [70437.944152] pci 0001:01: 0.3: [PE# 006] Removing DMA window #1 Sep 13 09:41:32 ltc-fire1 kernel: [70437.944195] mlx5_core 0001:01:00.3: firmware version: 12.17.226 Sep 13 09:41:32 ltc-fire1 kernel: [70437.944260] Unable to handle kernel paging request for data at address 0x Sep 13 09:41:32 ltc-fire1 kernel: [70437.944533] Faulting instruction address: 0xc05b37e0 Sep 13 09:41:32 ltc-fire1 kernel: [70437.944592] Oops: Kernel access of bad area, sig: 11 [#1] Sep 13 09:41:32 ltc-fire1 kernel: [70437.944636] SMP NR_CPUS=2048 NUMA PowerNV Sep 13 09:41:32 ltc-fire1 kernel: [70437.944851] Modules linked in: vfio_pci irqbypass vfio_iommu_spapr_tce vfio_virqfd vfio vfio_spapr_eeh xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp kvm_hv kvm_pr kvm ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE) configfs ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_ib(OE) mlx5_core(OE) mlx4_ib(OE) ib_sa(OE) ib_mad(OE) ib_core(OE) mlx4_en(OE) ib_addr(OE) ib_netlink(OE) mlx4_core(OE) mlx_compat(OE) bridge stp llc joydev input_leds mac_hid ofpart at24 cmdlinepart powernv_flash ipmi_powernv nvmem_core uio_pdrv_genirq opal_prd mtd ipmi_msghandler uio ibmpowernv powernv_rng binfmt_misc dm_multipath knem(OE) ip_tables x_tables autofs4 hid_generic usbhid hid uas usb_storage ast i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sys imgblt fb_sys_fops drm ahci devlink libahci [last unloaded: mlx4_core] Sep 13 09:41:32 ltc-fire1 kernel: [70437.946007] CPU: 40 PID: 12501 Comm: libvirtd Tainted: G OE 4.7.0unofficial #5 Sep 13 09:41:32 ltc-fire1 kernel: [70437.946074] task: c00ec319a200 ti: c00ec324c000 task.ti: c00ec324c000 Sep 13 09:41:32 ltc-fire1 kernel: [70437.946140] NIP: c05b37e0 LR: c05ad070 CTR: Sep 13 09:41:32 ltc-fire1 kernel: [70437.946208] REGS: c00ec324f100 TRAP: 0300 Tainted: G OE(4.7.0unofficial) Sep 13 09:41:32 ltc-fire1 kernel: [70437.946286] MSR: 90010280b033CR: 84028844 XER: 2000 Sep 13 09:41:32 ltc-fire1 kernel: [70437.946533] CFAR: c0008468 DAR: DSISR: 4000 SOFTE: 0 Sep 13 09:41:32 ltc-fire1 kernel: [70437.946533] GPR00: c05d19c8 c00ec324f380 c13bef00 Sep 13 09:41:32 ltc-fire1 kernel: [70437.946533] GPR04: Sep 13 09:41:32 ltc-fire1 kernel:
[Kernel-packages] [Bug 1633223] Re: rcu_sched detected stalls with kernel 3.19.0-58, NVIDIA driver, and docker
Revisiting this bug, is this issue still persisting? ** Changed in: linux (Ubuntu) Status: New => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1633223 Title: rcu_sched detected stalls with kernel 3.19.0-58, NVIDIA driver, and docker Status in linux package in Ubuntu: Incomplete Bug description: ---Problem Description--- Seeing occasional rcu_sched detected stalls on 14.04 LTS with kernel 3.19.0-58. The system is running docker containers, and has the NVIDIA GPU driver loaded. We've seen about 4 stalls in the last month, all with the 3.19.0-58 kernel, and with the NVIDIA 352.93 and 361.49 drivers. ---uname output--- Linux dldev1 3.19.0-58-generic #64~14.04.1-Ubuntu SMP Fri Mar 18 19:05:01 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux ---Additional Hardware Info--- 2 x NVIDIA K80 GPU adapter: $ lspci | grep NV 0002:03:00.0 3D controller: NVIDIA Corporation GK210GL [Tesla K80] (rev a1) 0002:04:00.0 3D controller: NVIDIA Corporation GK210GL [Tesla K80] (rev a1) 0006:03:00.0 3D controller: NVIDIA Corporation GK210GL [Tesla K80] (rev a1) 0006:04:00.0 3D controller: NVIDIA Corporation GK210GL [Tesla K80] (rev a1) Machine Type = 8247-42L ---System Hang--- Usual symptom is that the system is unresponsive except maybe for ping and writing the stall-detection messages to the console. Login/getty isn't available either via ssh nor on the console. System must be power cycled to recover. Attached is the kernel log from a stall detection on May 18th. The detection first occurs at: May 18 15:17:55. The system is later rebooted and those messages indicate the kernel (3.19.0-58) and NVIDIA driver version (352.93) that were active at the time. We've suffered 3 or 4 stalls since, all with the same kernel, but some with a newer NVIDIA driver (361.49). Unfortunately, information about the newer stalls wasn't preserved in the various log files (and we're not capturing the console constantly), so we don't have detailed data for those. We'd welcome any suggestions for how to collect additional data for these occurrences. I can't say for sure that we haven't seen the stalls on other systems, but they're occuring fairly frequently on this system, and it's unusual in that it's running both Docker and NVIDIA GPU driver. So maybe aufs or the NVIDIA driver are somehow involved. From the kern.log, The Call trace points to some kind of deadlock in aufs - May 18 15:17:55 dldev1 kernel: [713670.798624] Task dump for CPU 3: May 18 15:17:55 dldev1 kernel: [713670.798628] cc1 R running task0 99183 99173 0x00040004 May 18 15:17:55 dldev1 kernel: [713670.798633] Call Trace: May 18 15:17:55 dldev1 kernel: [713670.798643] [c00fa64673a0] [c00cf004] wake_up_worker+0x44/0x60 (unreliable) May 18 15:17:55 dldev1 kernel: [713670.798671] [c00fa6467570] [c00fa64675d0] 0xc00fa64675d0 May 18 15:17:55 dldev1 kernel: [713670.798676] [c00fa64675d0] [c0a1b050] __schedule+0x370/0x900 May 18 15:17:55 dldev1 kernel: [713670.798679] [c00fa64677f0] [c00fa6467850] 0xc00fa6467850 May 18 15:17:55 dldev1 kernel: [713670.798682] Task dump for CPU 75: May 18 15:17:55 dldev1 kernel: [713670.798684] cc1 D 105d9410 0 99427 99405 0x00040004 May 18 15:17:55 dldev1 kernel: [713670.798688] Call Trace: May 18 15:17:55 dldev1 kernel: [713670.798691] [c017efdd3460] [c017efdd34a0] 0xc017efdd34a0 (unreliable) May 18 15:17:55 dldev1 kernel: [713670.798695] [c017efdd3630] [c017efdd3690] 0xc017efdd3690 May 18 15:17:55 dldev1 kernel: [713670.798698] [c017efdd3690] [c0a1b050] __schedule+0x370/0x900 May 18 15:17:55 dldev1 kernel: [713670.798702] [c017efdd38b0] [c0a1f128] rwsem_down_write_failed+0x288/0x400 May 18 15:17:55 dldev1 kernel: [713670.798706] [c017efdd3940] [c0a1e538] down_write+0x88/0x90 May 18 15:17:55 dldev1 kernel: [713670.798716] [c017efdd3970] [d0001ead562c] do_ii_write_lock+0x8c/0xd0 [aufs] May 18 15:17:55 dldev1 kernel: [713670.798724] [c017efdd39a0] [d0001eac0e98] aufs_read_lock+0xb8/0xd0 [aufs] May 18 15:17:55 dldev1 kernel: [713670.798733] [c017efdd39e0] [d0001ead8208] aufs_d_revalidate+0x98/0x7a0 [aufs] May 18 15:17:55 dldev1 kernel: [713670.798737] [c017efdd3aa0] [c02c88f8] lookup_fast+0x368/0x3b0 May 18 15:17:55 dldev1 kernel: [713670.798740] [c017efdd3b10] [c02cb620] path_lookupat+0x180/0x970 May 18 15:17:55 dldev1 kernel: [713670.798743] [c017efdd3be0] [c02cbe68] filename_lookup+0x58/0x140 May 18 15:17:55 dldev1 kernel: [713670.798746] [c017efdd3c30] [c02cde04] user_path_at_empty+0x84/0xe0 May 18 15:17:55 dldev1 kernel: [713670.798749]
[Kernel-packages] [Bug 1710922] Re: Ubuntu 17.10 ppc64el guest with MEMORY_HOTPLUG_DEFAULT_ONLINE=y gets a "kernel BUG at mm/memory_hotplug.c:2185" when hotplugging LMBs with QEMU upstream
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1710922 Title: Ubuntu 17.10 ppc64el guest with MEMORY_HOTPLUG_DEFAULT_ONLINE=y gets a "kernel BUG at mm/memory_hotplug.c:2185" when hotplugging LMBs with QEMU upstream Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Daniel Henrique Barboza- 2017-08-08 09:13:31 == - Host information: Ubuntu 16.10 running upstream QEMU $ uname -a Linux louis 4.10.0-20-generic #22-Ubuntu SMP Thu Apr 20 09:22:16 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux $ cat /proc/cpuinfo processor : 0 cpu : POWER8E (raw), altivec supported clock : 2061.00MHz revision : 2.1 (pvr 004b 0201) (...) timebase : 51200 platform : PowerNV model : 8247-42L machine : PowerNV 8247-42L firmware : OPAL - qemu command line that launched the Ubuntu 17.10 ppc64el guest: sudo ./qemu-system-ppc64 -name migrate_qemu -boot strict=on --enable- kvm -device nec-usb-xhci,id=usb,bus=pci.0,addr=0xf -device spapr- vscsi,id=scsi0,reg=0x2000 -smp 1,maxcpus=4,sockets=4,cores=1,threads=1 --machine pseries,accel=kvm,usb=off,dump-guest-core=off -m 4G,slots=32,maxmem=32G -drive file=/home/danielhb/vm_imgs/ub1710.qcow2,format=qcow2,if=none,id =drive-virtio-disk0,cache=none -device virtio-blk- pci,scsi=off,bus=pci.0,addr=0x2,drive=drive-virtio-disk0,id=virtio- disk0,bootindex=1 -nographic - guest information: Ubuntu 17.10 ppc64el: root@ubuntu1710:~# uname -a Linux ubuntu1710 4.11.0-10-generic #15-Ubuntu SMP Thu Jun 29 15:02:54 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux root@ubuntu1710:~# - Problem: hotplugging a LMB generates a guest kernel Oops: root@ubuntu1710:~# QEMU 2.9.90 monitor - type 'help' for more information (qemu) (qemu) object_add memory-backend-ram,id=ram1,size=1G (qemu) device_add pc-dimm,id=dimm1,memdev=ram1 (qemu) [ 126.850952] kernel BUG at /build/linux-S1V_3d/linux-4.11.0/mm/memory_hotplug.c:2185! [ 126.851285] Oops: Exception in kernel mode, sig: 5 [#1] [ 126.851428] SMP NR_CPUS=2048 [ 126.851428] NUMA [ 126.851546] pSeries [ 126.851714] Modules linked in: vmx_crypto ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ibmvscsi crc32c_vpmsum virtio_blk [ 126.852447] CPU: 0 PID: 5 Comm: kworker/u8:0 Not tainted 4.11.0-10-generic #15-Ubuntu [ 126.852656] Workqueue: pseries hotplug workque pseries_hp_work_fn [ 126.852828] task: c000fea8 task.stack: c000fe118000 [ 126.853000] NIP: c0350268 LR: c03501e0 CTR: [ 126.853190] REGS: c000fe11b780 TRAP: 0700 Not tainted (4.11.0-10-generic) [ 126.853390] MSR: 8282b033 [ 126.853396] CR: 42002422 XER: 2000 [ 126.853672] CFAR: c03501e4 SOFTE: 1 [ 126.853672] GPR00: c03501e0 c000fe11ba00 c149eb00 0001 [ 126.853672] GPR04: c000f9901480 c000ffe21c00 003e 0003 [ 126.853672] GPR08: 0002 0003 0003 303078302d303030 [ 126.853672] GPR12: 2200 cfb8 c0110008 c000fe1810c0 [ 126.853672] GPR16: c000fe050ea8 0010 c000fc30 c000fea0 [ 126.853672] GPR20: c000f951a1a4 0004 0001 0010 [ 126.853672] GPR24: 0001 c000f951a1a0 0004 [ 126.853672] GPR28: 0001 1000 00014000 [ 126.855221] NIP [c0350268] remove_memory+0xf8/0x100 [ 126.855338] LR [c03501e0] remove_memory+0x70/0x100 [ 126.855453] Call Trace: [ 126.855520] [c000fe11ba00] [c03501e0] remove_memory+0x70/0x100 (unreliable) [ 126.855684] [c000fe11ba40] [c00b0880] dlpar_add_lmb+0x370/0x3f0 [ 126.855822] [c000fe11bb20] [c00b174c] dlpar_memory+0x7cc/0xd20 [ 126.855959] [c000fe11bbf0] [c00a9af8] handle_dlpar_errorlog+0xa8/0x170 [ 126.856118] [c000fe11bc60] [c00a9c54] pseries_hp_work_fn+0x94/0xa0 [ 126.856275] [c000fe11bc90] [c01071d0] process_one_work+0x2b0/0x5a0 [ 126.856430] [c000fe11bd20]
[Kernel-packages] [Bug 1710690] Re: Ubuntu16.04.3: System running network stress crashes with Alignment exception
** Changed in: ubuntu-power-systems Assignee: Canonical Server Team (canonical-server) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1710690 Title: Ubuntu16.04.3: System running network stress crashes with Alignment exception Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: State: Open by: nguyenp on 11 August 2017 11:03:32 Contact: === Paul Nguyen nguy...@us.ibm.com BMC: bos1u1 Firmware Revision : 00.25 Firmware Build Time : 20170807BMC MAC address : 0c:c4:7a:f4:4d:60 PNOR Build Time : 20170729 CPLD Version : B2.91.00 Ubuntu 16.04.3: === bos1u1p1 ver 1.5.4.5 - OS, HTX, Firmware and Machine details OS: GNU/Linux OS Version: Ubuntu 16.04.3 LTS \n \l Kernel Version: 4.11.0-12-generic HTX Version: htxubuntu-448 Host Name: bos1u1p1 Machine Serial No: C819UAF32B2 Machine Type/Model: 9006-12C root@bos1u1p1:~# dpkg -l |grep mlx ii libmlx4-1 41mlnx1-OFED.4.1.0.1.0.41014 ppc64el Userspace driver for Mellanox ConnectX InfiniBand HCAs ii libmlx4-1-dbg 41mlnx1-OFED.4.1.0.1.0.41014 ppc64el Debugging symbols for the libmlx4 driver ii libmlx4-dev41mlnx1-OFED.4.1.0.1.0.41014 ppc64el Development files for the libmlx4 driver ii libmlx5-1 41mlnx1-OFED.4.1.0.1.3.0.1.41014 ppc64el Userspace driver for Mellanox ConnectX InfiniBand HCAs ii libmlx5-1-dbg 41mlnx1-OFED.4.1.0.1.3.0.1.41014 ppc64el Debugging symbols for the libmlx5 driver ii libmlx5-dev41mlnx1-OFED.4.1.0.1.3.0.1.41014 ppc64el Development files for the libmlx5 driver root@bos1u1p1:~# lsscsi [0:2:0:0]diskSEAGATE ST4000NM0034 E005 /dev/sda [0:3:123:0] enclosu ADAPTEC Smart Adapter2.99 - root@bos1u1p1:~# lspci :00:00.0 PCI bridge: IBM Device 04c1 0001:00:00.0 PCI bridge: IBM Device 04c1 0002:00:00.0 PCI bridge: IBM Device 04c1 0002:01:00.0 Ethernet controller: Intel Corporation Ethernet Controller X710/X557-AT 10GBASE-T (rev 02) 0002:01:00.1 Ethernet controller: Intel Corporation Ethernet Controller X710/X557-AT 10GBASE-T (rev 02) 0002:01:00.2 Ethernet controller: Intel Corporation Ethernet Controller X710/X557-AT 10GBASE-T (rev 02) 0002:01:00.3 Ethernet controller: Intel Corporation Ethernet Controller X710/X557-AT 10GBASE-T (rev 02) 0003:00:00.0 PCI bridge: IBM Device 04c1 0003:01:00.0 Serial Attached SCSI controller: Adaptec Series 8 12G SAS/PCIe 3 (rev 01) 0004:00:00.0 PCI bridge: IBM Device 04c1 0004:01:00.0 PCI bridge: ASPEED Technology, Inc. AST1150 PCI-to-PCI Bridge (rev 04) 0004:02:00.0 VGA compatible controller: ASPEED Technology, Inc. ASPEED Graphics Family (rev 41) 0005:00:00.0 PCI bridge: IBM Device 04c1 0005:01:00.0 USB controller: Texas Instruments TUSB73x0 SuperSpeed USB 3.0 xHCI Host Controller (rev 02) 0030:00:00.0 PCI bridge: IBM Device 04c1 0030:01:00.0 Infiniband controller: Mellanox Technologies Device 1019 0030:01:00.1 Infiniband controller: Mellanox Technologies Device 1019 0031:00:00.0 PCI bridge: IBM Device 04c1 0031:01:00.0 Ethernet controller: Intel Corporation 82599ES 10-Gigabit SFI/SFP+ Network Connection (rev 01) 0031:01:00.1 Ethernet controller: Intel Corporation 82599ES 10-Gigabit SFI/SFP+ Network Connection (rev 01) 0032:00:00.0 PCI bridge: IBM Device 04c1 0033:00:00.0 PCI bridge: IBM Device 04c1 0033:01:00.0 Ethernet controller: Intel Corporation 82575EB Gigabit Network Connection (rev 02) 0033:01:00.1 Ethernet controller: Intel Corporation 82575EB Gigabit Network Connection (rev 02) root@bos1u1p1:~# ifconfig -a enP2p1s0f0 Link encap:Ethernet HWaddr ac:1f:6b:09:c0:9e inet addr:9.3.20.217 Bcast:9.3.21.255 Mask:255.255.254.0 inet6 addr: fe80::ae1f:6bff:fe09:c09e/64 Scope:Link UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:62603 errors:0 dropped:0 overruns:0 frame:0 TX packets:105 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:4784741 (4.7 MB) TX bytes:14043 (14.0 KB) enP2p1s0f1 Link encap:Ethernet HWaddr ac:1f:6b:09:c0:9f BROADCAST MULTICAST MTU:1500 Metric:1 RX packets:0 errors:0 dropped:0 overruns:0 frame:0 TX packets:0 errors:0 dropped:0 overruns:0 carrier:0 collisions:0
[Kernel-packages] [Bug 1709784] Re: KVM on 16.04.3 throws an error
** Also affects: qemu Importance: Undecided Status: New ** Also affects: qemu (Ubuntu) Importance: Undecided Status: New ** No longer affects: qemu ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Server Team (canonical-server) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709784 Title: KVM on 16.04.3 throws an error Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Status in qemu package in Ubuntu: New Bug description: Problem Description KVM on Ubuntu 16.04.3 throws an error when used ---uname output--- Linux bastion-1 4.4.0-89-generic #112-Ubuntu SMP Mon Jul 31 19:37:08 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = 8348-21C Habanero ---Steps to Reproduce--- Install 16.04.3 install KVM like: apt-get install libvirt-bin qemu qemu-slof qemu-system qemu-utils then exit and log back in so virsh will work without sudo then run my spawn script $ cat spawn.sh #!/bin/bash img=$1 qemu-system-ppc64 \ -machine pseries,accel=kvm,usb=off -cpu host -m 512 \ -display none -nographic \ -net nic -net user \ -drive "file=$img" with a freshly downloaded ubuntu cloud image sudo ./spawn.sh xenial-server-cloudimg-ppc64el-disk1.img And I get nothing on the output. and errors in dmesg ubuntu@bastion-1:~$ [ 340.180295] Facility 'TM' unavailable, exception at 0xd000148b7f10, MSR=90009033 [ 340.180399] Oops: Unexpected facility unavailable exception, sig: 6 [#1] [ 340.180513] SMP NR_CPUS=2048 NUMA PowerNV [ 340.180547] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables kvm_hv kvm binfmt_misc joydev input_leds mac_hid opal_prd ofpart cmdlinepart powernv_flash ipmi_powernv ipmi_msghandler mtd at24 uio_pdrv_genirq uio ibmpowernv powernv_rng vmx_crypto ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 multipath linear mlx4_en hid_generic usbhid hid uas usb_storage ast i2c_algo_bit bnx2x ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops mlx4_core drm ahci vxlan libahci ip6_udp_tunnel udp_tunnel mdio libcrc32c [ 340.181331] CPU: 46 PID: 5252 Comm: qemu-system-ppc Not tainted 4.4.0-89-generic #112-Ubuntu [ 340.181382] task: c01e34c30b50 ti: c01e34ce4000 task.ti: c01e34ce4000 [ 340.181432] NIP: d000148b7f10 LR: d00014822a14 CTR: d000148b7e40 [ 340.181475] REGS: c01e34ce77b0 TRAP: 0f60 Not tainted (4.4.0-89-generic) [ 340.181519] MSR: 90009033CR: 22024848 XER: [ 340.181629] CFAR: d000148b7ea4 SOFTE: 1 GPR00: d00014822a14 c01e34ce7a30 d000148cc018 c01e37bc GPR04: c01db9ac c01e34ce7bc0 GPR08: 0001 c01e34c30b50 0001 d000148278f8 GPR12: d000148b7e40 cfb5b500 001f GPR16: 3fff91c3 0080 3fffa8e34390 3fff9242f200 GPR20: 3fff92430010 01001de5c030 3fff9242eb60 100c1ff0 GPR24: 3fffc91fe990 3fff91c10028 c01e37bc GPR28: c01db9ac c01e37bc c01db9ac [ 340.182315] NIP [d000148b7f10] kvmppc_vcpu_run_hv+0xd0/0xff0 [kvm_hv] [ 340.182357] LR [d00014822a14] kvmppc_vcpu_run+0x44/0x60 [kvm] [ 340.182394] Call Trace: [ 340.182413] [c01e34ce7a30] [c01e34ce7ab0] 0xc01e34ce7ab0 (unreliable) [ 340.182468] [c01e34ce7b70] [d00014822a14] kvmppc_vcpu_run+0x44/0x60 [kvm] [ 340.182522] [c01e34ce7ba0] [d0001481f674] kvm_arch_vcpu_ioctl_run+0x64/0x170 [kvm] [ 340.182581] [c01e34ce7be0] [d00014813918] kvm_vcpu_ioctl+0x528/0x7b0 [kvm] [ 340.182634] [c01e34ce7d40] [c02fffa0] do_vfs_ioctl+0x480/0x7d0 [ 340.182678] [c01e34ce7de0] [c03003c4] SyS_ioctl+0xd4/0xf0 [ 340.182723] [c01e34ce7e30] [c0009204] system_call+0x38/0xb4 [ 340.182766] Instruction dump: [ 340.182788] e92d02a0 e9290a50 e9290108 792a07e3 41820058 e92d02a0 e9290a50 e9290108 [ 340.182863] 7927e8a4 78e71f87 40820ed8 e92d02a0 <7d4022a6> f9490ee8 e92d02a0 7d4122a6 [ 340.182938] ---[ end trace bc5080cb7d18f102 ]--- [ 340.276202] This was with the latest ubuntu cloud image. I get the same thing when trying to
[Kernel-packages] [Bug 1709738] Re: xfstest sanity checks 17 fails on data-hole-data inside page
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => Medium ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709738 Title: xfstest sanity checks 17 fails on data-hole-data inside page Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: Problem Description xfstests fails with Metadata corruption at leaf on ext4 filesystem Environment -- Kernel Build: 4.12.1-041201-generic System Name : ltc-test-ci2 Model : 8247-22L Platform: PowerNV ( P8 ) Issue observed in P9 also. Uname output --- # uname -a Linux ltc-test-ci2 4.12.1-041201-generic #201707121132 SMP Wed Jul 12 17:03:25 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Create a loop device with ext4 filesystem 2. git clone git://git.kernel.org/pub/scm/fs/xfs/xfstests-dev.git; cd xfstests-dev 3. make 4. Create a local.config for running with created loop device 5. Run xfstests-dev test : ./check tests/ext4/445 generic/445 [failed, exit status 1] - output mismatch (see /root/harish/xfstests-dev/results//generic/445.out.bad) --- tests/generic/445.out 2017-07-13 06:04:36.244322946 -0400 +++ /root/harish/xfstests-dev/results//generic/445.out.bad 2017-07-14 02:49:06.540352923 -0400 @@ -1,2 +1,3 @@ QA output created by 445 -Silence is golden +seek sanity check failed! +(see /root/harish/xfstests-dev/results//generic/445.full for details) ... (Run 'diff -u tests/generic/445.out /root/harish/xfstests-dev/results//generic/445.out.bad' to see the entire diff) Nothing observed in dmesg. Full log is attached. Note: Issue is also observed on distro kernel - 4.11.0-10-generic. It needs to have two disks you run the test on. Make sure you create those directories before running the test. # cat local.config export TEST_DEV=/dev/loop0 export TEST_DIR=/mnt/test export SCRATCH_DEV=/dev/loop1 export SCRATCH_MNT=/mnt/scratch To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709738/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1709784] Re: KVM on 16.04.3 throws an error
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => Critical -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709784 Title: KVM on 16.04.3 throws an error Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Status in qemu package in Ubuntu: New Bug description: Problem Description KVM on Ubuntu 16.04.3 throws an error when used ---uname output--- Linux bastion-1 4.4.0-89-generic #112-Ubuntu SMP Mon Jul 31 19:37:08 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = 8348-21C Habanero ---Steps to Reproduce--- Install 16.04.3 install KVM like: apt-get install libvirt-bin qemu qemu-slof qemu-system qemu-utils then exit and log back in so virsh will work without sudo then run my spawn script $ cat spawn.sh #!/bin/bash img=$1 qemu-system-ppc64 \ -machine pseries,accel=kvm,usb=off -cpu host -m 512 \ -display none -nographic \ -net nic -net user \ -drive "file=$img" with a freshly downloaded ubuntu cloud image sudo ./spawn.sh xenial-server-cloudimg-ppc64el-disk1.img And I get nothing on the output. and errors in dmesg ubuntu@bastion-1:~$ [ 340.180295] Facility 'TM' unavailable, exception at 0xd000148b7f10, MSR=90009033 [ 340.180399] Oops: Unexpected facility unavailable exception, sig: 6 [#1] [ 340.180513] SMP NR_CPUS=2048 NUMA PowerNV [ 340.180547] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables kvm_hv kvm binfmt_misc joydev input_leds mac_hid opal_prd ofpart cmdlinepart powernv_flash ipmi_powernv ipmi_msghandler mtd at24 uio_pdrv_genirq uio ibmpowernv powernv_rng vmx_crypto ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 multipath linear mlx4_en hid_generic usbhid hid uas usb_storage ast i2c_algo_bit bnx2x ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops mlx4_core drm ahci vxlan libahci ip6_udp_tunnel udp_tunnel mdio libcrc32c [ 340.181331] CPU: 46 PID: 5252 Comm: qemu-system-ppc Not tainted 4.4.0-89-generic #112-Ubuntu [ 340.181382] task: c01e34c30b50 ti: c01e34ce4000 task.ti: c01e34ce4000 [ 340.181432] NIP: d000148b7f10 LR: d00014822a14 CTR: d000148b7e40 [ 340.181475] REGS: c01e34ce77b0 TRAP: 0f60 Not tainted (4.4.0-89-generic) [ 340.181519] MSR: 90009033CR: 22024848 XER: [ 340.181629] CFAR: d000148b7ea4 SOFTE: 1 GPR00: d00014822a14 c01e34ce7a30 d000148cc018 c01e37bc GPR04: c01db9ac c01e34ce7bc0 GPR08: 0001 c01e34c30b50 0001 d000148278f8 GPR12: d000148b7e40 cfb5b500 001f GPR16: 3fff91c3 0080 3fffa8e34390 3fff9242f200 GPR20: 3fff92430010 01001de5c030 3fff9242eb60 100c1ff0 GPR24: 3fffc91fe990 3fff91c10028 c01e37bc GPR28: c01db9ac c01e37bc c01db9ac [ 340.182315] NIP [d000148b7f10] kvmppc_vcpu_run_hv+0xd0/0xff0 [kvm_hv] [ 340.182357] LR [d00014822a14] kvmppc_vcpu_run+0x44/0x60 [kvm] [ 340.182394] Call Trace: [ 340.182413] [c01e34ce7a30] [c01e34ce7ab0] 0xc01e34ce7ab0 (unreliable) [ 340.182468] [c01e34ce7b70] [d00014822a14] kvmppc_vcpu_run+0x44/0x60 [kvm] [ 340.182522] [c01e34ce7ba0] [d0001481f674] kvm_arch_vcpu_ioctl_run+0x64/0x170 [kvm] [ 340.182581] [c01e34ce7be0] [d00014813918] kvm_vcpu_ioctl+0x528/0x7b0 [kvm] [ 340.182634] [c01e34ce7d40] [c02fffa0] do_vfs_ioctl+0x480/0x7d0 [ 340.182678] [c01e34ce7de0] [c03003c4] SyS_ioctl+0xd4/0xf0 [ 340.182723] [c01e34ce7e30] [c0009204] system_call+0x38/0xb4 [ 340.182766] Instruction dump: [ 340.182788] e92d02a0 e9290a50 e9290108 792a07e3 41820058 e92d02a0 e9290a50 e9290108 [ 340.182863] 7927e8a4 78e71f87 40820ed8 e92d02a0 <7d4022a6> f9490ee8 e92d02a0 7d4122a6 [ 340.182938] ---[ end trace bc5080cb7d18f102 ]--- [ 340.276202] This was with the latest ubuntu cloud image. I get the same thing when trying to use virt-install with an ISO image. I have no way of loading a KVM on 16.04.3 == Comment: #2 - Jason M. Furmanek
[Kernel-packages] [Bug 1709784] Re: KVM on 16.04.3 throws an error
Thanks Christian. Reassigning to kernel team. ** Changed in: ubuntu-power-systems Assignee: Canonical Server Team (canonical-server) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709784 Title: KVM on 16.04.3 throws an error Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Status in qemu package in Ubuntu: Won't Fix Bug description: Problem Description KVM on Ubuntu 16.04.3 throws an error when used ---uname output--- Linux bastion-1 4.4.0-89-generic #112-Ubuntu SMP Mon Jul 31 19:37:08 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = 8348-21C Habanero ---Steps to Reproduce--- Install 16.04.3 install KVM like: apt-get install libvirt-bin qemu qemu-slof qemu-system qemu-utils then exit and log back in so virsh will work without sudo then run my spawn script $ cat spawn.sh #!/bin/bash img=$1 qemu-system-ppc64 \ -machine pseries,accel=kvm,usb=off -cpu host -m 512 \ -display none -nographic \ -net nic -net user \ -drive "file=$img" with a freshly downloaded ubuntu cloud image sudo ./spawn.sh xenial-server-cloudimg-ppc64el-disk1.img And I get nothing on the output. and errors in dmesg ubuntu@bastion-1:~$ [ 340.180295] Facility 'TM' unavailable, exception at 0xd000148b7f10, MSR=90009033 [ 340.180399] Oops: Unexpected facility unavailable exception, sig: 6 [#1] [ 340.180513] SMP NR_CPUS=2048 NUMA PowerNV [ 340.180547] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables kvm_hv kvm binfmt_misc joydev input_leds mac_hid opal_prd ofpart cmdlinepart powernv_flash ipmi_powernv ipmi_msghandler mtd at24 uio_pdrv_genirq uio ibmpowernv powernv_rng vmx_crypto ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 multipath linear mlx4_en hid_generic usbhid hid uas usb_storage ast i2c_algo_bit bnx2x ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops mlx4_core drm ahci vxlan libahci ip6_udp_tunnel udp_tunnel mdio libcrc32c [ 340.181331] CPU: 46 PID: 5252 Comm: qemu-system-ppc Not tainted 4.4.0-89-generic #112-Ubuntu [ 340.181382] task: c01e34c30b50 ti: c01e34ce4000 task.ti: c01e34ce4000 [ 340.181432] NIP: d000148b7f10 LR: d00014822a14 CTR: d000148b7e40 [ 340.181475] REGS: c01e34ce77b0 TRAP: 0f60 Not tainted (4.4.0-89-generic) [ 340.181519] MSR: 90009033CR: 22024848 XER: [ 340.181629] CFAR: d000148b7ea4 SOFTE: 1 GPR00: d00014822a14 c01e34ce7a30 d000148cc018 c01e37bc GPR04: c01db9ac c01e34ce7bc0 GPR08: 0001 c01e34c30b50 0001 d000148278f8 GPR12: d000148b7e40 cfb5b500 001f GPR16: 3fff91c3 0080 3fffa8e34390 3fff9242f200 GPR20: 3fff92430010 01001de5c030 3fff9242eb60 100c1ff0 GPR24: 3fffc91fe990 3fff91c10028 c01e37bc GPR28: c01db9ac c01e37bc c01db9ac [ 340.182315] NIP [d000148b7f10] kvmppc_vcpu_run_hv+0xd0/0xff0 [kvm_hv] [ 340.182357] LR [d00014822a14] kvmppc_vcpu_run+0x44/0x60 [kvm] [ 340.182394] Call Trace: [ 340.182413] [c01e34ce7a30] [c01e34ce7ab0] 0xc01e34ce7ab0 (unreliable) [ 340.182468] [c01e34ce7b70] [d00014822a14] kvmppc_vcpu_run+0x44/0x60 [kvm] [ 340.182522] [c01e34ce7ba0] [d0001481f674] kvm_arch_vcpu_ioctl_run+0x64/0x170 [kvm] [ 340.182581] [c01e34ce7be0] [d00014813918] kvm_vcpu_ioctl+0x528/0x7b0 [kvm] [ 340.182634] [c01e34ce7d40] [c02fffa0] do_vfs_ioctl+0x480/0x7d0 [ 340.182678] [c01e34ce7de0] [c03003c4] SyS_ioctl+0xd4/0xf0 [ 340.182723] [c01e34ce7e30] [c0009204] system_call+0x38/0xb4 [ 340.182766] Instruction dump: [ 340.182788] e92d02a0 e9290a50 e9290108 792a07e3 41820058 e92d02a0 e9290a50 e9290108 [ 340.182863] 7927e8a4 78e71f87 40820ed8 e92d02a0 <7d4022a6> f9490ee8 e92d02a0 7d4122a6 [ 340.182938] ---[ end trace bc5080cb7d18f102 ]--- [ 340.276202] This was with the latest ubuntu cloud image. I get the same thing when trying to use virt-install with an ISO image. I have no way of loading a KVM on 16.04.3
[Kernel-packages] [Bug 1708399] Re: kernel panic -not syncing: Fatal exception: panic_on_oops
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => Critical ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1708399 Title: kernel panic -not syncing: Fatal exception: panic_on_oops Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - QI YE- 2017-08-02 04:11:25 == ---Problem Description--- Ubuntu got kernel panic ---uname output--- #110-Ubuntu SMP Tue Jul 18 12:56:43 UTC 2017 s390x s390x s390x GNU/Linux ---Debugger Data--- PID: 10991 TASK: 19872a0e8 CPU: 2 COMMAND: "hyperkube" LOWCORE INFO: -psw : 0x0004c0018000 0x00115fa6 -function : pcpu_delegate at 115fa6 -prefix : 0x7fe42000 -cpu timer: 0x7ffab2827828aa50 -clock cmp: 0xd2eb8b31445e4200 -general registers: 0x0004e001 0x001283b6 0xc001 0x8380fcb8 0x00115f9e 0x0056f6e2 0x0004 0x00cf9070 0x0001f3bfc000 0x00112fd8 0x0001c72bb400 0x0002 0x7fffc000 0x007c9ef0 0x00115f9e 0x8380fc18 -access registers: 0x03ff 0x7910 00 00 00 00 00 00 00 00 00 00 00 00 00 00 -control registers: 0x14066a12 0x7e6d81c7 0x00011140 00 0x2aef 0x0400 0x5000 0x7e6d81c7 00 00 00 00 00 0x00cfc007 0xdb00 0x00011280 -floating point registers: 0x409c7e258000 0x401de4e0 00 0x3fd24407ab0e073a 0x3ff0 0x3fee 0x3fef218f8a7a41a0 0x3fee 0x0080 00 0x03ff7f80 0x02aa4940e9e0 0xd401 0x03ffe81fe110 00 0x03fff2cfe638 #0 [8380fc78] smp_find_processor_id at 1160f8 #1 [8380fc90] machine_kexec at 1135d4 #2 [8380fcb8] crash_kexec at 1fbb8a #3 [8380fd88] panic at 27d0e0 #4 [8380fe28] die at 1142cc #5 [8380fe90] do_low_address at 12215e #6 [8380fea8] pgm_check_handler at 7c2ab4 PSW: 070520018000 02aa267e0e42 (user space) GPRS: 02aa2c4fd690 0001 02aa2c4fd690 03ff7fffee38 0002 00029c0f 00c42001ea00 0001 0001 00c42001c5c8 00c42082c1a0 02aa2666325e 03ff7fffed90 Contact Information = Chee Ye / y...@cn.ibm.com Stack trace output: no Oops output: [43200.761465] docker0: port 10(vethb9132e9) entered forwarding state [50008.560926] hrtimer: interrupt took 1698076 ns [123483.768984] systemd[1]: apt-daily.timer: Adding 7h 34min 22.582204s random time. [123483.930058] systemd[1]: apt-daily.timer: Adding 2h 18min 14.857162s random time. [123484.064879] systemd[1]: apt-daily.timer: Adding 10h 46min 2.301756s random time. [123484.824760] systemd[1]: apt-daily.timer: Adding 6h 16min 22.178655s random time. [153113.703126] conntrack: generic helper won't handle protocol 47. Please consider loading the specific helper module. [477085.704538] Low-address protection: 0004 ilc:2 [#1] SMP [477085.704551] Modules linked in: xt_physdev veth xt_recent xt_comment xt_mark xt_nat ipt_MASQUERADE nf_nat_masquerade_ipv4 xfrm_user xfrm_algo iptable_nat nf_nat_ipv4 xt_addrtype nf_nat br_netfilter bridge stp llc aufs ipt_REJECT nf_reject_ipv4 xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack iptable_filter ip_tables x_tables ghash_s390 prng aes_s390 des_s390 des_generic sha512_s390 qeth_l2 sha256_s390 qeth sha1_s390 qdio sha_common ccwgroup vmur dasd_eckd_mod dasd_mod [477085.705522] CPU: 2 PID: 10991 Comm: hyperkube Not tainted 4.4.0-87-generic #110-Ubuntu [477085.705525] task: 00019872a0e8 ti: 8380c000 task.ti: 8380c000 [477085.705529] User PSW : 070520018000 02aa267e0e42 [477085.705532]R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:1 AS:0 CC:2 PM:0 EA:3 User GPRS: 02aa2c4fd690 0001 [477085.705539]02aa2c4fd690 03ff7fffee38 0002 [477085.705553]
[Kernel-packages] [Bug 1709889] Re: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => Critical ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709889 Title: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time. Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: ---Problem Description--- When running stress test, sometimes seeing IO hung in dmesg or seeing "Host adapter abort request" error. ---Steps to Reproduce--- There are two ways to re-create the issues: (1)running HTX, you will see IO timeout backtrace in dmesg in several hours (2)running some IO test, then reboot system, repeat this two steps, it takes long time to re-create the issue. ---uname output--- 4.10.0-11-generic The bulk of the effort for this issue is currently being worked in MicroSemi's JIRA https://jira.pmcs.com/browse/ESDIBMOP-133. Ran an interesting test: Ran HTX until I started getting the "stall" messages on the console, then shutdown HTX and examined the I/O counters for the tested disks in sysfs: root@bostonp15:~# for i in /sys/devices/pci0003:00/0003:00:00.0/0003:01:00.0/host0/target0:2:[2345]/0:2:[2345]:0; do echo ${i##*/} $(<${i}/iorequest_cnt) $(<${i}/iodone_cnt); done 0:2:2:0 0x5eba3d 0x5eba3d 0:2:3:0 0x773cc9 0x773cc9 0:2:4:0 0x782c61 0x782c61 0:2:5:0 0x5ca134 0x5ca134 root@bostonp15:~# So, none of the disks showed any evidence of having lost an I/O. I then restarted HTX and aside from having to manually restart one of the disks, see no problems with the testing. It appears that what was "hung" was purely in userland. This does not absolve the kernel or aacraid driver from blame, but it shows that the OS "believes" that it completed the I/O and thus removed it from the queue. What we don't know is whether the OS truly notified HTX about the completion, or if HTX (or userland libraries) just failed to process the notification. Tests are running again, will see what happens next. Update from JIRA: I have run some more experiments. Not sure what it tells us, but here's what I've seen. First test, ran until I got kernel messages about stalled tasks, then shutdown HTX. After HTX was down, I checked the above mentioned counters and found that on each disk iorequest_cnt matched iodone_cnt. The disks were usable and I could restart HTX. This suggests that the problem is not in the PM8069 firmware, and makes the case for the aacraid driver having a bug somewhat weaker. However, this merely says that the driver "completed" the I/O as far as the kernel is concerned, not that a completion rippled back to the application. I restarted HTX and have run until errors. This time, I am leaving HTX running and observing. Two of the disks reached the HTX error threshold and the testers stopped (those 2 disks are now idle). Another disks saw errors but then stopped and appears to be running fine now. The last disk has not seen any errors (yet). On the two idle (errored-out) disks I see iorequest_cnt matches iodone_cnt. I am able to "terminate and restart" the two idle disks and HTX appears to be testing them again "normally". Note that no reboot was required, further supporting the evidence that, as far as the kernel is concerned, there is nothing wrong with the disks and their I/O paths. So, I don't believe this completely eliminates aacraid from the picture, especially given we don't see this behavior on other systems/drivers. But, it probably moves the focus of the investigation away form the adapter firmware. Tried build upstream 4.11 kernel on Ubuntu. This still gets the hangs. Both Ubuntu 4.10 and upstream 4.11 have aacraid driver 1.2.1[50792]-custom. Good new/bad news... While doing an initial evaluation of the LSI-3008 SAS HBA on Boston and Ubuntu 17.04, I am hitting this same problem. So, it appears to have nothing specific to do with the PM8069 or aacraid driver. Some notes on reproduce this. I have been using the github release of HTX, built using the following steps: 1. apt install make gcc g++ git libncurses5-dev libcxl-dev libdapl-dev (others may be required) 2. git clone https://github.com/open-power/HTX 3. cd HTX 4. make 5. make deb Then install the resulting "htxubuntu.deb" package. Note, HTX will not test disks that have a filesystem or OS installed, so there must be at least two disks made available to HTX by clearing any previous data. A partition table is optional, in my testing I have none. Also, it may be desirable to run HTX somewhere other than the console,
[Kernel-packages] [Bug 1709894] Re: Support for multiple PHBs in pseries virtual machines
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) ** Changed in: ubuntu-power-systems Importance: Undecided => High -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709894 Title: Support for multiple PHBs in pseries virtual machines Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Leonardo Augusto Guimaraes Garcia- 2017-07-27 13:24:28 == Power servers can have multiple PCI Host Bridges (PHBs), different from x86, where it is rare to have multiple PHBs. Multiple PHBs are important on Power virtual machines because it allows EEH to work correctly for multiple devices. If a virtual machine has only one vPHB and an EEH occurs to an adapter passed through to the virtual machine and attached to that vPHB, all other adapters connected to the same vPHB will be reset because of the EEH recovery routine. With multiple vPHBs we can attach one device per PHB and EEH will work independently without problems for each adapter. This feature has already been integrated upstream and will be available with libvirt 3.6.0. QEMU already allows multiple PHBs. This feature is to add this support into libvirt. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709894/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1709964] Re: Ubuntu 16.04.03: perf tool does not count pm_run_inst_cmpl with rcode on POWER9 DD2.0
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709964 Title: Ubuntu 16.04.03: perf tool does not count pm_run_inst_cmpl with rcode on POWER9 DD2.0 Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: ---Problem Description--- Unable to count pm_run_inst_cmpl using perf tool on boston dd2.0 root@isvbos3:~# perf stat -e r500fa,r600f4 -a sleep 1 Performance counter stats for 'system wide': r500fa 8,162,912 r600f4 1.001562716 seconds time elapsed ---uname output--- Linux isvbos3 4.10.0-29-generic #33~16.04.1-Ubuntu SMP Tue Jul 25 18:17:06 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = p9 dd2.0 ---Steps to Reproduce--- root@isvbos3:~# perf stat -e r500fa,r600f4 -a sleep 1 Performance counter stats for 'system wide': r500fa 8,162,912 r600f4 1.001562716 seconds time elapsed > Also we need this upstream fix 8c218578fcbbbdb10416c8614658bf32e3bf1655 in > the distro. $ git log 8c218578fcbbbdb10416c8614658 -1 commit 8c218578fcbbbdb10416c8614658bf32e3bf1655 Author: Madhavan SrinivasanDate: Fri May 26 13:38:27 2017 +0530 powerpc/perf: Fix Power9 test_adder fields Commit 8d911904f3ce4 ('powerpc/perf: Add restrictions to PMC5 in power9 DD1') was added to restrict the use of PMC5 in Power9 DD1. Intention was to disable the use of PMC5 using raw event code. But instead of updating the power9_isa207_pmu structure (used on DD1), the commit incorrectly updated the power9_pmu structure. Fix it. Fixes: 8d911904f3ce ("powerpc/perf: Add restrictions to PMC5 in power9 DD1") Reported-by: Shriya Signed-off-by: Madhavan Srinivasan Tested-by: Shriya Signed-off-by: Michael Ellerman $ git describe --contains 8c218578fcbbbdb v4.12-rc7~9^2~13 $ Commit 8c218578fc is available with kernel 4.12-rc7 afterwards. I don't see this fix in the Ubuntu 16.04.3 Zesty tree (the P9 hwe kernel, 4.10 based). Need to mirror to Canonical to request they pull in: 8c218578fcbbbdb10416c8614658bf32e3bf1655 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709964/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1696154] Re: [17.10 FEAT] Sign POWER host/NV kernels
** Changed in: ubuntu-power-systems Assignee: Canonical Kernel Team (canonical-kernel-team) => Steve Langasek (vorlon) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1696154 Title: [17.10 FEAT] Sign POWER host/NV kernels Status in The Ubuntu-power-systems project: In Progress Status in linux package in Ubuntu: In Progress Bug description: Feature Description: Sign POWER host and NV kernels with sign-file in anticipation of POWER secure boot. Provide the associated certificate. Ideally it would be possible to reuse the UEFI shim private key and certificate used to sign and verify x86_64 kernels. More details to follow. Guest kernels will be addressed in a future separate feature request. Business Case: As a system administrator I want to verify the integrity of my kernels so that I can prevent malicious kernels from being executed. Use Case: Signed POWER kernels will be validated by OPAL as OpenPOWER systems boot when keys are properly installed and the system is booted in secure mode. Test Case: Sign and install a POWER kernel on an OpenPOWER machine with a firmware level that supports secure boot. Install a PK, distro KEK certificat, and distro DB certificate. Boot the system and verify that it will boot the kernel. Negative tests: Separately remove the signature, install an usigned kernel, and modify the kernel image and test that the kernel will not boot. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1696154/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1703663] Re: ISST-LTE: Briggs:Stratton:UbuntuKVM: ics_opal_set_affinity on host kernel log using Intel X710 (i40e driver)
** Tags added: triage-g -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1703663 Title: ISST-LTE: Briggs:Stratton:UbuntuKVM: ics_opal_set_affinity on host kernel log using Intel X710 (i40e driver) Status in The Ubuntu-power-systems project: Fix Committed Status in linux package in Ubuntu: Fix Committed Status in linux source package in Xenial: Fix Committed Bug description: SRU Justification: Impact: We have an issue with the i40e driver, present in kernels pre-4.8, like Xenial v4.4 (from Ubuntu 16.04.1). i40e driver has an issue in interrupt vector assignment in case of sparse CPU masks, like when some CPUs are disabled. The driver does a linear mapping not taking gaps on cpumask into account. This is a problem in systems with multiple logical CPUs per core (like in SMT scenarios). If we disable some logical CPUs, by turning SMT off for example, we will end up with a sparse cpu_online_mask, i.e., only the first CPU in a core is online, and the incremental/linear filling in the interrupt cpumask might lead to multiple offline CPUs being assigned to the struct q_vector in the driver, leading to issues when interrupts are requested, like the following message in PowerPC systems: [141.97] ics_opal_set_affinity: No online cpus in the mask 0020,,, for irq 302 This message is output of interrupt controller in Power systems, a complain that the cpumask does not make sense to that particular system. Fix: Cherry-picking 7f6c553902bfa1c4e3f6cfa955c5ea036c7fe8e4 i40e: use valid online CPU on q_vector initialization into Xenial. Change limited to specific driver. Regression potential should be low and results testable. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1703663/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1704479] Re: Change CONFIG_IBMVETH to module
** Tags added: triage-g -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1704479 Title: Change CONFIG_IBMVETH to module Status in The Ubuntu-power-systems project: Fix Committed Status in linux package in Ubuntu: Fix Committed Status in linux source package in Zesty: Fix Released Status in linux source package in Artful: Fix Committed Bug description: == Comment: #0 - Breno Leitao- 2017-07-14 15:07:47 == Dear Canonical, Please change the CONFIG_IBMVETH=y to module on artful kernel. Thank you, Breno To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1704479/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1704730] Re: xfstest fails with corrupt file /mnt/scratch/1 - non-zero size but no extents ( ext4 )
** Tags added: triage-g -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1704730 Title: xfstest fails with corrupt file /mnt/scratch/1 - non-zero size but no extents ( ext4 ) Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: Incomplete Bug description: xfstests fails non-zero size but no extents on ext4 filesystem Environment -- Kernel Build: 4.12.1-041201-generic Model : 8247-22L Platform: PowerNV ( P8 ) Uname output --- # uname -a Linux ltc-test-ci2 4.12.1-041201-generic #201707121132 SMP Wed Jul 12 17:03:25 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Create a loop device with ext4 filesystem 2. git clone git://git.kernel.org/pub/scm/fs/xfs/xfstests-dev.git; cd xfstests-dev 3. make 4. Create a local.config for running with created loop device 5. Run xfstests-dev test : ./check tests/generic/044 The test 044 fails with following generic/044 - output mismatch (see /root/harish/xfstests-dev/results//generic/044.out.bad) --- tests/generic/044.out 2017-07-13 06:04:36.208323135 -0400 +++ /root/harish/xfstests-dev/results//generic/044.out.bad 2017-07-14 06:24:08.153731112 -0400 @@ -1 +1,1000 @@ QA output created by 044 +corrupt file /mnt/scratch/1 - non-zero size but no extents +corrupt file /mnt/scratch/2 - non-zero size but no extents +corrupt file /mnt/scratch/3 - non-zero size but no extents +corrupt file /mnt/scratch/4 - non-zero size but no extents +corrupt file /mnt/scratch/5 - non-zero size but no extents +corrupt file /mnt/scratch/6 - non-zero size but no extents ... (Run 'diff -u tests/generic/044.out /root/harish/xfstests-dev/results//generic/044.out.bad' to see the entire diff) Ran: generic/044 Failures: generic/044 Failed 1 of 1 tests Dmesg: -- [17244.878673] EXT4-fs (loop0): mounted filesystem with ordered data mode. Opts: (null) [17245.517227] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17245.697100] EXT4-fs (loop0): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17245.710634] run fstests generic/044 at 2017-07-14 06:23:49 [17246.534410] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17246.535534] EXT4-fs (loop2): shut down requested (1) [17246.535625] Aborting journal on device loop2-8. [17247.278467] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17259.888304] EXT4-fs (loop2): shut down requested (2) [17259.995751] Aborting journal on device loop2-8. [17260.113582] EXT4-fs (loop2): recovery complete [17260.113902] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17260.190076] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17264.821978] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr == Comment: #2 - SEETEENA THOUFEEK- 2017-07-17 02:10:52 == Issue does not happen when running the same test with xfs file system. (ie, creating loop device with xfs file system). To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1704730/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1704972] Re: [LTCTest][Opal][FW910] HMI TFMR HDEC parity error is throwing Severe Machine check interrupt
** Tags added: triage-g -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1704972 Title: [LTCTest][Opal][FW910] HMI TFMR HDEC parity error is throwing Severe Machine check interrupt Status in The Ubuntu-power-systems project: Triaged Status in linux package in Ubuntu: Triaged Status in linux source package in Zesty: Triaged Bug description: == Comment: #0 - PAVAMAN SUBRAMANIYAM- 2017-05-22 05:12:38 == ---Problem Description--- HMI TFMR HDEC parity error is throwing Severe Machine check interrupt ---uname output--- Linux zz376p1 4.10.0-21-generic #23~16.04.1-Ubuntu SMP Tue May 2 12:54:57 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = P9 ---System Hang--- The system hangs indefinitely and we have to reboot the system to recover back. ---Debugger--- A debugger is not configured Immediately after injecting the above error, we get Severe Machine check interrupt [[Not recovered] Contact Information = pavsu...@in.ibm.com Stack trace output: no Oops output: [ 288.655336] Severe Machine check interrupt [[Not recovered] [ 288.655339] Severe Machine check interrupt [[Not recovered] [ 288.655342] Severe Machine check interrupt [[Not recovered] [ 288.655345] Severe Machine check interrupt [[Not recovered] [ 288.655348] Initiator: CPU [ 288.655349] Initiator: CPU [ 288.655352] Error type: Real address [Load/Store (foreign)] [ 288.655354] Initiator: CPU [ 288.655357] Effective address: 333035342dfe3030 [ 288.655360] Error type: Real address [Load/Store (foreign)] [ 288.655366] Error type: Real address [Load/Store (foreign)] [ 288.655369] Effective address: 333035342e013030 [ 288.655371] Effective address: 333035342e073030 [ 288.655418] opal: Reboot type 1 not supported [ 288.655420] opal: Reboot type 1 not supported [ 288.655422] opal: Reboot type 1 not supported [ 288.655423] Kernel panic - not syncing: PowerNV Unrecovered Machine Check [ 288.655430] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G M 4.10.0-21-generic #23~16.04.1-Ubuntu [ 288.655433] Call Trace: [ 288.655450] Sending IPI to other CPUs [ 288.656767] Initiator: CPU [ 288.656834] Error type: Real address [Load/Store (foreign)] [ 288.656945] Effective address: 333035342e043030 [ 288.657060] opal: Reboot type 1 not supported [ 298.655034] ERROR: 3 cpu(s) not responding [ 298.655183] Activate system reset (dumprestart) to stop other cpu(s) System Dump Info: The system is not configured to capture a system dump. *Additional Instructions for pavsu...@in.ibm.com: -Attach sysctl -a output output to the bug. == Comment: #3 - MAHESH J. SALGAONKAR - 2017-06-29 03:23:30 == (In reply to comment #2) > We need upstream commit > https://git.kernel.org/powerpc/c/be5c5e843c4afa1c8397cb740b6032 that fixes > this issue. > > Hi Breno, > We will be needing this upstream commit to be included in Ubuntu 16.04.3 Did this patch make into Ubuntu 16.04.3 ? To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1704972/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1706033] Re: [P9, Power NV][ WSP][Ubuntu 16.04.03] : perf hw breakpoint command results in call traces and system goes for reboot.
** Tags added: triage-g -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1706033 Title: [P9, Power NV][ WSP][Ubuntu 16.04.03] : perf hw breakpoint command results in call traces and system goes for reboot. Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Shriya R. Kulkarni <> - 2017-06-14 04:38:16 == Problem Description : = While running perftool - testsuite, the perf hw breakpoint fails and it result in call traces , hence system goes for reboot . Machine details : == System : P9 , WSP , Bare metal. OS : Ubuntu 16.04.03 uname -a : Linux ltc-wspoon3 4.10.0-23-generic #25~16.04.1-Ubuntu SMP Fri Jun 9 10:43:34 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Install perf. 2. git clone perftool-testsuite. https://github.com/rfmvh/perftool-testsuite 3. Do make. 4. Test fails at step : -- [ FAIL ] -- perf_stat :: test_hw_breakpoints :: kspace address execution mem:0xc035c020:x (command exitcode + output regexp parsing and call trace is seen as system goes for reboot. Call traces : === ubuntu@ltc-wspoon3:~$ [1602513.518414] Unable to handle kernel paging request for data at address 0xc135d3b8 [1602513.518553] Faulting instruction address: 0xc02869bc [1602513.518694] Oops: Kernel access of bad area, sig: 11 [#1] [1602513.518782] SMP NR_CPUS=2048 [1602513.518784] NUMA [1602513.518842] PowerNV [1602513.518922] Modules linked in: vmx_crypto ofpart ipmi_powernv cmdlinepart ipmi_devintf powernv_flash ipmi_msghandler ibmpowernv opal_prd mtd at24 nvmem_core uio_pdrv_genirq uio autofs4 ast i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops crc32c_vpmsum ttm drm tg3 ahci libahci [1602513.519399] CPU: 27 PID: 4069 Comm: sysctl Not tainted 4.10.0-22-generic #24 [1602513.519524] task: c000203968c42c00 task.stack: c00020396571 [1602513.519624] NIP: c02869bc LR: c03f7348 CTR: c0286990 [1602513.519747] REGS: c000203965713a40 TRAP: 0300 Not tainted (4.10.0-22-generic) [1602513.519876] MSR: 90009033[1602513.519889] CR: 22002448 XER: [1602513.520058] CFAR: c03f7344 DAR: c135d3b8 DSISR: 0040 SOFTE: 1 [1602513.520058] GPR00: c03f7348 c000203965713cc0 c145d100 c134af00 [1602513.520058] GPR04: 4ee50300 c000203965713d20 c000203965713e00 [1602513.520058] GPR08: c135d100 c0b71020 [1602513.520058] GPR12: c0286990 c7b4f300 [1602513.520058] GPR16: [1602513.520058] GPR20: 0001 [1602513.520058] GPR24: 3fffc542f5a0 0400 c000203965713e00 4ee50300 [1602513.520058] GPR28: c134af00 c03fee038800 [1602513.521280] NIP [c02869bc] dirty_ratio_handler+0x2c/0x90 [1602513.521374] LR [c03f7348] proc_sys_call_handler+0x138/0x1c0 [1602513.521481] Call Trace: [1602513.521526] [c000203965713cc0] [c000203965713d00] 0xc000203965713d00 (unreliable) [1602513.521655] [c000203965713d00] [c03f7348] proc_sys_call_handler+0x138/0x1c0 [1602513.521797] [c000203965713d70] [c03436ec] __vfs_read+0x3c/0x70 [1602513.521907] [c000203965713d90] [c034516c] vfs_read+0xbc/0x1b0 [1602513.522016] [c000203965713de0] [c0346dd8] SyS_read+0x68/0x110 [1602513.522112] [c000203965713e30] [c000b184] system_call+0x38/0xe0 [1602513.522243] Instruction dump: [1602513.522303] 6042 3c4c011d 38426770 7c0802a6 6000 7c0802a6 fbc1fff0 fbe1fff8 [1602513.522445] f8010010 f821ffc1 3d22fff0 7c9f2378 4be66da9 6000 3d22fff0 [1602513.522564] ---[ end trace 17c76e13e641d3c6 ]--- [1602513.522657] It goes for reboot : After booting to Ubuntu , I see series of call traces. Ubuntu 16.04.2 LTS ltc-wspoon3 hvc0 ltc-wspoon3 login: [ 3476.626263] Unable to handle kernel paging request for data at address 0xc13ad438 [ 3476.626422] Faulting instruction address: 0xc029a140 [ 3476.626537] Oops: Kernel access of bad area, sig: 11 [#1] [ 3476.626615] SMP NR_CPUS=2048 [ 3476.626616] NUMA [ 3476.626673] PowerNV [ 3476.626746] Modules linked in: ipmi_powernv at24 ipmi_devintf nvmem_core ipmi_msghandler ofpart cmdlinepart powernv_flash mtd opal_prd vmx_crypto ibmpowernv uio_pdrv_genirq uio autofs4 ast i2c_algo_bit ttm crc32c_vpmsum drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops tg3 drm ahci libahci [
[Kernel-packages] [Bug 1702521] Re: cxlflash update request in the Xenial SRU stream
** Changed in: ubuntu-power-systems Importance: Medium => High -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1702521 Title: cxlflash update request in the Xenial SRU stream Status in The Ubuntu-power-systems project: Fix Committed Status in linux package in Ubuntu: Fix Committed Status in linux source package in Zesty: Fix Committed Bug description: ---Problem Description--- Request to update cxlflash in the Xenial SRU stream There are multiple patches in upstream that are of interest and would like to see them pulled into the Xenial SRU stream. I have listed the patches that are in upstream/master tree. 41b99e1a30a6d04df7585905e5ffc7251099c6d3 scsi: cxlflash: Avoid mutex when destroying context 44ef38f9a2af8644c24ace6cbe1132dc70174c39 scsi: cxlflash: Cache owning adapter within context 888baf069f49529f33c0b1dfb0fc4811dc0ca1d2 scsi: cxlflash: Add kref to context cd34af40a09c678abad36304eb68e1774640e908 scsi: cxlflash: Transition to application close model de9f0b0cbb86da288a2d38e35f2953a85608a6aa scsi: cxlflash: Remove adapter file descriptor cache 9442c9b0ed5c6f3a86dce0d6f714ef43f7f6cd53 scsi: cxlflash: Update documentation f80132613d576e7e705344d0c33f3b5e65d9e16a scsi: cxlflash: Refactor WWPN setup c4a11827b760ef8dcda26b5731d072b1d8fb7c81 scsi: cxlflash: Fix context reference tracking on detach 68ab2d76e4be785a7003fdb42b7c4ed8bba56ae2 scsi: cxlflash: Set sg_tablesize to 1 instead of SG_NONE 8a2605430a64bdf0361af5a18043717a2c59972f scsi: cxlflash: Fix crash in cxlflash_restore_luntable() 3d2f617d448f5e1d15d2844b803c13763ed51f1f scsi: cxlflash: Improve context_reset() logic 11f7b1844ac01d0298aad6a0ec2591bef4a1c3a2 scsi: cxlflash: Avoid command room violation e7ab2d401dbf633eaafe5bd1f39e84492848668f scsi: cxlflash: Remove unused buffer from AFU command 350bb478f57387df1e0b830fc64be2d1c3d55b6b scsi: cxlflash: Allocate memory instead of using command pool for AFU sync 5fbb96c8f1ba89fb220efb7e4eeed7cb5112becd scsi: cxlflash: Use cmd_size for private commands 25bced2b61b43b6372a73008dafa2183c5d53c39 scsi: cxlflash: Remove private command pool de01283baa334b1d938cfd9121198c517ad6dc89 scsi: cxlflash: Wait for active AFU commands to timeout upon tear down 9ba848acbf4fbc6d99a0992df9ef5eb1b4842ba9 scsi: cxlflash: Remove AFU command lock d4ace35166e55e73afe72a05d166342996063d35 scsi: cxlflash: Cleanup send_tmf() 9d89326c6660bc287b74983b51239460da10e189 scsi: cxlflash: Cleanup queuecommand() 48b4be36edf8a2cb0dedcb2d28f598e51249e805 scsi: cxlflash: Migrate IOARRIN specific routines to function pointers fe7f96982a4e7103ffab45fba34c57ee19b62639 scsi: cxlflash: Migrate scsi command pointer to AFU command 9c7d1ee5f13a7130f6d3df307ec010e9e003fa98 scsi: cxlflash: Refactor context reset to share reset logic 696d0b0c715360ce28fedd3c8b009d3771a5ddeb scsi: cxlflash: Support SQ Command Mode fb67d44dfbdf85d984b9b40284e90636a3a7b21d scsi: cxlflash: Cleanup prints 0df5bef739601f18bffc0d256ae451f239a826bd scsi: cxlflash: Cancel scheduled workers before stopping AFU 943445200b049d5179b95297e5372d399c8ab0e2 scsi: cxlflash: Enable PCI device ID for future IBM CXL Flash AFU 76a6ebbeef26b004c36a0c8ee0496bae5428fc31 scsi: cxlflash: Separate RRQ processing from the RRQ interrupt handler f918b4a8e6f8bb59c44045f85d10fd9cc7e5a4c0 scsi: cxlflash: Serialize RRQ access and support offlevel processing cba06e6de4038cd44a3e93a92ad982c372b8a14e scsi: cxlflash: Implement IRQ polling for RRQ processing 3b225cd32a05b627a6ca366f364a824beaabecc5 scsi: cxlflash: Update sysfs helper routines to pass config structure 78ae028e823701148e4915759459ee79597ea8ec scsi: cxlflash: Support dynamic number of FC ports 8fa4f1770d56af6f0a5a862f1fd298a4eeea94f3 scsi: cxlflash: Remove port configuration assumptions 0aa14887c60c27e3385295ee85f5ac079ae2ffb5 scsi: cxlflash: Hide FC internals behind common access routine 565180723294b06b3e60030033847277b9d6d4bb scsi: cxlflash: SISlite updates to support 4 ports 1cd7fabc82eb06c834956113ff287f8848811fb8 scsi: cxlflash: Support up to 4 ports 323e33428ea23bfb1ae5010b18b4540048b2ad51 scsi: cxlflash: Fence EEH during probe 50b787f7235efbd074bbdf4315e0cc261d85b4d7 scsi: cxlflash: Remove unnecessary DMA mapping cd41e18daf1a21fea5a195a5a74c97c6b183c15a scsi: cxlflash: Fix power-of-two validations fcc87e74a987dc9c0c85f53546df944ede76486a scsi: cxlflash: Fix warnings/errors e2ef33fa5958c51ebf0c6f18db19fe927e2185fa scsi: cxlflash: Improve asynchronous interrupt processing bfc0bab172cabf3bb25c48c4c521b317ff4a909d scsi: cxlflash: Support multiple hardware queues 3065267a80c88d775e8eb34196280e8eee33322f scsi: cxlflash: Add hardware queues attribute 1dd0c0e4fd02dc5e5bfaf89bd4656aabe4ae3cb3 scsi: cxlflash: Introduce hardware queue steering The patches are required on 16.04.3
[Kernel-packages] [Bug 1696154] Re: [17.10 FEAT] Sign POWER host/NV kernels
Work is ongoing to clarify the timeline for enabling signed Power kernels. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1696154 Title: [17.10 FEAT] Sign POWER host/NV kernels Status in The Ubuntu-power-systems project: In Progress Status in linux package in Ubuntu: In Progress Bug description: Feature Description: Sign POWER host and NV kernels with sign-file in anticipation of POWER secure boot. Provide the associated certificate. Ideally it would be possible to reuse the UEFI shim private key and certificate used to sign and verify x86_64 kernels. More details to follow. Guest kernels will be addressed in a future separate feature request. Business Case: As a system administrator I want to verify the integrity of my kernels so that I can prevent malicious kernels from being executed. Use Case: Signed POWER kernels will be validated by OPAL as OpenPOWER systems boot when keys are properly installed and the system is booted in secure mode. Test Case: Sign and install a POWER kernel on an OpenPOWER machine with a firmware level that supports secure boot. Install a PK, distro KEK certificat, and distro DB certificate. Boot the system and verify that it will boot the kernel. Negative tests: Separately remove the signature, install an usigned kernel, and modify the kernel image and test that the kernel will not boot. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1696154/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1649513] Re: [Ubuntu 16.10] NMI watchdog and soft lockup while running htx memory tests in kernel 4.8.0-17-generic
** Also affects: ubuntu-power-systems Importance: Undecided Status: New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1649513 Title: [Ubuntu 16.10] NMI watchdog and soft lockup while running htx memory tests in kernel 4.8.0-17-generic Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: Incomplete Bug description: Issue: -- NMI Watchdog Bug and soft lockup occurs when htx memory test is run in ubuntu 16.10. Environment: -- Arch : ppc64le Platform : Ubuntu KVM Guest Host : ubuntu 16.10 [4.8.0-17 -kernel ] Guest : ubuntu 16.10 [4.8.0-17 - Kernel] Steps To Reproduce: --- 1 - Install a Ubuntu KVM Guest and install htx package in the guest got from the link, http://ausgsa.ibm.com/projects/h/htx/public_html/htxonly/htxubuntu-413.deb 2 - Run the Htx mdt.mem 3 - The system Hits soft lockup Issue as below: dmesg o/p: [60287.590335] NMI watchdog: BUG: soft lockup - CPU#3 stuck for 1141s! [hxemem64:23468] [60287.590572] Modules linked in: vmx_crypto ip_tables x_tables autofs4 ibmvscsi crc32c_vpmsum [60287.590585] CPU: 3 PID: 23468 Comm: hxemem64 Tainted: G L 4.8.0-17-generic #19-Ubuntu [60287.590587] task: c012a0971e00 task.stack: c012a2d4 [60287.590589] NIP: c0015004 LR: c0015004 CTR: c0165e90 [60287.590591] REGS: c012a2d439a0 TRAP: 0901 Tainted: G L (4.8.0-17-generic) [60287.590592] MSR: 80009033CR: 48004244 XER: [60287.590603] CFAR: c0165890 SOFTE: 1 GPR00: c0165f9c c012a2d43c20 c14e5e00 0900 GPR04: 0008 000100e4d61a GPR08: 0006 000100e4d619 c012bfee3130 GPR12: 3fffae6cdc70 3fffae436900 [60287.590627] NIP [c0015004] arch_local_irq_restore+0x74/0x90 [60287.590630] LR [c0015004] arch_local_irq_restore+0x74/0x90 [60287.590631] Call Trace: [60287.590634] [c012a2d43c20] [c012bfeccd80] 0xc012bfeccd80 (unreliable) [60287.590639] [c012a2d43c40] [c0165f9c] run_timer_softirq+0x10c/0x230 [60287.590644] [c012a2d43ce0] [c0b94adc] __do_softirq+0x18c/0x3fc [60287.590648] [c012a2d43de0] [c00d5828] irq_exit+0xc8/0x100 [60287.590653] [c012a2d43e00] [c0024810] timer_interrupt+0xa0/0xe0 [60287.590657] [c012a2d43e30] [c0002814] decrementer_common+0x114/0x180 [60287.590659] Instruction dump: [60287.590662] 994d023a 2fa3 409e0024 e92d0020 61298000 7d210164 38210020 e8010010 [60287.590670] 7c0803a6 4e800020 6042 4bfed259 <6000> 4be4 6042 e92d0020 [63127.581494] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 339s! [hxemem64:23467] [63127.629682] Modules linked in: vmx_crypto ip_tables x_tables autofs4 ibmvscsi crc32c_vpmsum [63127.629699] CPU: 2 PID: 23467 Comm: hxemem64 Tainted: G L 4.8.0-17-generic #19-Ubuntu [63127.629701] task: c012a0965800 task.stack: c012a2d58000 [63127.629703] NIP: 10011e60 LR: 1000ec6c CTR: 00f33196 [63127.629706] REGS: c012a2d5bea0 TRAP: 0901 Tainted: G L (4.8.0-17-generic) [63127.629707] MSR: 8001d033 CR: 42004482 XER: [63127.629719] CFAR: 10011e68 SOFTE: 1 GPR00: 1000e854 3fffadc2e540 10047f00 000d GPR04: 0200 3ff5a800 5a5a5a5a5a5a5a5a 3ff5b0667348 GPR08: 1006c8e0 1006ca04 f001 GPR12: 3fffae6cdc70 3fffadc36900 [63127.629740] NIP [10011e60] 0x10011e60 [63127.629742] LR [1000ec6c] 0x1000ec6c [63127.629743] Call Trace: == Comment: #3 - Santhosh G - 2016-09-28 02:17:29 == Memory Info : root@ubuntu:~# cat /proc/meminfo MemTotal: 78539776 kB MemFree:72219392 kB MemAvailable: 77217088 kB Buffers: 212544 kB Cached: 5249088 kB SwapCached:0 kB Active: 1440832 kB Inactive:4107264 kB Active(anon): 93888 kB Inactive(anon): 8640 kB Active(file):1346944 kB Inactive(file): 4098624 kB Unevictable: 0 kB Mlocked: 0 kB SwapTotal: 3443648 kB SwapFree:3443648 kB Dirty: 0 kB Writeback: 0 kB AnonPages: 87296 kB Mapped:30400 kB Shmem: 16128 kB Slab: 381440 kB SReclaimable: 295872 kB
[Kernel-packages] [Bug 1709171] Re: Disable CONFIG_MEMORY_HOTPLUG_DEFAULT_ONLINE
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709171 Title: Disable CONFIG_MEMORY_HOTPLUG_DEFAULT_ONLINE Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: Canonical, Could you please disable CONFIG_MEMORY_HOTPLUG_DEFAULT_ONLINE in the current ppc64el kernels up to 16.04.3? It was requested originally LP#1706380, but it was not integrated in that bug, so, I am creating this new one to track this config change. Thank you, Breno To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709171/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1709352] Re: Avoid spurious PMU interrupts after idle
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709352 Title: Avoid spurious PMU interrupts after idle Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: Hello Canonical, For POWER9 enablement, we need to backport the following commit ID up to Zesty. 101dd590a7fa ("powerpc/perf: Avoid spurious PMU interrupts after idle") To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709352/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1635597] Re: Ubuntu16.10:talclp1: Kdump failed with multipath disk
** Changed in: linux (Ubuntu) Status: Invalid => New ** Changed in: linux (Ubuntu Trusty) Status: Invalid => New ** Changed in: linux (Ubuntu Xenial) Status: Invalid => New ** Changed in: linux (Ubuntu Zesty) Status: Invalid => New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1635597 Title: Ubuntu16.10:talclp1: Kdump failed with multipath disk Status in The Ubuntu-power-systems project: In Progress Status in linux package in Ubuntu: New Status in makedumpfile package in Ubuntu: Fix Released Status in linux source package in Trusty: New Status in makedumpfile source package in Trusty: Confirmed Status in linux source package in Xenial: New Status in makedumpfile source package in Xenial: Confirmed Status in linux source package in Zesty: New Status in makedumpfile source package in Zesty: Confirmed Bug description: Problem Description == On talclp1, I enabled kdump. But kdump failed and it drop to BusyBox. root@talclp1:~# echo c> /proc/sysrq-trigger [ 132.643690] sysrq: SysRq : Trigger a crash [ 132.643739] Unable to handle kernel paging request for data at address 0x [ 132.643745] Faulting instruction address: 0xc05c28f4 [ 132.643749] Oops: Kernel access of bad area, sig: 11 [#1] [ 132.643753] SMP NR_CPUS=2048 NUMA pSeries [ 132.643758] Modules linked in: fuse ufs qnx4 hfsplus hfs minix ntfs msdos jfs rpadlpar_io rpaphp rpcsec_gss_krb5 nfsv4 dccp_diag cifs nfs dns_resolver dccp tcp_diag fscache udp_diag inet_diag unix_diag af_packet_diag netlink_diag binfmt_misc xfs libcrc32c pseries_rng rng_core ghash_generic gf128mul vmx_crypto sg nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables x_tables autofs4 ext4 crc16 jbd2 fscrypto mbcache crc32c_generic btrfs xor raid6_pq dm_round_robin sr_mod sd_mod cdrom ses enclosure scsi_transport_sas ibmveth crc32c_vpmsum ipr scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath dm_mod [ 132.643819] CPU: 49 PID: 10174 Comm: bash Not tainted 4.8.0-15-generic #16-Ubuntu [ 132.643824] task: c00111767080 task.stack: c000d82e [ 132.643828] NIP: c05c28f4 LR: c05c39d8 CTR: c05c28c0 [ 132.643832] REGS: c000d82e3990 TRAP: 0300 Not tainted (4.8.0-15-generic) [ 132.643836] MSR: 80009033CR: 28242422 XER: 0001 [ 132.643848] CFAR: c00087d0 DAR: DSISR: 4200 SOFTE: 1 GPR00: c05c39d8 c000d82e3c10 c0f67b00 0063 GPR04: c0011d04a9b8 c0011d05f7e0 c0047fb0 00015998 GPR08: 0007 0001 0001 GPR12: c05c28c0 c7b4b900 2200 GPR16: 10170dc8 01002b566368 10140f58 100c7570 GPR20: 1017dd58 10153618 1017b608 GPR24: 3e87a294 0001 c0ebff60 0004 GPR28: c0ec0320 0063 c0e72a90 [ 132.643906] NIP [c05c28f4] sysrq_handle_crash+0x34/0x50 [ 132.643911] LR [c05c39d8] __handle_sysrq+0xe8/0x280 [ 132.643914] Call Trace: [ 132.643917] [c000d82e3c10] [c0a245e8] 0xc0a245e8 (unreliable) [ 132.643923] [c000d82e3c30] [c05c39d8] __handle_sysrq+0xe8/0x280 [ 132.643928] [c000d82e3cd0] [c05c4188] write_sysrq_trigger+0x78/0xa0 [ 132.643935] [c000d82e3d00] [c03ad770] proc_reg_write+0xb0/0x110 [ 132.643941] [c000d82e3d50] [c030fc3c] __vfs_write+0x6c/0xe0 [ 132.643946] [c000d82e3d90] [c0311144] vfs_write+0xd4/0x240 [ 132.643950] [c000d82e3de0] [c0312e5c] SyS_write+0x6c/0x110 [ 132.643957] [c000d82e3e30] [c00095e0] system_call+0x38/0x108 [ 132.643961] Instruction dump: [ 132.643963] 38425240 7c0802a6 f8010010 f821ffe1 6000 6000 3d220019 3949ba60 [ 132.643972] 3921 912a 7c0004ac 3940 <992a> 38210020 e8010010 7c0803a6 [ 132.643981] ---[ end trace eed6bbcd2c3bdfdf ]--- [ 132.646105] [ 132.646176] Sending IPI to other CPUs [ 132.647490] IPI complete I'm in purgatory -> smp_release_cpus() spinning_secondaries = 104 <- smp_release_cpus() [2.011346] alg: hash: Test 1 failed for crc32c-vpmsum [2.729254] sd 0:2:0:0: [sda] Assuming drive cache: write through [2.731554] sd 1:2:5:0: [sdn] Assuming drive cache: write through [2.739087] sd 1:2:4:0: [sdm] Assuming drive cache: write through [2.739089] sd 1:2:6:0: [sdo] Assuming drive cache: write through [2.739110] sd 1:2:7:0: [sdp] Assuming drive cache: write through [2.739115] sd 1:2:0:0: [sdi] Assuming drive cache: write through [2.739122] sd
[Kernel-packages] [Bug 1716924] Re: [Ubuntu 17.10 FEAT] Include cxlflash package in Ubuntu
Other bugs impacting the cxlflash driver have been handled by the kernel team, e.g. https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1702521 and https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1605405. Is this a request to package a userspace application that leverages the cxlflash driver's functionality? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1716924 Title: [Ubuntu 17.10 FEAT] Include cxlflash package in Ubuntu Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Rodrigo R. Rosatti Galvao- 2017-09-12 13:34:52 == CAPI can be thought of as a special tunneling protocol through PCIe that allow PCIe adapters to look like special purpose co-processors which can read or write an application's memory and generate page faults. The cxlflash driver is responsible for the initialization of the Coherent Accelerator (CXL) Flash Adapter, setting up the special path for user space access, and performing error recovery. == Comment: #2 - Rodrigo R. Rosatti Galvao - 2017-09-12 13:35:24 == Hello, Canonical We'd like to include cxlflash package into Artful. There's already a RFS opened for it on Debian (RFS 870909) and it's uploaded into mentors.debian as well. But, since the FeatureFreeze for Artful was on August 24th we'd like to make progress on it directly with Ubuntu. RFS 870909: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=870909 package on mentors: https://mentors.debian.net/package/cxlflash To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1716924/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1716924] Re: [Ubuntu 17.10 FEAT] Include cxlflash package in Ubuntu
** Changed in: ubuntu-power-systems Assignee: Canonical Kernel Team (canonical-kernel-team) => Canonical Foundations Team (canonical-foundations) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1716924 Title: [Ubuntu 17.10 FEAT] Include cxlflash package in Ubuntu Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Rodrigo R. Rosatti Galvao- 2017-09-12 13:34:52 == CAPI can be thought of as a special tunneling protocol through PCIe that allow PCIe adapters to look like special purpose co-processors which can read or write an application's memory and generate page faults. The cxlflash driver is responsible for the initialization of the Coherent Accelerator (CXL) Flash Adapter, setting up the special path for user space access, and performing error recovery. == Comment: #2 - Rodrigo R. Rosatti Galvao - 2017-09-12 13:35:24 == Hello, Canonical We'd like to include cxlflash package into Artful. There's already a RFS opened for it on Debian (RFS 870909) and it's uploaded into mentors.debian as well. But, since the FeatureFreeze for Artful was on August 24th we'd like to make progress on it directly with Ubuntu. RFS 870909: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=870909 package on mentors: https://mentors.debian.net/package/cxlflash To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1716924/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1716924] Re: [Ubuntu 17.10 FEAT] Include cxlflash package in Ubuntu
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => Medium ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1716924 Title: [Ubuntu 17.10 FEAT] Include cxlflash package in Ubuntu Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Rodrigo R. Rosatti Galvao- 2017-09-12 13:34:52 == CAPI can be thought of as a special tunneling protocol through PCIe that allow PCIe adapters to look like special purpose co-processors which can read or write an application's memory and generate page faults. The cxlflash driver is responsible for the initialization of the Coherent Accelerator (CXL) Flash Adapter, setting up the special path for user space access, and performing error recovery. == Comment: #2 - Rodrigo R. Rosatti Galvao - 2017-09-12 13:35:24 == Hello, Canonical We'd like to include cxlflash package into Artful. There's already a RFS opened for it on Debian (RFS 870909) and it's uploaded into mentors.debian as well. But, since the FeatureFreeze for Artful was on August 24th we'd like to make progress on it directly with Ubuntu. RFS 870909: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=870909 package on mentors: https://mentors.debian.net/package/cxlflash To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1716924/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1702768] Re: Ubuntu 17.04 KVM: stack trace generated when enabling SRIOV in power
*** This bug is a duplicate of bug 1701272 *** https://bugs.launchpad.net/bugs/1701272 This bug is marked as a duplicate of https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1701272 which is marked as "Fix Released" for 17.04. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1702768 Title: Ubuntu 17.04 KVM: stack trace generated when enabling SRIOV in power Status in The Ubuntu-power-systems project: In Progress Status in linux package in Ubuntu: In Progress Status in linux source package in Zesty: In Progress Bug description: ---Problem Description--- When enabling SRIOV with kernel 4.10.0-26-generic in power will see this stack trace: [ 2084.079575] [ cut here ] [ 2084.079583] WARNING: CPU: 120 PID: 734 at /build/linux-TAhFXm/linux-4.10.0/arch/powerpc/platforms/powernv/npu-dma.c:78 pnv_pci_get_npu_dev+0x40/0xb0 [ 2084.079584] Modules linked in: mst_pciconf(OE) mst_pci(OE) xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp kvm_hv kvm_pr kvm ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter rdma_ucm(OE) ib_ucm(OE) ib_ipoib(OE) ib_uverbs(OE) ib_umad(OE) mlx5_ib(OE) mlx4_ib(OE) binfmt_misc bridge stp llc ipmi_powernv ipmi_devintf ipmi_msghandler powernv_rng powernv_op_panel uio_pdrv_genirq leds_powernv uio ibmpowernv vmx_crypto sunrpc ib_iser(OE) rdma_cm(OE) iw_cm(OE) ib_cm(OE) ib_core(OE) configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi knem(OE) ip_tables x_tables autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx [ 2084.079640] xor raid6_pq libcrc32c raid1 raid0 multipath linear mlx4_en(OE) ses enclosure scsi_transport_sas crc32c_vpmsum tg3 mlx5_core(OE) mlx4_core(OE) ipr devlink mlx_compat(OE) [ 2084.079658] CPU: 120 PID: 734 Comm: kworker/120:0 Tainted: GW OE 4.10.0-26-generic #30-Ubuntu [ 2084.079663] Workqueue: events work_for_cpu_fn [ 2084.079665] task: c00fee60dc00 task.stack: c00fee534000 [ 2084.079666] NIP: c009c210 LR: c009d404 CTR: [ 2084.079668] REGS: c00fee537700 TRAP: 0700 Tainted: GW OE (4.10.0-26-generic) [ 2084.079669] MSR: 9282b033[ 2084.079677] CR: 42004428 XER: 2000 [ 2084.079678] CFAR: c009d400 SOFTE: 1 GPR00: c009d404 c00fee537980 c145d100 GPR04: 0aa6 c01fff70 00049188 GPR08: 0007 0001 0001 GPR12: 2200 cfbc3800 c010ef48 c00ff70ec540 GPR16: c00ffa622c58 c00ffa622a10 c00ffa6229a0 0001 GPR20: c1318de8 c0d700e8 0001 GPR24: c0d6f070 c0d6f050 c3d02000 c3d02098 GPR28: c00e92680060 081f [ 2084.079702] NIP [c009c210] pnv_pci_get_npu_dev+0x40/0xb0 [ 2084.079704] LR [c009d404] pnv_npu_try_dma_set_bypass+0x144/0x250 [ 2084.079705] Call Trace: [ 2084.079708] [c00fee5379b0] [c009d404] pnv_npu_try_dma_set_bypass+0x144/0x250 [ 2084.079710] [c00fee537a80] [c0096c74] pnv_pci_ioda_dma_set_mask+0xa4/0x150 [ 2084.079714] [c00fee537b00] [c00291a0] dma_set_mask+0x40/0xc0 [ 2084.079728] [c00fee537b20] [d000143531e4] init_one+0x33c/0x6a0 [mlx5_core] [ 2084.079732] [c00fee537bd0] [c066ba9c] local_pci_probe+0x6c/0x140 [ 2084.079734] [c00fee537c60] [c01016b8] work_for_cpu_fn+0x38/0x60 [ 2084.079737] [c00fee537c90] [c01061a0] process_one_work+0x2b0/0x5a0 [ 2084.079740] [c00fee537d20] [c0106780] worker_thread+0x2f0/0x650 [ 2084.079742] [c00fee537dc0] [c010f0a4] kthread+0x164/0x1b0 [ 2084.079746] [c00fee537e30] [c000b4e8] ret_from_kernel_thread+0x5c/0x74 [ 2084.079747] Instruction dump: [ 2084.079748] 7c0802a6 fbe1fff8 f8010010 f821ffd1 7c690074 7929d182 0b09 2fa3 [ 2084.079753] 419e0060 e8630330 7c690074 7929d182 <0b09> 2fa3 419e0048 7c852378 [ 2084.079759] ---[ end trace 7bf01a937efd69d8 ]--- This issue was introduced by this commit: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=4c3b89effc281704d5395282c800c45e453235f6 (Subject: powerpc/powernv: Add sanity checks to pnv_pci_get_{gpu|npu}_dev ) and the solution will be to add this commit:
[Kernel-packages] [Bug 1717443] Re: Call trace observed when running xfstests on btrfs filesystem
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1717443 Title: Call trace observed when running xfstests on btrfs filesystem Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Harish Sriram <> - 2017-07-21 00:34:58 == Problem Description Call trace observed when running xfstests on btrfs filesystem Environment -- Kernel Build: 4.11.0-10-generic System Name : ltc-boston124 Model : 9006-22C Platform: PowerNV ( P9 ) Uname output --- # uname -a Linux ltc-boston124 4.11.0-10-generic #15-Ubuntu SMP Thu Jun 29 15:02:54 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Create a loop device with btrfs filesystem 2. git clone git://git.kernel.org/pub/scm/fs/xfs/xfstests-dev.git; cd xfstests-dev 3. make 4. Create a local.config for running with created loop device 5.. Run xfstests-dev test : ./check tests/btrfs/130 Dmesg: -- [Fri Jul 21 01:26:55 2017] INFO: rcu_sched self-detected stall on CPU [Fri Jul 21 01:26:55 2017]4-...: (210013 ticks this GP) idle=6db/141/0 softirq=18227470/18227470 fqs=104791 [Fri Jul 21 01:26:55 2017] (t=210039 jiffies g=10179019 c=10179018 q=1388) [Fri Jul 21 01:26:55 2017] Task dump for CPU 4: [Fri Jul 21 01:26:55 2017] btrfs R running task0 24224 19847 0x00042006 [Fri Jul 21 01:26:55 2017] Call Trace: [Fri Jul 21 01:26:55 2017] [c0002003dd62efb0] [c01240dc] sched_show_task+0xcc/0x150 (unreliable) [Fri Jul 21 01:26:55 2017] [c0002003dd62f020] [c0b80d20] rcu_dump_cpu_stacks+0xd0/0x134 [Fri Jul 21 01:26:55 2017] [c0002003dd62f070] [c0177214] rcu_check_callbacks+0x9f4/0xb40 [Fri Jul 21 01:26:55 2017] [c0002003dd62f1a0] [c0180968] update_process_times+0x48/0x90 [Fri Jul 21 01:26:55 2017] [c0002003dd62f1d0] [c0197a70] tick_sched_handle.isra.7+0x30/0xb0 [Fri Jul 21 01:26:55 2017] [c0002003dd62f200] [c0197b54] tick_sched_timer+0x64/0xd0 [Fri Jul 21 01:26:55 2017] [c0002003dd62f240] [c0181408] __hrtimer_run_queues+0x108/0x390 [Fri Jul 21 01:26:55 2017] [c0002003dd62f2c0] [c0182458] hrtimer_interrupt+0xf8/0x330 [Fri Jul 21 01:26:55 2017] [c0002003dd62f390] [c0023dcc] __timer_interrupt+0x8c/0x270 [Fri Jul 21 01:26:55 2017] [c0002003dd62f3e0] [c00241cc] timer_interrupt+0x9c/0xe0 [Fri Jul 21 01:26:55 2017] [c0002003dd62f410] [c00090a4] decrementer_common+0x114/0x120 [Fri Jul 21 01:26:55 2017] --- interrupt: 901 at find_parent_nodes+0x97c/0x1680 [btrfs] LR = find_parent_nodes+0x930/0x1680 [btrfs] [Fri Jul 21 01:26:55 2017] [c0002003dd62f700] [c008571bcbfc] find_parent_nodes+0x874/0x1680 [btrfs] (unreliable) [Fri Jul 21 01:26:55 2017] [c0002003dd62f840] [c008571bee64] iterate_extent_inodes+0x3ec/0x440 [btrfs] [Fri Jul 21 01:26:55 2017] [c0002003dd62f960] [c008571ccddc] process_extent+0x804/0xc00 [btrfs] [Fri Jul 21 01:26:55 2017] [c0002003dd62fa80] [c008571d06cc] changed_cb+0x574/0xe00 [btrfs] [Fri Jul 21 01:26:55 2017] [c0002003dd62fb40] [c008571d1f74] btrfs_ioctl_send+0x101c/0x10e0 [btrfs] [Fri Jul 21 01:26:55 2017] [c0002003dd62fc40] [c00857186218] btrfs_ioctl+0x16e0/0x23f0 [btrfs] [Fri Jul 21 01:26:55 2017] [c0002003dd62fd40] [c0372124] do_vfs_ioctl+0xd4/0x8c0 [Fri Jul 21 01:26:55 2017] [c0002003dd62fde0] [c03729e4] SyS_ioctl+0xd4/0xf0 [Fri Jul 21 01:26:55 2017] [c0002003dd62fe30] [c000b184] system_call+0x38/0xe0 [Fri Jul 21 01:27:58 2017] INFO: rcu_sched self-detected stall on CPU [Fri Jul 21 01:27:58 2017]4-...: (225763 ticks this GP) idle=6db/141/0 softirq=18227470/18227470 fqs=112651 [Fri Jul 21 01:27:58 2017] (t=225792 jiffies g=10179019 c=10179018 q=2542) [Fri Jul 21 01:27:58 2017] Task dump for CPU 4: [Fri Jul 21 01:27:58 2017] btrfs R running task0 24224 19847 0x00042006 [Fri Jul 21 01:27:58 2017] Call Trace: [Fri Jul 21 01:27:58 2017] [c0002003dd62efb0] [c01240dc] sched_show_task+0xcc/0x150 (unreliable) [Fri Jul 21 01:27:58 2017] [c0002003dd62f020] [c0b80d20] rcu_dump_cpu_stacks+0xd0/0x134 [Fri Jul 21 01:27:58 2017] [c0002003dd62f070] [c0177214] rcu_check_callbacks+0x9f4/0xb40 [Fri Jul 21 01:27:58 2017] [c0002003dd62f1a0] [c0180968] update_process_times+0x48/0x90 [Fri Jul 21
[Kernel-packages] [Bug 1709889] Re: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.
Marking as "incomplete" until fix lands upstream. ** Changed in: ubuntu-power-systems Status: In Progress => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709889 Title: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time. Status in Linux: Unknown Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: In Progress Status in linux source package in Zesty: In Progress Bug description: ---Problem Description--- When running stress test, sometimes seeing IO hung in dmesg or seeing "Host adapter abort request" error. ---Steps to Reproduce--- There are two ways to re-create the issues: (1)running HTX, you will see IO timeout backtrace in dmesg in several hours (2)running some IO test, then reboot system, repeat this two steps, it takes long time to re-create the issue. ---uname output--- 4.10.0-11-generic The bulk of the effort for this issue is currently being worked in MicroSemi's JIRA https://jira.pmcs.com/browse/ESDIBMOP-133. Ran an interesting test: Ran HTX until I started getting the "stall" messages on the console, then shutdown HTX and examined the I/O counters for the tested disks in sysfs: root@bostonp15:~# for i in /sys/devices/pci0003:00/0003:00:00.0/0003:01:00.0/host0/target0:2:[2345]/0:2:[2345]:0; do echo ${i##*/} $(<${i}/iorequest_cnt) $(<${i}/iodone_cnt); done 0:2:2:0 0x5eba3d 0x5eba3d 0:2:3:0 0x773cc9 0x773cc9 0:2:4:0 0x782c61 0x782c61 0:2:5:0 0x5ca134 0x5ca134 root@bostonp15:~# So, none of the disks showed any evidence of having lost an I/O. I then restarted HTX and aside from having to manually restart one of the disks, see no problems with the testing. It appears that what was "hung" was purely in userland. This does not absolve the kernel or aacraid driver from blame, but it shows that the OS "believes" that it completed the I/O and thus removed it from the queue. What we don't know is whether the OS truly notified HTX about the completion, or if HTX (or userland libraries) just failed to process the notification. Tests are running again, will see what happens next. Update from JIRA: I have run some more experiments. Not sure what it tells us, but here's what I've seen. First test, ran until I got kernel messages about stalled tasks, then shutdown HTX. After HTX was down, I checked the above mentioned counters and found that on each disk iorequest_cnt matched iodone_cnt. The disks were usable and I could restart HTX. This suggests that the problem is not in the PM8069 firmware, and makes the case for the aacraid driver having a bug somewhat weaker. However, this merely says that the driver "completed" the I/O as far as the kernel is concerned, not that a completion rippled back to the application. I restarted HTX and have run until errors. This time, I am leaving HTX running and observing. Two of the disks reached the HTX error threshold and the testers stopped (those 2 disks are now idle). Another disks saw errors but then stopped and appears to be running fine now. The last disk has not seen any errors (yet). On the two idle (errored-out) disks I see iorequest_cnt matches iodone_cnt. I am able to "terminate and restart" the two idle disks and HTX appears to be testing them again "normally". Note that no reboot was required, further supporting the evidence that, as far as the kernel is concerned, there is nothing wrong with the disks and their I/O paths. So, I don't believe this completely eliminates aacraid from the picture, especially given we don't see this behavior on other systems/drivers. But, it probably moves the focus of the investigation away form the adapter firmware. Tried build upstream 4.11 kernel on Ubuntu. This still gets the hangs. Both Ubuntu 4.10 and upstream 4.11 have aacraid driver 1.2.1[50792]-custom. Good new/bad news... While doing an initial evaluation of the LSI-3008 SAS HBA on Boston and Ubuntu 17.04, I am hitting this same problem. So, it appears to have nothing specific to do with the PM8069 or aacraid driver. Some notes on reproduce this. I have been using the github release of HTX, built using the following steps: 1. apt install make gcc g++ git libncurses5-dev libcxl-dev libdapl-dev (others may be required) 2. git clone https://github.com/open-power/HTX 3. cd HTX 4. make 5. make deb Then install the resulting "htxubuntu.deb" package. Note, HTX will not test disks that have a filesystem or OS installed, so there must be at least two disks made available to HTX by clearing any previous data. A partition table is optional, in my testing I have none. Also, it may be desirable to run HTX somewhere other than the console, leaving the console free to watch for
[Kernel-packages] [Bug 1694485] Re: Ubuntu17.04: CAPI: call trace seen while error injection to the CAPI card.
Moving to "incomplete", please update bug status back when you are able to validate. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1694485 Title: Ubuntu17.04: CAPI: call trace seen while error injection to the CAPI card. Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: Incomplete Status in linux source package in Zesty: In Progress Bug description: == Comment: #0 - SUDEESH JOHN - 2017-03-18 13:55:03 == ---Problem Description--- call trace while injecting error to the CAPI card. " WARNING: CPU: 31 PID: 491 at /build/linux-VtwHOM/linux-4.10.0/drivers/misc/cxl/main.c:325 cxl_adapter_context_unlock+0x68/0x90 [cxl] " ---uname output--- Linux freak 4.10.0-13-generic #15-Ubuntu SMP Thu Mar 9 20:27:28 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = PowerNV 8247-21L ---Steps to Reproduce--- 1. echo 1 > /sys/kernel/debug/powerpc/eeh_max_freezes 2. echo 1 > /sys/class/cxl/card0/perst_reloads_same_image 3. echo 0x8000 > /sys/kernel/debug/powerpc/PCI/err_injct_outbound ---The complete call trace --- Mar 18 14:39:09 freak kernel: [ 289.675421] [ cut here ] Mar 18 14:39:09 freak kernel: [ 289.675431] WARNING: CPU: 5 PID: 491 at /build/linux-VtwHOM/linux-4.10.0/drivers/misc/cxl/main.c:325 cxl_adapter_context_unlock+0x68/0x90 [cxl] Mar 18 14:39:09 freak kernel: [ 289.675432] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc kvm_hv kvm_pr kvm ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter uio_pdrv_genirq uio ipmi_powernv ipmi_devintf ipmi_msghandler powernv_op_panel powernv_rng vmx_crypto ibmpowernv leds_powernv ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid1 raid0 multipath linear ses enclosure scsi_transport_sas bnx2x mlx5_core tg3 cxl mdio ipr libcrc32c devlink crc32c_vpmsum pnv_php Mar 18 14:39:09 freak kernel: [ 289.675490] CPU: 5 PID: 491 Comm: eehd Not tainted 4.10.0-13-generic #15-Ubuntu Mar 18 14:39:09 freak kernel: [ 289.675492] task: c003bfbfde00 task.stack: c003bfc5c000 Mar 18 14:39:09 freak kernel: [ 289.675493] NIP: d5cc0ca0 LR: d5cc0c9c CTR: c0605aa0 Mar 18 14:39:09 freak kernel: [ 289.675495] REGS: c003bfc5f6a0 TRAP: 0700 Not tainted (4.10.0-13-generic) Mar 18 14:39:09 freak kernel: [ 289.675496] MSR: 9282b033Mar 18 14:39:09 freak kernel: [ 289.675504] CR: 28008282 XER: 2000 Mar 18 14:39:09 freak kernel: [ 289.675504] CFAR: c0b568dc SOFTE: 1 Mar 18 14:39:09 freak kernel: [ 289.675504] GPR00: d5cc0c9c c003bfc5f920 d5cf2d88 002f Mar 18 14:39:09 freak kernel: [ 289.675504] GPR04: 0001 03fd 63206576 Mar 18 14:39:09 freak kernel: [ 289.675504] GPR08: c15dc700 0001 Mar 18 14:39:09 freak kernel: [ 289.675504] GPR12: 8800 cfb82d00 c0108c88 c003c51f9f00 Mar 18 14:39:09 freak kernel: [ 289.675504] GPR16: Mar 18 14:39:09 freak kernel: [ 289.675504] GPR20: c0d53990 Mar 18 14:39:09 freak kernel: [ 289.675504] GPR24: c0d53968 c14a4330 c003ab8fa800 c003bd2c20c0 Mar 18 14:39:09 freak kernel: [ 289.675504] GPR28: c003c5051098 c003ab8fa800 Mar 18 14:39:09 freak kernel: [ 289.675535] NIP [d5cc0ca0] cxl_adapter_context_unlock+0x68/0x90 [cxl] Mar 18 14:39:09 freak kernel: [ 289.675540] LR [d5cc0c9c] cxl_adapter_context_unlock+0x64/0x90 [cxl] Mar 18 14:39:09 freak kernel: [ 289.675541] Call Trace: Mar 18 14:39:09 freak kernel: [ 289.675547] [c003bfc5f920] [d5cc0c9c] cxl_adapter_context_unlock+0x64/0x90 [cxl] (unreliable) Mar 18 14:39:09 freak kernel: [ 289.675556] [c003bfc5f980] [d5cd022c] cxl_configure_adapter+0x954/0x990 [cxl] Mar 18 14:39:09 freak kernel: [ 289.675563] [c003bfc5fa30] [d5cd02c0] cxl_pci_slot_reset+0x58/0x240 [cxl] Mar 18 14:39:09 freak kernel: [ 289.675568] [c003bfc5fae0] [c003b0d4] eeh_report_reset+0x154/0x190 Mar 18 14:39:09 freak kernel: [ 289.675571] [c003bfc5fb20] [c0039428] eeh_pe_dev_traverse+0x98/0x170 Mar 18 14:39:09 freak kernel: [
[Kernel-packages] [Bug 1689360] Re: TCMU: Fix possible overwrite of t_data_sg's last iov[] and wrongly calculating base_command_size
** Changed in: ubuntu-power-systems Status: In Progress => Invalid -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1689360 Title: TCMU: Fix possible overwrite of t_data_sg's last iov[] and wrongly calculating base_command_size Status in The Ubuntu-power-systems project: Invalid Status in linux package in Ubuntu: Invalid Status in linux source package in Yakkety: Won't Fix Bug description: ---Problem Description--- If there has BIDI data, its first iov[] will overwrite the last iov[] for se_cmd->t_data_sg. ---uname output--- Latest Yakkety master branch Machine Type = P8 ---Steps to Reproduce--- Just have a system do workload using tcmu. Stack trace output: I have seen this in my environment: (gdb) print *((tcmulib_cmd->iovec)+0) $7 = {iov_base = 0x3fff7c3d, iov_len = 8192} (gdb) print *((tcmulib_cmd->iovec)+1) $3 = {iov_base = 0x3fff7c3da000, iov_len = 4096} (gdb) print *((tcmulib_cmd->iovec)+2) $4 = {iov_base = 0x3fff7c3dc000, iov_len = 16384} (gdb) print *((tcmulib_cmd->iovec)+3) $5 = {iov_base = 0x3fff7c3f7000, iov_len = 12288} (gdb) print *((tcmulib_cmd->iovec)+4) $6 = {iov_base = 0x1306e853c0028, iov_len = 128} <--- bad pointer and length cmu: Fix wrongly calculating of the base_command_size https://patchwork.kernel.org/patch/9687657/ tcmu: Fix possible overwrite of t_data_sg's last iov[] https://patchwork.kernel.org/patch/9687565/ tcmu: Skip Data-Out blocks before gathering Data-In buffer for BIDI case https://patchwork.kernel.org/patch/9655423/ This patch should also be a part of these fixes. WITH BIDI op fixes. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1689360/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1715073] Re: SRIOV: warning if unload VFs
** Changed in: ubuntu-power-systems Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1715073 Title: SRIOV: warning if unload VFs Status in The Ubuntu-power-systems project: Fix Committed Status in linux package in Ubuntu: Fix Committed Status in linux source package in Zesty: Fix Committed Status in linux source package in Artful: Fix Committed Bug description: == Comment: #0 - Carol L. Soto- 2017-02-23 16:11:47 == ---Problem Description--- When doing SRIOV if I unload VFs will see a warning: Feb 23 16:05:56 powerio-le11 kernel: [ 201.343397] mlx5_3:wait_for_async_commands:674:(pid 6272): done with all pending requests Feb 23 16:05:56 powerio-le11 kernel: [ 201.603999] iommu: Removing device 0004:01:00.2 from group 7 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604078] pci 0004:01: 0.2: [PE# 00] Removing DMA window #0 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604084] pci 0004:01: 0.2: [PE# 00] Disabling 64-bit DMA bypass Feb 23 16:05:56 powerio-le11 kernel: [ 201.604350] mlx5_core 0004:01:00.0: VF BAR0: [mem 0x2400-0x2401 64bit pref] shifted to [mem 0x2400-0x2401 64bit pref] (Disabling 1 VFs shifted by 0) Feb 23 16:05:56 powerio-le11 kernel: [ 201.604363] mlx5_core 0004:01:00.0: can't update enabled VF BAR0 [mem 0x2400-0x2401 64bit pref] Feb 23 16:05:56 powerio-le11 kernel: [ 201.604379] [ cut here ] Feb 23 16:05:56 powerio-le11 kernel: [ 201.604386] WARNING: CPU: 14 PID: 6272 at /build/linux-twbIHf/linux-4.10.0/drivers/pci/iov.c:584 pci_iov_update_resource+0x178/0x1d0 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604388] Modules linked in: mlx5_ib xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp kvm_hv kvm_pr kvm ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter bridge stp llc binfmt_misc ipmi_powernv ipmi_devintf uio_pdrv_genirq ipmi_msghandler uio vmx_crypto powernv_rng powernv_op_panel leds_powernv ibmpowernv ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi knem(OE) ip_tables x_tables autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear mlx4_en ses enclosure scsi_transport_sas crc32c_vpmsum mlx5_core mlx4_core Feb 23 16:05:56 powerio-le11 kernel: [ 201.604451] tg3 ipr devlink Feb 23 16:05:56 powerio-le11 kernel: [ 201.604456] CPU: 14 PID: 6272 Comm: bash Tainted: G OE 4.10.0-8-generic #10-Ubuntu Feb 23 16:05:56 powerio-le11 kernel: [ 201.604457] task: c00f40a6d600 task.stack: c00f40ac8000 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604459] NIP: c06721b8 LR: c06721b4 CTR: Feb 23 16:05:56 powerio-le11 kernel: [ 201.604461] REGS: c00f40acb590 TRAP: 0700 Tainted: G OE(4.10.0-8-generic) Feb 23 16:05:56 powerio-le11 kernel: [ 201.604462] MSR: 9282b033 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604470] CR: 42424422 XER: 2000 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] CFAR: c0b49db4 SOFTE: 1 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR00: c06721b4 c00f40acb810 c143c900 0063 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR04: 0001 0539 c01fff70 00021a50 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR08: 0007 0007 0001 656d5b2030524142 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR12: 4400 cfb87e00 10180df8 10189e60 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR16: 10189ed8 c00fdd0a2400 c01fff97d180 c0d46268 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR20: c0d4e410 c0d41df8 c01fff97d190 c0d4d8d8 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR24: c0d4d8e0 c00fe8f460a0 0001 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604471] GPR28: c00fe8f80f80 c00fe8f46580 c00fe8f46000 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604498] NIP [c06721b8] pci_iov_update_resource+0x178/0x1d0 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604501] LR [c06721b4] pci_iov_update_resource+0x174/0x1d0 Feb 23 16:05:56 powerio-le11 kernel: [ 201.604501] Call Trace: Feb 23 16:05:56 powerio-le11 kernel: [ 201.604504] [c00f40acb810] [c06721b4] pci_iov_update_resource+0x174/0x1d0 (unreliable) Feb 23
[Kernel-packages] [Bug 1696102] Re: xfs/073 test fails with Metadata corruption detected on xfs file system (xfsprogs)
** Changed in: ubuntu-power-systems Status: Fix Released => Fix Committed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1696102 Title: xfs/073 test fails with Metadata corruption detected on xfs file system (xfsprogs) Status in The Ubuntu-power-systems project: Fix Committed Status in linux package in Ubuntu: Invalid Status in xfsprogs package in Ubuntu: Fix Released Status in linux source package in Xenial: Invalid Status in xfsprogs source package in Xenial: Fix Committed Status in linux source package in Zesty: Invalid Status in xfsprogs source package in Zesty: Fix Committed Bug description: Impact == When making multiple copies of a V5 filesystem with xfs_copy those copies will be corrupt due to incorrect UUIDs. Test Case = See the detailed steps to create the test environment in comment #20. Regression Potential From the patch "Most of this patch is changing comments and re-ordering tests to match; the functional change is to simply use the *sb rather than the *ag_hdr to identify the proper metadata UUID." So this seems pretty straightforward and given that copy process is currently broken, because the copies are corrupt, this will be an improvement. Original Description Problem Description xfs/073 test fails with Metadata corruption detected on xfs file system. Test fails with _check_xfs_filesystem: filesystem on /mnt/test/84004.image2 is inconsistent. # diff -u tests/xfs/073.out /root/xfstests-dev/results//xfs/073.out.bad --- tests/xfs/073.out 2017-03-23 12:13:05.288877197 +0530 +++ /root/xfstests-dev/results//xfs/073.out.bad 2017-03-27 11:11:43.023059702 +0530 @@ -59,8 +59,7 @@ comparing new image geometry to old unmounting and removing new image checking new image -mounting new image on loopback -comparing new image files to old -comparing new image directories to old -comparing new image geometry to old -unmounting and removing new image +_check_xfs_filesystem: filesystem on /mnt/test/15413.image2 is inconsistent (c) +(see /root/xfstests-dev/results//xfs/073.full for details) +_check_xfs_filesystem: filesystem on /mnt/test/15413.image2 is inconsistent (r) +(see /root/xfstests-dev/results//xfs/073.full for details) Metadata corruption detected at xfs_agf block 0x1/0x200 # uname -a Linux ltc-tuleta12 4.10.0-21-generic #23~16.04.1-Ubuntu SMP Tue May 2 12:54:57 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Create a loop device with xfs filesystem 2. git clone git://git.kernel.org/pub/scm/fs/xfs/xfstests-dev.git; cd xfstests-dev 3. make 4. Create a local.conf for running with created loop device 5.. Run xfstests-dev test : ./check tests/xfs/073 Full log is attached. == Comment: #2 - Harish Sriram- 2017-05-31 01:22:11 == (In reply to comment #1) > Hi Harish, > Can you share the steps used in creating the loop device with xfs filesystem > ? > > Thank you. Create loop device: # mkdir /mnt/loop-device /mnt/test /mnt/scratch # for i in $(seq 0 1); do fallocate -o 0 -l 5GiB /mnt/loop-device/file-$i.img; done # for i in $(seq 0 1); do losetup /dev/loop$i /mnt/loop-device/file-$i.img; done Create File system: # for i in $(seq 0 1); do mkfs.ext4 -F /dev/loop$i; done # cat local.config export TEST_DEV=/dev/loop0 export TEST_DIR=/mnt/test export SCRATCH_DEV=/dev/loop1 export SCRATCH_MNT=/mnt/scratch To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1696102/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1714571] Re: [P9, POwer NV] Perf PMU event : pm_br_2path and pm_ld_miss_l1 is counted twice when perf stat is done (perf:)
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1714571 Title: [P9,POwer NV] Perf PMU event : pm_br_2path and pm_ld_miss_l1 is counted twice when perf stat is done (perf:) Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Bug description: == Comment: #0 - Sukadev Bhattiprolu - 2017-09-01 12:56:47 == ---Problem Description--- Problem description : = Perf PMU event : pm_br_2path and pm_ld_miss_l1 is counted twice , it is because two raw point to the same name. [ {, "EventCode": "0x20036", "EventName": "PM_BR_2PATH", "BriefDescription": "Branches that are not strongly biased", "PublicDescription": "" }, {, "EventCode": "0x40036", "EventName": "PM_BR_2PATH", "BriefDescription": "Branches that are not strongly biased", "PublicDescription": "" }, Ex : === [root@zzfp189p1 ~]# perf stat -e pm_br_2path sleep 1 Performance counter stats for 'sleep 1': 30,163 pm_br_2path pm_br_2path (0.00%) 1.002046416 seconds time elapsed [root@zzfp189p1 ~]# perf stat -e pm_ld_miss_l1 sleep 1 Performance counter stats for 'sleep 1': 9,217 pm_ld_miss_l1 pm_ld_miss_l1 (0.00%) 1.001956110 seconds time elapsed ---Steps to Reproduce--- Steps to reproduce : 1. Install perf (linux-tools) 2. Run perf stat -e pm_ld_miss_l1 sleep 1 perf stat -e pm_br_2path sleep 1 == Comment: #1 - Sukadev Bhattiprolu - 2017-09-01 12:58:16 == The fix for this bug was pulled into Arnaldo's tree: https://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git/commit/?h=perf/core=828e291e2982a7c10a62aa32195d0cb501cedaae To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1714571/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1715064] Re: 17.10 fails to boot on POWER9 DD2.0 with Deep stop states
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1715064 Title: 17.10 fails to boot on POWER9 DD2.0 with Deep stop states Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Status in linux source package in Artful: Fix Released Bug description: == Comment: #0 - Ranjal G. Shenoy On Boston DD2.0 system, where deep stop states such as stop4 are enabled, the 17.10 kernel Ubuntu-4.12.0-12.13 fails to boot. It requires the following upstream fixes to be backported. 1) commit 5f221c3ca13d ("powerpc/powernv/idle: Correctly initialize core_idle_state_ptr") 2) commit ec4867355244 ("powerpc/powernv/idle: Decouple Timebase restore & Per-core SPRs restore") 3) commit cb0be7ec0307 ("powerpc/powernv/idle: Restore LPCR on wakeup from deep-stop") 4) commit 1e1601b38e6e ("powerpc/powernv/idle: Restore SPRs for deep idle states via stop API.") 5) commit 22c6663dc69a ("powerpc/powernv/idle: Use Requested Level for restoring state on P9 DD1") 6) commit f9122ee4f558 ("cpuidle-powernv: Allow Deep stop states that don't stop time") 7) commit 785a12afdb4a ("powerpc/powernv/idle: Disable LOSE_FULL_CONTEXT states when stop-api fails") 8) commit e1c1cfed5432 ("powerpc/powernv: Save/Restore additional SPRs for stop4 cpuidle") 9) commit 24be85a23d1f ("powerpc/powernv: Clear PECE1 in LPCR via stop-api only on Hotplug") 10) https://patchwork.ozlabs.org/patch/808233/ ("powerpc/powernv: Clear LPCR[PECE1] via stop-api only for deep state offline") Of these 1-7 are in Linux Kernel 4.13. 8 and 9 are in powerpc/linux.git -next branch. and 10) is posted upstream which fixes 9). These patches have been backported on top of Ubuntu-4.12.0-12.13 and tested on Boston where they are working as expected. == Comment: #1 - Ranjal G. Shenoy The lower 8 bits of core_idle_state_ptr tracks the number of non-idle threads in the core. This is supposed to be initialized to bit-map corresponding to the threads_per_core. However, currently it is initialized to PNV_CORE_IDLE_THREAD_BITS (0xFF). This is correct for POWER8 which has 8 threads per core, but not for POWER9 which has 4 threads per core. As a result, on POWER9, core_idle_state_ptr gets initialized to 0xFF. In case when all the threads of the core are idle, the bits corresponding tracking the idle-threads are non-zero. As a result, the idle entry/exit code fails to save/restore per-core hypervisor state since it assumes that there are threads in the cores which are still active. Fix this by correctly initializing the lower bits of the core_idle_state_ptr on the basis of threads_per_core. Cherry-picked from commit 5f221c3ca13d ("powerpc/powernv/idle: Correctly initialize core_idle_state_ptr") == Comment: #2 - Ranjal G. Shenoy On POWER8, in case of - nap: both timebase and hypervisor state is retained. - fast-sleep: timebase is lost. But the hypervisor state is retained. - winkle: timebase and hypervisor state is lost. Hence, the current code for handling exit from a idle state assumes that if the timebase value is retained, then so is the hypervisor state. Thus, the current code doesn't restore per-core hypervisor state in such cases. But that is no longer the case on POWER9 where we do have stop states in which timebase value is retained, but the hypervisor state is lost. So we have to ensure that the per-core hypervisor state gets restored in such cases. Fix this by ensuring that even in the case when timebase is retained, we explicitly check if we are waking up from a deep stop that loses per-core hypervisor state (indicated by cr4 being eq or gt), and if this is the case, we restore the per-core hypervisor state. Cherry-picked from commit ec4867355244 ("powerpc/powernv/idle: Decouple Timebase restore & Per-core SPRs restore") == Comment: #3 - Ranjal G. Shenoy On wakeup from a deep stop state which is supposed to lose the hypervisor state, we don't restore the LPCR to the old value but set it to a "sane" value via cur_cpu_spec->cpu_restore(). The problem is that the "sane" value doesn't include UPRT and the HR bits which are required to run correctly in Radix mode. Fix this on POWER9 onwards by restoring the LPCR value whatever it was before executing the stop instruction. Cherry-picked from commit cb0be7ec0307 ("powerpc/powernv/idle: Restore LPCR on wakeup from deep-stop") == Comment: #4 - Ranjal G. Shenoy Some of the SPR values (HID0, MSR,
[Kernel-packages] [Bug 1709171] Re: Disable CONFIG_MEMORY_HOTPLUG_DEFAULT_ONLINE
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709171 Title: Disable CONFIG_MEMORY_HOTPLUG_DEFAULT_ONLINE Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Status in linux source package in Zesty: Fix Released Status in linux source package in Artful: Fix Released Bug description: == SRU Justification == IBM is requesting that we disable CONFIG_MEMORY_HOTPLUG_DEFAULT_ONLINE in the current ppc64el kernels up to 16.04.3. This was requested originally in LP#1706380, but it was not integrated in that bug. This chanage has already been made in Artful(Commit 8f189e08c9a), so that is where I cherry picked the patch from. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709171/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1711251] Re: vhost guest network randomly drops under stress (kvm)
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1711251 Title: vhost guest network randomly drops under stress (kvm) Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Status in linux source package in Zesty: Fix Released Bug description: == SRU Justification == A vhost performance patch was introduced in the 4.10 kernel upstream, and is currently included in the Zesty 4.10 kernel: commit 809ecb9bca6a9424ccd392d67e368160f8b76c92 Author: Jason WangDate: Mon Dec 12 14:46:49 2016 +0800 vhost: cache used event for better performance -- However I recently hit a functional issue linked to this patch which would cause random guests to lose their network connection under stress. This is not architecture specific and more likely to be hit with high network stress (i.e. lots of uperf instances). The patch author has now reverted this patch upstream: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/drivers/vhost?id=8d65843c44269c21e95c98090d9bb4848d473853 which reads: " Revert "vhost: cache used event for better performance" This reverts commit 809ecb9bca6a9424ccd392d67e368160f8b76c92. Since it was reported to break vhost_net. We want to cache used event and use it to check for notification. The assumption was that guest won't move the event idx back, but this could happen in fact when 16 bit index wraps around after 64K entries. Signed-off-by: Jason Wang Acked-by: Michael S. Tsirkin Signed-off-by: David S. Miller " I am requesting this patch to revert the problematic one be pulled into Ubuntu Zesty (anything 4.10+). ---uname output--- Linux p82qvirt 4.10.0-32-generic #36~16.04.1-Ubuntu SMP Wed Aug 9 09:19:19 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = 8247-22L ---Steps to Reproduce--- I can recreate the scenario with the following setup: - on a 20core host, start 20 1core VMs - I have a single linux bridge assigned to all guests using virtio - start a uperf benchmark between each guest pair (10 total) using a high number of uperf nprocs (32) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1711251/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1709179] Re: Drop GPL from of_node_to_nid() export to match other arches
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709179 Title: Drop GPL from of_node_to_nid() export to match other arches Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Status in linux source package in Zesty: Fix Released Bug description: == SRU Justification == The commit requested in this bug exposes functionality needed by the Nvidia device driver on POWER9. This patch is a clean cherry pick and specific to powerpc. This commit is in mainline as of 4.11-rc1. == Comment: #0 - Reza A. Arbab- 2017-08-07 15:34:04 == We need to backport mainline commit be9ba9ff93cc ("Drop GPL from of_node_to_nid() export to match other arches") to the Ubuntu 4.10 kernel. This exposes functionality needed by the Nvidia device driver on POWER9. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709179/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1718292] Re: POWER9: NX842 module changes
** Changed in: ubuntu-power-systems Status: Incomplete => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1718292 Title: POWER9: NX842 module changes Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Bug description: This feature request will be used to submit NX842 changes needed to support on p9 - Using VAS interfaces instead of icswx to communicate with VAS (on PowerNV), modification of HCALL interface (on PowerVM, if needed) and other format changes. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1718292/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1643527] Re: dasdfmt fails after vary online. syslog show strange message "The disk layout of the DASD is not supported"
** Changed in: ubuntu-z-systems Status: Incomplete => Invalid -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1643527 Title: dasdfmt fails after vary online. syslog show strange message "The disk layout of the DASD is not supported" Status in Ubuntu on IBM z Systems: Won't Fix Status in linux package in Ubuntu: Won't Fix Bug description: Problem description: procedure: echo 1 > /sys/bus/ccw/devices/0.0.0199/online /sbin/chzdev dasd-eckd 0199 -e -p /sbin/dasdfmt -b 4096 -d cdl -f /dev/disk/by-path/ccw-0.0.0199 -y dasdfmt fails with: /sbin/dasdfmt: Unable to open device /dev/disk/by-path/ccw-0.0.0199: No such device looking at the syslog i can see a strange message that only comes out when i am having this issue: Nov 21 08:52:12 JUUB16MS kernel: [ 67.412484] dasd-eckd 0.0.0199: The disk layout of the DASD is not supported lsdasd returns: root@JUUB16MS:~# lsdasd Bus-ID Status Name Device Type BlkSz Size Blocks == 0.0.0192 active dasda 94:0ECKD 4096 5070MB1298160 0.0.0195 active dasdb 94:4FBA 51240MB 81920 0.0.0193 n/f dasdc 94:8ECKD 0.0.0196 active dasdd 94:12 FBA 51240MB 81920 0.0.0194 n/f dasde 94:16 ECKD 0.0.0197 n/f dasdf 94:20 ECKD 0.0.0198 n/f dasdg 94:24 ECKD 0.0.0199 n/f dasdh 94:28 ECKD you can see that the 199 is not format and can't be formatted... a workaround is to vary the disk offline and then online again. this time it will start ok and you can format it and work with it. -- just to be clear... this happens after linking to an un-formatted disk dynamically and then vary it online... for example: vmcp link '*' 199 199 echo 1 > /sys/bus/ccw/devices/0.0.0199/online /sbin/chzdev dasd-eckd 0199 -e -p /sbin/dasdfmt -b 4096 -d cdl -f /dev/disk/by-path/ccw-0.0.0199 -y To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-z-systems/+bug/1643527/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1702998] Re: Ubuntu 17.04: Guest crashed @writeback_sb_inodes+0x310/0x590
** Changed in: ubuntu-power-systems Status: Triaged => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1702998 Title: Ubuntu 17.04: Guest crashed @writeback_sb_inodes+0x310/0x590 Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: In Progress Status in linux source package in Zesty: In Progress Bug description: == Comment: #0 - Lata Kuntal- 2017-03-03 00:50:54 == Ubuntu 17.04 guest dropped at xmon after crashing at writeback_sb_inodes+0x310/0x590. The guest is having XFS rootfs and NPIV disk. It crashed after 30+ hrs of BASE and NFS stress test . Crash logs === root@guskvm:~# virsh console gusg1 --force Connected to domain gusg1 Escape character is ^] 0:mon> 0:mon> t [c000a4bc7940] c036f790 writeback_sb_inodes+0x310/0x590 [c000a4bc7a50] c036faf4 __writeback_inodes_wb+0xe4/0x150 [c000a4bc7ab0] c036ff1c wb_writeback+0x2cc/0x440 [c000a4bc7b80] c0370c30 wb_workfn+0x150/0x560 [c000a4bc7c90] c00ed8c0 process_one_work+0x2b0/0x5a0 [c000a4bc7d20] c00edc58 worker_thread+0xa8/0x650 [c000a4bc7dc0] c00f67b4 kthread+0x154/0x1a0 [c000a4bc7e30] c000b4e8 ret_from_kernel_thread+0x5c/0x74 0:mon> r R00 = c036f790 R16 = c000eca70300 R01 = c000a4bc78e0 R17 = c000f7035240 R02 = c143c900 R18 = R03 = c000f7035150 R19 = R04 = 0019 R20 = c000a4bc4000 R05 = 0100 R21 = ff7f R06 = R22 = c433d758 R07 = R23 = c433d738 R08 = 00034995 R24 = R09 = R25 = R10 = 8000 R26 = c000f70351d8 R11 = c000a4bc7a40 R27 = R12 = 2200 R28 = 0001 R13 = cfb8 R29 = c433d728 R14 = R30 = c000f7035150 R15 = c000f70351d8 R31 = pc = c036c120 locked_inode_to_wb_and_lock_list+0x50/0x290 cfar= c00b2a14 kvmppc_save_tm+0x168/0x16c lr = c036f790 writeback_sb_inodes+0x310/0x590 msr = 80009033 cr = 24002482 ctr = c0381e30 xer = trap = 300 dar = dsisr = 4000 0:mon> e cpu 0x0: Vector: 300 (Data Access) at [c000a4bc7660] pc: c036c120: locked_inode_to_wb_and_lock_list+0x50/0x290 lr: c036f790: writeback_sb_inodes+0x310/0x590 sp: c000a4bc78e0 msr: 80009033 dar: 0 dsisr: 4000 current = 0xc000fbe96000 paca= 0xcfb8 softe: 0irq_happened: 0x01 pid = 17305, comm = kworker/u16:0 Linux version 4.10.0-8-generic (buildd@bos01-ppc64el-001) (gcc version 6.3.0 20161229 (Ubuntu 6.3.0-2ubuntu1) ) #10-Ubuntu SMP Mon Feb 13 14:00:06 UTC 2017 (Ubuntu 4.10.0-8.10-generic 4.10.0-rc8) 0:mon> d || 0:mon> Host and guest kernel build = 4.10.0-8-generic OPAL firmware version T side: FW860.20 (SV860_078) Boot side : FW860.20 (SV860_078) == Comment: #4 - VIPIN K. PARASHAR - 2017-03-03 02:55:20 == [140071.761707] Adding 153536k swap on /dev/loop0. Priority:-2 extents:1 across:153536k FS [140072.153143] Adding 153472k swap on /dev/loop0. Priority:-2 extents:1 across:153472k FS [140072.441833] Unable to handle kernel paging request for data at address 0x [140072.442064] Faulting instruction address: 0xc036c120 0:mon> 0:mon> e cpu 0x0: Vector: 300 (Data Access) at [c000a4bc7660] pc: c036c120: locked_inode_to_wb_and_lock_list+0x50/0x290 lr: c036f790: writeback_sb_inodes+0x310/0x590 sp: c000a4bc78e0 msr: 80009033 dar: 0 dsisr: 4000 current = 0xc000fbe96000 paca= 0xcfb8 softe: 0irq_happened: 0x01 pid = 17305, comm = kworker/u16:0 Linux version 4.10.0-8-generic (buildd@bos01-ppc64el-001) (gcc version 6.3.0 20161229 (Ubuntu 6.3.0-2ubuntu1) ) #10-Ubuntu SMP Mon Feb 13 14:00:06 UTC 2017 (Ubuntu 4.10.0-8.10-generic 4.10.0-rc8) 0:mon> t [c000a4bc7940] c036f790 writeback_sb_inodes+0x310/0x590 [c000a4bc7a50] c036faf4 __writeback_inodes_wb+0xe4/0x150 [c000a4bc7ab0] c036ff1c wb_writeback+0x2cc/0x440 [c000a4bc7b80] c0370c30 wb_workfn+0x150/0x560 [c000a4bc7c90] c00ed8c0 process_one_work+0x2b0/0x5a0 [c000a4bc7d20] c00edc58
[Kernel-packages] [Bug 1635597] Re: Ubuntu16.10:talclp1: Kdump failed with multipath disk
** Changed in: ubuntu-power-systems Status: In Progress => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1635597 Title: Ubuntu16.10:talclp1: Kdump failed with multipath disk Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: New Status in makedumpfile package in Ubuntu: Fix Released Status in linux source package in Trusty: New Status in makedumpfile source package in Trusty: Confirmed Status in linux source package in Xenial: New Status in makedumpfile source package in Xenial: Confirmed Status in linux source package in Zesty: New Status in makedumpfile source package in Zesty: Confirmed Bug description: Problem Description == On talclp1, I enabled kdump. But kdump failed and it drop to BusyBox. root@talclp1:~# echo c> /proc/sysrq-trigger [ 132.643690] sysrq: SysRq : Trigger a crash [ 132.643739] Unable to handle kernel paging request for data at address 0x [ 132.643745] Faulting instruction address: 0xc05c28f4 [ 132.643749] Oops: Kernel access of bad area, sig: 11 [#1] [ 132.643753] SMP NR_CPUS=2048 NUMA pSeries [ 132.643758] Modules linked in: fuse ufs qnx4 hfsplus hfs minix ntfs msdos jfs rpadlpar_io rpaphp rpcsec_gss_krb5 nfsv4 dccp_diag cifs nfs dns_resolver dccp tcp_diag fscache udp_diag inet_diag unix_diag af_packet_diag netlink_diag binfmt_misc xfs libcrc32c pseries_rng rng_core ghash_generic gf128mul vmx_crypto sg nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables x_tables autofs4 ext4 crc16 jbd2 fscrypto mbcache crc32c_generic btrfs xor raid6_pq dm_round_robin sr_mod sd_mod cdrom ses enclosure scsi_transport_sas ibmveth crc32c_vpmsum ipr scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath dm_mod [ 132.643819] CPU: 49 PID: 10174 Comm: bash Not tainted 4.8.0-15-generic #16-Ubuntu [ 132.643824] task: c00111767080 task.stack: c000d82e [ 132.643828] NIP: c05c28f4 LR: c05c39d8 CTR: c05c28c0 [ 132.643832] REGS: c000d82e3990 TRAP: 0300 Not tainted (4.8.0-15-generic) [ 132.643836] MSR: 80009033CR: 28242422 XER: 0001 [ 132.643848] CFAR: c00087d0 DAR: DSISR: 4200 SOFTE: 1 GPR00: c05c39d8 c000d82e3c10 c0f67b00 0063 GPR04: c0011d04a9b8 c0011d05f7e0 c0047fb0 00015998 GPR08: 0007 0001 0001 GPR12: c05c28c0 c7b4b900 2200 GPR16: 10170dc8 01002b566368 10140f58 100c7570 GPR20: 1017dd58 10153618 1017b608 GPR24: 3e87a294 0001 c0ebff60 0004 GPR28: c0ec0320 0063 c0e72a90 [ 132.643906] NIP [c05c28f4] sysrq_handle_crash+0x34/0x50 [ 132.643911] LR [c05c39d8] __handle_sysrq+0xe8/0x280 [ 132.643914] Call Trace: [ 132.643917] [c000d82e3c10] [c0a245e8] 0xc0a245e8 (unreliable) [ 132.643923] [c000d82e3c30] [c05c39d8] __handle_sysrq+0xe8/0x280 [ 132.643928] [c000d82e3cd0] [c05c4188] write_sysrq_trigger+0x78/0xa0 [ 132.643935] [c000d82e3d00] [c03ad770] proc_reg_write+0xb0/0x110 [ 132.643941] [c000d82e3d50] [c030fc3c] __vfs_write+0x6c/0xe0 [ 132.643946] [c000d82e3d90] [c0311144] vfs_write+0xd4/0x240 [ 132.643950] [c000d82e3de0] [c0312e5c] SyS_write+0x6c/0x110 [ 132.643957] [c000d82e3e30] [c00095e0] system_call+0x38/0x108 [ 132.643961] Instruction dump: [ 132.643963] 38425240 7c0802a6 f8010010 f821ffe1 6000 6000 3d220019 3949ba60 [ 132.643972] 3921 912a 7c0004ac 3940 <992a> 38210020 e8010010 7c0803a6 [ 132.643981] ---[ end trace eed6bbcd2c3bdfdf ]--- [ 132.646105] [ 132.646176] Sending IPI to other CPUs [ 132.647490] IPI complete I'm in purgatory -> smp_release_cpus() spinning_secondaries = 104 <- smp_release_cpus() [2.011346] alg: hash: Test 1 failed for crc32c-vpmsum [2.729254] sd 0:2:0:0: [sda] Assuming drive cache: write through [2.731554] sd 1:2:5:0: [sdn] Assuming drive cache: write through [2.739087] sd 1:2:4:0: [sdm] Assuming drive cache: write through [2.739089] sd 1:2:6:0: [sdo] Assuming drive cache: write through [2.739110] sd 1:2:7:0: [sdp] Assuming drive cache: write through [2.739115] sd 1:2:0:0: [sdi] Assuming drive cache: write through [2.739122] sd 1:2:3:0: [sdl] Assuming drive cache: write through [2.739123] sd 1:2:2:0: [sdk] Assuming drive cache: write through [2.739148] sd 1:2:1:0: [sdj] Assuming drive cache: write through
[Kernel-packages] [Bug 1711104] Re: [P9][Ubuntu 16.04.03] Additional PMU fixes for P9
** Changed in: ubuntu-power-systems Status: Triaged => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1711104 Title: [P9][Ubuntu 16.04.03] Additional PMU fixes for P9 Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: Triaged Status in linux source package in Zesty: Triaged Status in perforate source package in Zesty: New Bug description: == Comment: #0 - Shriya R. Kulkarni- 2017-08-16 04:07:45 == Please integrate the following PMU fixes for P9. 93fc5ca9a0048c ('powerpc/perf: Cleanup of PM_BR_CMPL vs. PM_BRU_CMPL in Power9 event list') 91e0bd1e62519b ('powerpc/perf: Add PM_LD_MISS_L1 and PM_BR_2PATH to power9 event list') 70a7e720998d5b ('powerpc/perf: Factor out PPMU_ONLY_COUNT_RUN check code from power8') 7aa345d84245a7 ('powerpc/perf: Update default sdar_mode value for power9') 101dd590a7fa37 ('powerpc/perf: Avoid spurious PMU interrupts after idle') bdd21ddb919d28 ('powerpc/perf: Avoid spurious PMU interrupts after idle on Power9') 3f0bd8dad0db73 ('powerpc/perf: Add POWER9 alternate PM_RUN_CYC and PM_RUN_INST_CMPL events') 20dd4c624d2515 ('powerpc/perf: Fix SDAR_MODE value for continous sampling on Power9') 24bedcb7c81137 ('powerpc/perf: Fix branch event code for power9') 8c218578fcbbbd ('powerpc/perf: Fix Power9 test_adder fields') To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1711104/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1707875] Re: include support for Python bindings in "perf"
** Tags removed: triage-a ** Tags added: triage-r -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1707875 Title: include support for Python bindings in "perf" Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Paul A. Clarke- 2017-07-26 09:59:57 == ---Problem Description--- Request to build python bindings support with perf. ---Steps to Reproduce--- Ubuntu-17.04# perf script -g python Python scripting not supported. Install libpython and rebuild perf to enable it. For example: # apt-get install python-dev (ubuntu) # yum install python-devel (Fedora) etc. Userspace tool common name: perf Userspace rpm: linux-tools-common Userspace tool obtained from project website: na == Comment: #2 - MAMATHA INAMDAR - 2017-08-01 05:02:34 == After installing libpython, downloaded ubuntu 17.04 linux source code and compiled perf tool then it works fine. # ./perf script -g python generated Python script: perf-script.py but when I run perf tool without rebuilding after installing libpython it shows an error # perf script -g python Python scripting not supported. Install libpython and rebuild perf to enable it. For example: # apt-get install python-dev (ubuntu) # yum install python-devel (Fedora) etc. root@p8wookie:/home/Mamatha/ubuntu/ubuntu-zesty/tools/perf# perf --version perf version 4.10.17 == Comment: #3 - MAMATHA INAMDAR - 2017-08-01 05:03:44 == Request to build python bindings support with perf. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1707875/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1715064] Re: 17.10 fails to boot on POWER9 DD2.0 with Deep stop states
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1715064 Title: 17.10 fails to boot on POWER9 DD2.0 with Deep stop states Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Ranjal G. Shenoy On Boston DD2.0 system, where deep stop states such as stop4 are enabled, the 17.10 kernel Ubuntu-4.12.0-12.13 fails to boot. It requires the following upstream fixes to be backported. 1) commit 5f221c3ca13d ("powerpc/powernv/idle: Correctly initialize core_idle_state_ptr") 2) commit ec4867355244 ("powerpc/powernv/idle: Decouple Timebase restore & Per-core SPRs restore") 3) commit cb0be7ec0307 ("powerpc/powernv/idle: Restore LPCR on wakeup from deep-stop") 4) commit 1e1601b38e6e ("powerpc/powernv/idle: Restore SPRs for deep idle states via stop API.") 5) commit 22c6663dc69a ("powerpc/powernv/idle: Use Requested Level for restoring state on P9 DD1") 6) commit f9122ee4f558 ("cpuidle-powernv: Allow Deep stop states that don't stop time") 7) commit 785a12afdb4a ("powerpc/powernv/idle: Disable LOSE_FULL_CONTEXT states when stop-api fails") 8) commit e1c1cfed5432 ("powerpc/powernv: Save/Restore additional SPRs for stop4 cpuidle") 9) commit 24be85a23d1f ("powerpc/powernv: Clear PECE1 in LPCR via stop-api only on Hotplug") 10) https://patchwork.ozlabs.org/patch/808233/ ("powerpc/powernv: Clear LPCR[PECE1] via stop-api only for deep state offline") Of these 1-7 are in Linux Kernel 4.13. 8 and 9 are in powerpc/linux.git -next branch. and 10) is posted upstream which fixes 9). These patches have been backported on top of Ubuntu-4.12.0-12.13 and tested on Boston where they are working as expected. == Comment: #1 - Ranjal G. Shenoy The lower 8 bits of core_idle_state_ptr tracks the number of non-idle threads in the core. This is supposed to be initialized to bit-map corresponding to the threads_per_core. However, currently it is initialized to PNV_CORE_IDLE_THREAD_BITS (0xFF). This is correct for POWER8 which has 8 threads per core, but not for POWER9 which has 4 threads per core. As a result, on POWER9, core_idle_state_ptr gets initialized to 0xFF. In case when all the threads of the core are idle, the bits corresponding tracking the idle-threads are non-zero. As a result, the idle entry/exit code fails to save/restore per-core hypervisor state since it assumes that there are threads in the cores which are still active. Fix this by correctly initializing the lower bits of the core_idle_state_ptr on the basis of threads_per_core. Cherry-picked from commit 5f221c3ca13d ("powerpc/powernv/idle: Correctly initialize core_idle_state_ptr") == Comment: #2 - Ranjal G. Shenoy On POWER8, in case of - nap: both timebase and hypervisor state is retained. - fast-sleep: timebase is lost. But the hypervisor state is retained. - winkle: timebase and hypervisor state is lost. Hence, the current code for handling exit from a idle state assumes that if the timebase value is retained, then so is the hypervisor state. Thus, the current code doesn't restore per-core hypervisor state in such cases. But that is no longer the case on POWER9 where we do have stop states in which timebase value is retained, but the hypervisor state is lost. So we have to ensure that the per-core hypervisor state gets restored in such cases. Fix this by ensuring that even in the case when timebase is retained, we explicitly check if we are waking up from a deep stop that loses per-core hypervisor state (indicated by cr4 being eq or gt), and if this is the case, we restore the per-core hypervisor state. Cherry-picked from commit ec4867355244 ("powerpc/powernv/idle: Decouple Timebase restore & Per-core SPRs restore") == Comment: #3 - Ranjal G. Shenoy On wakeup from a deep stop state which is supposed to lose the hypervisor state, we don't restore the LPCR to the old value but set it to a "sane" value via cur_cpu_spec->cpu_restore(). The problem is that the "sane" value doesn't include UPRT and the HR bits which are required to run correctly in Radix mode. Fix this on POWER9 onwards by restoring the LPCR value whatever it was before executing the stop instruction. Cherry-picked from commit cb0be7ec0307 ("powerpc/powernv/idle: Restore
[Kernel-packages] [Bug 1713576] Re: More migrations with constant load
Thanks for the update. Can this issue now be closed? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1713576 Title: More migrations with constant load Status in The Ubuntu-power-systems project: Triaged Status in linux package in Ubuntu: Triaged Bug description: == Comment: #0 - PUVICHAKRAVARTHY RAMACHANDRAN - 2017-08-06 13:44:45 == ---Problem Description--- Significantly higher number of task migrations when the load is fixed but not balanced across cores. ---uname output--- Linux isvbos3 4.10.0-29-generic #33~16.04.1-Ubuntu SMP Tue Jul 25 18:17:06 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux ---Additional Hardware Info--- Power9 dd2.0 Machine Type = Power9 ---Steps to Reproduce--- Benchmark : Multithreaded - cpu intensive. The system had 2 socket/ 32 cores/ SMT4 mode. When 64 threads was run - the migrations were less over 10s interval. when 80 threads were run - the migrations were very high. Ideally, it should have been very minimal, as the over all load was constant == Comment: #3 - SRIKAR DRONAMRAJU - 2017-08-11 06:56:47 == As suspected (commit : 05b40e0577 : "sched/fair: Prefer sibiling only if local group is under-utilized") https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=05b40e0577 should fix the problem Ran ' perf stat -a -r 5 -e sched:sched_migrate_task /home/srikar/work/ebizzy-0.3/ebizzy -t 35 -S 100' to detect the problem and verify the fix Here is perf stat without fix. Performance counter stats for 'system wide' (5 runs): 7,758 sched:sched_migrate_task ( +- 1.28% ) 100.015658079 seconds time elapsed ( +- 0.00% ) perf stat with fix. Performance counter stats for 'system wide' (5 runs): 415 sched:sched_migrate_task ( +- 11.74% ) 100.016021787 seconds time elapsed ( +- 0.00% ) git describe on upstream kernel says v4.11-rc2 # git describe 05b40e0577 v4.11-rc2-227-g05b40e0 == Comment: #4 - SRIKAR DRONAMRAJU - 2017-08-11 07:05:37 == Attaching the patch that needs to be applied to fix this bug. Verified that patch fixes the problem. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1713576/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1708399] Re: kernel panic -not syncing: Fatal exception: panic_on_oops
Moving to "incomplete", pending patches landing upstream. ** Changed in: ubuntu-z-systems Status: In Progress => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1708399 Title: kernel panic -not syncing: Fatal exception: panic_on_oops Status in Ubuntu on IBM z Systems: Incomplete Status in linux package in Ubuntu: In Progress Status in linux source package in Xenial: In Progress Status in linux source package in Zesty: In Progress Bug description: SRU justification: Impact: A race in context flushing is causing a kernel panic on the s390x architecture. Fix: Using a set of 3 patches (all restricted to arch code), one already upstream and the other 2 pending on linux-next. Regression risk should be low (limited to arch code and tested). Testcase: see below --- == Comment: #0 - QI YE- 2017-08-02 04:11:25 == ---Problem Description--- Ubuntu got kernel panic ---uname output--- #110-Ubuntu SMP Tue Jul 18 12:56:43 UTC 2017 s390x s390x s390x GNU/Linux ---Debugger Data--- PID: 10991 TASK: 19872a0e8 CPU: 2 COMMAND: "hyperkube" LOWCORE INFO: -psw : 0x0004c0018000 0x00115fa6 -function : pcpu_delegate at 115fa6 -prefix : 0x7fe42000 -cpu timer: 0x7ffab2827828aa50 -clock cmp: 0xd2eb8b31445e4200 -general registers: 0x0004e001 0x001283b6 0xc001 0x8380fcb8 0x00115f9e 0x0056f6e2 0x0004 0x00cf9070 0x0001f3bfc000 0x00112fd8 0x0001c72bb400 0x0002 0x7fffc000 0x007c9ef0 0x00115f9e 0x8380fc18 -access registers: 0x03ff 0x7910 00 00 00 00 00 00 00 00 00 00 00 00 00 00 -control registers: 0x14066a12 0x7e6d81c7 0x00011140 00 0x2aef 0x0400 0x5000 0x7e6d81c7 00 00 00 00 00 0x00cfc007 0xdb00 0x00011280 -floating point registers: 0x409c7e258000 0x401de4e0 00 0x3fd24407ab0e073a 0x3ff0 0x3fee 0x3fef218f8a7a41a0 0x3fee 0x0080 00 0x03ff7f80 0x02aa4940e9e0 0xd401 0x03ffe81fe110 00 0x03fff2cfe638 #0 [8380fc78] smp_find_processor_id at 1160f8 #1 [8380fc90] machine_kexec at 1135d4 #2 [8380fcb8] crash_kexec at 1fbb8a #3 [8380fd88] panic at 27d0e0 #4 [8380fe28] die at 1142cc #5 [8380fe90] do_low_address at 12215e #6 [8380fea8] pgm_check_handler at 7c2ab4 PSW: 070520018000 02aa267e0e42 (user space) GPRS: 02aa2c4fd690 0001 02aa2c4fd690 03ff7fffee38 0002 00029c0f 00c42001ea00 0001 0001 00c42001c5c8 00c42082c1a0 02aa2666325e 03ff7fffed90 Contact Information = Chee Ye / y...@cn.ibm.com Stack trace output: no Oops output: [43200.761465] docker0: port 10(vethb9132e9) entered forwarding state [50008.560926] hrtimer: interrupt took 1698076 ns [123483.768984] systemd[1]: apt-daily.timer: Adding 7h 34min 22.582204s random time. [123483.930058] systemd[1]: apt-daily.timer: Adding 2h 18min 14.857162s random time. [123484.064879] systemd[1]: apt-daily.timer: Adding 10h 46min 2.301756s random time. [123484.824760] systemd[1]: apt-daily.timer: Adding 6h 16min 22.178655s random time. [153113.703126] conntrack: generic helper won't handle protocol 47. Please consider loading the specific helper module. [477085.704538] Low-address protection: 0004 ilc:2 [#1] SMP [477085.704551] Modules linked in: xt_physdev veth xt_recent xt_comment xt_mark xt_nat ipt_MASQUERADE nf_nat_masquerade_ipv4 xfrm_user xfrm_algo iptable_nat nf_nat_ipv4 xt_addrtype nf_nat br_netfilter bridge stp llc aufs ipt_REJECT nf_reject_ipv4 xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack iptable_filter ip_tables x_tables ghash_s390 prng aes_s390 des_s390 des_generic sha512_s390 qeth_l2 sha256_s390 qeth sha1_s390 qdio sha_common ccwgroup vmur dasd_eckd_mod dasd_mod [477085.705522] CPU: 2 PID: 10991 Comm: hyperkube Not tainted 4.4.0-87-generic #110-Ubuntu [477085.705525] task: 00019872a0e8 ti: 8380c000 task.ti: 8380c000 [477085.705529] User PSW : 070520018000 02aa267e0e42
[Kernel-packages] [Bug 1708630] Re: Ubuntu17.10 - perf: Update Power9 PMU event JSON files
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1708630 Title: Ubuntu17.10 - perf: Update Power9 PMU event JSON files Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Bug description: == Comment: #0 - Sukadev Bhattiprolu- 2017-08-03 20:36:26 == +++ This bug was initially created as a clone of Bug #157304 +++ ---Problem Description--- BZ 150738 added Power9 PMU event lists to the Linux kernel tree. We need to update the event lists to include several more events as well as clean up the event descriptions of some events. The patches for these updates were posted to LKML https://lkml.org/lkml/2017/8/2/699 with: Subject: Re: [GIT PULL] Please pull JSON files for Power9 PMU events and were merged into Arnaldo's "perf-core" tree as git commits: https://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git/commit/?h=perf/core=864c572433dedfb5c36db79c4ceb6dfb80b3344b ("perf vendor events powerpc: remove suffix in mapfile") and https://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git/commit/?h=perf/core=b547e94b919c84de22052935a77de9c8a97d3418 ("perf vendor events powerpc: Update POWER9 events") We will need to include those updates into Contact Information = s...@us.ibm.com ---uname output--- na Machine Type = Powre9 ---Debugger--- A debugger is not configured ---Steps to Reproduce--- 'perf list pmu' , 'perf stat' and 'perf record' are unable to list/use some Power9 PMU events as they are missing from the kernel source tree. Stack trace output: no Oops output: no Userspace tool common name: perf Userspace rpm: linux-tools The userspace tool has the following bit modes: 64-bit System Dump Info: The system is not configured to capture a system dump. Userspace tool obtained from project website: na *Additional Instructions for s...@us.ibm.com: -Attach sysctl -a output output to the bug. -Attach ltrace and strace of userspace application. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1708630/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1696102] Re: xfs/073 test fails with Metadata corruption detected on xfs file system (xfsprogs)
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1696102 Title: xfs/073 test fails with Metadata corruption detected on xfs file system (xfsprogs) Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Invalid Status in xfsprogs package in Ubuntu: Fix Released Status in linux source package in Xenial: Invalid Status in xfsprogs source package in Xenial: Fix Released Status in linux source package in Zesty: Invalid Status in xfsprogs source package in Zesty: Fix Released Bug description: Impact == When making multiple copies of a V5 filesystem with xfs_copy those copies will be corrupt due to incorrect UUIDs. Test Case = See the detailed steps to create the test environment in comment #20. Regression Potential From the patch "Most of this patch is changing comments and re-ordering tests to match; the functional change is to simply use the *sb rather than the *ag_hdr to identify the proper metadata UUID." So this seems pretty straightforward and given that copy process is currently broken, because the copies are corrupt, this will be an improvement. Original Description Problem Description xfs/073 test fails with Metadata corruption detected on xfs file system. Test fails with _check_xfs_filesystem: filesystem on /mnt/test/84004.image2 is inconsistent. # diff -u tests/xfs/073.out /root/xfstests-dev/results//xfs/073.out.bad --- tests/xfs/073.out 2017-03-23 12:13:05.288877197 +0530 +++ /root/xfstests-dev/results//xfs/073.out.bad 2017-03-27 11:11:43.023059702 +0530 @@ -59,8 +59,7 @@ comparing new image geometry to old unmounting and removing new image checking new image -mounting new image on loopback -comparing new image files to old -comparing new image directories to old -comparing new image geometry to old -unmounting and removing new image +_check_xfs_filesystem: filesystem on /mnt/test/15413.image2 is inconsistent (c) +(see /root/xfstests-dev/results//xfs/073.full for details) +_check_xfs_filesystem: filesystem on /mnt/test/15413.image2 is inconsistent (r) +(see /root/xfstests-dev/results//xfs/073.full for details) Metadata corruption detected at xfs_agf block 0x1/0x200 # uname -a Linux ltc-tuleta12 4.10.0-21-generic #23~16.04.1-Ubuntu SMP Tue May 2 12:54:57 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Create a loop device with xfs filesystem 2. git clone git://git.kernel.org/pub/scm/fs/xfs/xfstests-dev.git; cd xfstests-dev 3. make 4. Create a local.conf for running with created loop device 5.. Run xfstests-dev test : ./check tests/xfs/073 Full log is attached. == Comment: #2 - Harish Sriram- 2017-05-31 01:22:11 == (In reply to comment #1) > Hi Harish, > Can you share the steps used in creating the loop device with xfs filesystem > ? > > Thank you. Create loop device: # mkdir /mnt/loop-device /mnt/test /mnt/scratch # for i in $(seq 0 1); do fallocate -o 0 -l 5GiB /mnt/loop-device/file-$i.img; done # for i in $(seq 0 1); do losetup /dev/loop$i /mnt/loop-device/file-$i.img; done Create File system: # for i in $(seq 0 1); do mkfs.ext4 -F /dev/loop$i; done # cat local.config export TEST_DEV=/dev/loop0 export TEST_DIR=/mnt/test export SCRATCH_DEV=/dev/loop1 export SCRATCH_MNT=/mnt/scratch To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1696102/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1721070] Re: powerpc/64s: Add workaround for P9 vector CI load issuenext
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => Critical ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1721070 Title: powerpc/64s: Add workaround for P9 vector CI load issuenext Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: Triaged Status in linux source package in Zesty: Triaged Status in linux source package in Artful: Triaged Bug description: -- Problem Description -- POWER9 DD2.1 and earlier has an issue where some cache inhibited vector load will return bad data. The workaround is two part, one firmware/microcode part triggers HMI interrupts when hitting such loads, the other part is this patch which then emulates the instructions in Linux. The affected instructions are limited to lxvd2x, lxvw4x, lxvb16x and lxvh8x. When an instruction triggers the HMI, all threads in the core will be sent to the HMI handler, not just the one running the vector load. In general, these spurious HMIs are detected by the emulation code and we just return back to the running process. Unfortunately, if a spurious interrupt occurs on a vector load that's to normal memory we have no way to detect that it's spurious (unless we walk the page tables, which is very expensive). In this case we emulate the load but we need do so using a vector load itself to ensure 128bit atomicity is preserved. Some additional debugfs emulated instruction counters are added also. In order to solve this bug, we need to cherry pick the following patch https://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux.git/commit/?h=next=5080332c2c893118dbc18755f35c8b0131cf0fc4 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1721070/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1709179] Re: Drop GPL from of_node_to_nid() export to match other arches
Apologies for the delay. This bug somehow missed our initial handling process. I've now assigned to the kernel team. ** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1709179 Title: Drop GPL from of_node_to_nid() export to match other arches Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Reza A. Arbab- 2017-08-07 15:34:04 == We need to backport mainline commit be9ba9ff93cc ("Drop GPL from of_node_to_nid() export to match other arches") to the Ubuntu 4.10 kernel. This exposes functionality needed by the Nvidia device driver on POWER9. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1709179/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1708399] Re: kernel panic -not syncing: Fatal exception: panic_on_oops
** Changed in: ubuntu-power-systems Status: Incomplete => New ** Changed in: linux (Ubuntu) Status: Incomplete => New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1708399 Title: kernel panic -not syncing: Fatal exception: panic_on_oops Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - QI YE- 2017-08-02 04:11:25 == ---Problem Description--- Ubuntu got kernel panic ---uname output--- #110-Ubuntu SMP Tue Jul 18 12:56:43 UTC 2017 s390x s390x s390x GNU/Linux ---Debugger Data--- PID: 10991 TASK: 19872a0e8 CPU: 2 COMMAND: "hyperkube" LOWCORE INFO: -psw : 0x0004c0018000 0x00115fa6 -function : pcpu_delegate at 115fa6 -prefix : 0x7fe42000 -cpu timer: 0x7ffab2827828aa50 -clock cmp: 0xd2eb8b31445e4200 -general registers: 0x0004e001 0x001283b6 0xc001 0x8380fcb8 0x00115f9e 0x0056f6e2 0x0004 0x00cf9070 0x0001f3bfc000 0x00112fd8 0x0001c72bb400 0x0002 0x7fffc000 0x007c9ef0 0x00115f9e 0x8380fc18 -access registers: 0x03ff 0x7910 00 00 00 00 00 00 00 00 00 00 00 00 00 00 -control registers: 0x14066a12 0x7e6d81c7 0x00011140 00 0x2aef 0x0400 0x5000 0x7e6d81c7 00 00 00 00 00 0x00cfc007 0xdb00 0x00011280 -floating point registers: 0x409c7e258000 0x401de4e0 00 0x3fd24407ab0e073a 0x3ff0 0x3fee 0x3fef218f8a7a41a0 0x3fee 0x0080 00 0x03ff7f80 0x02aa4940e9e0 0xd401 0x03ffe81fe110 00 0x03fff2cfe638 #0 [8380fc78] smp_find_processor_id at 1160f8 #1 [8380fc90] machine_kexec at 1135d4 #2 [8380fcb8] crash_kexec at 1fbb8a #3 [8380fd88] panic at 27d0e0 #4 [8380fe28] die at 1142cc #5 [8380fe90] do_low_address at 12215e #6 [8380fea8] pgm_check_handler at 7c2ab4 PSW: 070520018000 02aa267e0e42 (user space) GPRS: 02aa2c4fd690 0001 02aa2c4fd690 03ff7fffee38 0002 00029c0f 00c42001ea00 0001 0001 00c42001c5c8 00c42082c1a0 02aa2666325e 03ff7fffed90 Contact Information = Chee Ye / y...@cn.ibm.com Stack trace output: no Oops output: [43200.761465] docker0: port 10(vethb9132e9) entered forwarding state [50008.560926] hrtimer: interrupt took 1698076 ns [123483.768984] systemd[1]: apt-daily.timer: Adding 7h 34min 22.582204s random time. [123483.930058] systemd[1]: apt-daily.timer: Adding 2h 18min 14.857162s random time. [123484.064879] systemd[1]: apt-daily.timer: Adding 10h 46min 2.301756s random time. [123484.824760] systemd[1]: apt-daily.timer: Adding 6h 16min 22.178655s random time. [153113.703126] conntrack: generic helper won't handle protocol 47. Please consider loading the specific helper module. [477085.704538] Low-address protection: 0004 ilc:2 [#1] SMP [477085.704551] Modules linked in: xt_physdev veth xt_recent xt_comment xt_mark xt_nat ipt_MASQUERADE nf_nat_masquerade_ipv4 xfrm_user xfrm_algo iptable_nat nf_nat_ipv4 xt_addrtype nf_nat br_netfilter bridge stp llc aufs ipt_REJECT nf_reject_ipv4 xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack iptable_filter ip_tables x_tables ghash_s390 prng aes_s390 des_s390 des_generic sha512_s390 qeth_l2 sha256_s390 qeth sha1_s390 qdio sha_common ccwgroup vmur dasd_eckd_mod dasd_mod [477085.705522] CPU: 2 PID: 10991 Comm: hyperkube Not tainted 4.4.0-87-generic #110-Ubuntu [477085.705525] task: 00019872a0e8 ti: 8380c000 task.ti: 8380c000 [477085.705529] User PSW : 070520018000 02aa267e0e42 [477085.705532]R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:1 AS:0 CC:2 PM:0 EA:3 User GPRS: 02aa2c4fd690 0001 [477085.705539]02aa2c4fd690 03ff7fffee38 0002 [477085.705553]00029c0f 00c42001ea00 0001 0001 [477085.705554]00c42001c5c8 00c42082c1a0
[Kernel-packages] [Bug 1664545] Re: In Ubuntu17.04 as Kvm guest : While trigger kdump console hung having call traces
** Changed in: ubuntu-power-systems Assignee: Canonical Server Team (canonical-server) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1664545 Title: In Ubuntu17.04 as Kvm guest : While trigger kdump console hung having call traces Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Status in makedumpfile package in Ubuntu: New Bug description: In ubuntu17.04 as KVM guest on ubuntu KVM Host and trying kdump on guest while kdump process console got hung having call traces Reproducible Step: 1- Install Ubuntu17.04 as kvm guest on ubuntu kvm host 2- configure kdump 3- trigger kdump Expected Result : Kdump should capture Actual Result : Kdump console hung having continuous call traces LOG: [0.488534] Freeing unused kernel memory: 4416K (c8e8 - c92d) [0.488725] This architecture does not have kernel memory protection. Loading, please wait... starting version 232 [0.501616] random: udevadm: uninitialized urandom read (16 bytes read) [0.501830] random: udevadm: uninitialized urandom read (16 bytes read) [0.501981] random: udevadm: uninitialized urandom read (16 bytes read) [0.502162] random: udevadm: uninitialized urandom read (16 bytes read) [0.502254] random: udevadm: uninitialized urandom read (16 bytes read) [0.502433] random: udevadm: uninitialized urandom read (16 bytes read) [0.503188] random: udevadm: uninitialized urandom read (16 bytes read) [0.503270] random: udevadm: uninitialized urandom read (16 bytes read) [0.503978] random: systemd-udevd: uninitialized urandom read (16 bytes read) [0.504218] random: systemd-udevd: uninitialized urandom read (16 bytes read) [ 242.663388] INFO: task systemd-udevd:151 blocked for more than 120 seconds. [ 242.663514] Not tainted 4.9.0-15-generic #16-Ubuntu [ 242.663553] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 242.663755] systemd-udevd D0 151145 0x00040002 [ 242.663795] Call Trace: [ 242.663868] [c0001fe4ecd0] [c801c3a0] __switch_to+0x2e0/0x4c0 [ 242.663964] [c0001fe4ed30] [c8b19398] __schedule+0x2f8/0x990 [ 242.664076] [c0001fe4ee10] [c8b19a78] schedule+0x48/0xc0 [ 242.664179] [c0001fe4ee40] [c8b1de54] schedule_timeout+0x274/0x470 [ 242.664334] [c0001fe4ef30] [c8b19010] io_schedule_timeout+0xd0/0x160 [ 242.664502] [c0001fe4ef80] [c8b1a720] bit_wait_io+0x30/0x90 [ 242.664618] [c0001fe4efb0] [c8b1a168] __wait_on_bit+0xf8/0x170 [ 242.664754] [c0001fe4f000] [c824b238] wait_on_page_bit+0x98/0xb0 [ 242.664847] [c0001fe4f060] [c824d60c] do_read_cache_page+0x21c/0x4e0 [ 242.665008] [c0001fe4f0d0] [c859be78] read_dev_sector+0xb8/0x140 [ 242.665126] [c0001fe4f100] [c85a5d88] read_lba.isra.0+0x148/0x250 [ 242.665259] [c0001fe4f170] [c85a652c] efi_partition+0x12c/0x830 [ 242.665363] [c0001fe4f2e0] [c859e768] check_partition+0x158/0x2d0 [ 242.665469] [c0001fe4f360] [c859c760] rescan_partitions+0xe0/0x390 [ 242.665552] [c0001fe4f430] [c8371828] __blkdev_get+0x358/0x490 [ 242.665669] [c0001fe4f4a0] [c8372b50] blkdev_get+0x1a0/0x4a0 [ 242.665784] [c0001fe4f550] [c8599538] device_add_disk+0x4a8/0x500 [ 242.665894] [c0001fe4f600] [d0511cc8] virtblk_probe+0x560/0x928 [virtio_blk] [ 242.665983] [c0001fe4f6c0] [c8687700] virtio_dev_probe+0x1d0/0x350 [ 242.666050] [c0001fe4f700] [c8716f30] driver_probe_device+0x240/0x540 [ 242.666116] [c0001fe4f790] [c871738c] __driver_attach+0x15c/0x160 [ 242.666174] [c0001fe4f810] [c87138ec] bus_for_each_dev+0x8c/0xf0 [ 242.666232] [c0001fe4f860] [c87162e4] driver_attach+0x34/0x50 [ 242.666289] [c0001fe4f880] [c8715a78] bus_add_driver+0x238/0x380 [ 242.666345] [c0001fe4f910] [c871829c] driver_register+0x9c/0x180 [ 242.666403] [c0001fe4f980] [c8686abc] register_virtio_driver+0x4c/0x60 [ 242.666470] [c0001fe4f9a0] [d0512114] init+0x84/0xd4 [virtio_blk] [ 242.666527] [c0001fe4fa10] [c800dde8] do_one_initcall+0x68/0x1d0 [ 242.666584] [c0001fe4fad0] [c8b28e00] do_init_module+0x90/0x244 [ 242.43] [c0001fe4fb60] [c8184794] load_module+0x1614/0x17a0 [ 242.666701] [c0001fe4fd30] [c8184c60] SyS_finit_module+0xf0/0x170 [ 242.666759] [c0001fe4fe30] [c800bd84] system_call+0x38/0xe0
[Kernel-packages] [Bug 1680349] Re: Ubuntu 17.04: Kdump fails to capture dump on Firestone NV when machine crashes while running stress-ng.
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1680349 Title: Ubuntu 17.04: Kdump fails to capture dump on Firestone NV when machine crashes while running stress-ng. Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - PAVITHRA R. PRAKASH <> - 2017-03-10 02:43:10 == ---Problem Description--- Ubuntu 17.04: Kdump fails to capture dump on Firestone NV when machine crashes while running stress-ng. Machine hangs. ---Steps to Reproduce--- 1. Configure kdump. 2. Install stress-ng # apt-get install stress-ng 3. Run stress-ng # stress-ng - a 0 Logs: root@ltc-firep3:~# kdump-config load Modified cmdline:root=UUID=8b0d5b99-6087-4f40-82ea-375c83a4c139 ro quiet splash irqpoll nr_cpus=1 nousb systemd.unit=kdump-tools.service ata_piix.prefer_ms_hyperv=0 elfcorehdr=155200K * loaded kdump kernel root@ltc-firep3:~# kdump-config show DUMP_MODE:kdump USE_KDUMP:1 KDUMP_SYSCTL: kernel.panic_on_oops=1 KDUMP_COREDIR:/var/crash crashkernel addr: /var/lib/kdump/vmlinuz: symbolic link to /boot/vmlinux-4.10.0-11-generic kdump initrd: /var/lib/kdump/initrd.img: symbolic link to /var/lib/kdump/initrd.img-4.10.0-11-generic current state:ready to kdump kexec command: /sbin/kexec -p --command-line="root=UUID=8b0d5b99-6087-4f40-82ea-375c83a4c139 ro quiet splash irqpoll nr_cpus=1 nousb systemd.unit=kdump-tools.service ata_piix.prefer_ms_hyperv=0" --initrd=/var/lib/kdump/initrd.img /var/lib/kdump/vmlinuz root@ltc-firep3:~# stress-ng -a 0 stress-ng: info: [3900] defaulting to a 86400 second run per stressor stress-ng: info: [3900] dispatching hogs: 160 af-alg, 160 affinity, 160 aio, 160 aiol, 160 apparmor, 160 atomic, 160 bigheap, 160 brk, 160 bsearch, 160 cache, 160 cap, 160 chdir, 160 chmod, 160 chown, 160 chroot, 160 clock, 160 clone, 160 context, 160 copy-file, 160 cpu, 160 cpu-online, 160 crypt, 160 daemon, 160 dccp, 160 dentry, 160 dir, 160 dirdeep, 160 dnotify, 160 dup, 160 epoll, 160 eventfd, 160 exec, 160 fallocate, 160 fanotify, 160 fault, 160 fcntl, 160 fiemap, 160 fifo, 160 filename, 160 flock, 160 fork, 160 fp-error, 160 fstat, 160 full, 160 futex, 160 get, 160 getdent, 160 getrandom, 160 handle, 160 hdd, 160 heapsort, 160 hsearch, 160 icache, 160 icmp-flood, 160 inotify, 160 io, 160 iomix, 160 ioprio, 160 itimer, 160 kcmp, 160 key, 160 kill, 160 klog, 160 lease, 160 link, 160 locka, 160 lockbus, 160 lockf, 160 lockofd, 160 longjmp, 160 lsearch, 160 madvise, 160 malloc, 160 matrix, 160 membarrier, 160 memcpy, 160 memfd, 160 mergesort, 160 mincore, 160 mknod, 160 mlock, 1 60 mmap, 160 mmapfork, 160 mmapmany, 160 mq, 160 mremap, 160 msg, 160 msync, 160 netlink-proc, 160 nice, 160 nop, 160 null, 160 numa, 160 oom-pipe, 160 opcode, 160 open, 160 personality, 160 pipe, 160 poll, 160 procfs, 160 pthread, 160 ptrace, 160 pty, 160 qsort, 160 quota, 160 rdrand, 160 readahead, 160 remap, 160 rename, 160 resources, 160 rlimit, 160 rmap, 160 rtc, 160 schedpolicy, 160 sctp, 160 seal, 160 seccomp, 160 seek, 160 sem, 160 sem-sysv, 160 sendfile, 160 shm, 160 shm-sysv, 160 sigfd, 160 sigfpe, 160 sigpending, 160 sigq, 160 sigsegv, 160 sigsuspend, 160 sleep, 160 sock, 160 sockfd, 160 sockpair, 160 spawn, 160 splice, 160 stack, 160 stackmmap, 160 str, 160 stream, 160 switch, 160 symlink, 160 sync-file, 160 sysfs, 160 sysinfo, 160 tee, 160 timer, 160 timerfd, 160 tlb-shootdown, 160 tmpfs, 160 tsc, 160 tsearch, 160 udp, 160 udp-flood, 160 unshare, 160 urandom, 160 userfaultfd, 160 utime, 160 vecmath, 160 vfork, 160 vforkmany, 160 vm, 160 vm-rw, 160 vm-splice, 160 wait, 1 60 wcs, 160 xattr, 160 yield, 160 zero, 160 zlib, 160 zombie stress-ng: info: [3900] cache allocate: using built-in defaults as unable to determine cache details stress-ng: info: [3900] cache allocate: default cache size: 2048K stress-ng: info: [3907] stress-ng-atomic: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: info: [3955] stress-ng-exec: running as root, won't run test. stress-ng: info: [3999] stress-ng-icache: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: info: [4040] stress-ng-lockbus: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: info: [4313] stress-ng-numa: system has 2 of a maximum 256 memory NUMA nodes stress-ng: info: [4455] stress-ng-rdrand: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: fail: [4558] stress-ng-rtc: ioctl RTC_ALRM_READ failed, errno=22 (Invalid argument) stress-ng: fail: [4017] stress-ng-key:
[Kernel-packages] [Bug 1681909] Re: Ubuntu 17.04: dump is not captured in remote host when kdump over ssh is configured on firestone.
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1681909 Title: Ubuntu 17.04: dump is not captured in remote host when kdump over ssh is configured on firestone. Status in The Ubuntu-power-systems project: New Status in makedumpfile package in Ubuntu: New Bug description: == Comment: #0 - PAVITHRA R. PRAKASH- 2017-03-07 05:00:29 == ---Problem Description--- Ubuntu 17.04: dump is not captured in remote host when kdump over ssh is configured on firestone. ---Steps to Reproduce--- 1. Configure kdump. 2. Check whether kdump is operational using ?# kdump-config show?. 3. Install ?kernel-debuginfo? and ?kernel-debuginfo-common? rpms. 4. Setup password less ssh connection, generate rsa key. # ssh-keygen -t rsa 5. verify id_rsa and id_rsa.pub are created under /root/.ssh/ 6. Edit /etc/default/kdump-tools and add below entries. SSH="ubuntu@9.114.15.239" SSH_KEY=/root/.ssh/id_rsa 7. Propagate RSA key. # kdump-config propagate 8. Restart kdump service. # kdump-config load 9. Trigger Crash using below commands. # echo "1" > /proc/sys/kernel/sysrq # echo "c" > /proc/sysrq-trigger 10. Verify dump is available in remote server in configured path. Machine details === $ ipmitool -I lanplus -H 9.47.70.3 -U ADMIN -P admin sol activate $ ssh ubuntu@9.47.70.29 PW: shriya101 Attaching logs == Comment: #1 - PAVITHRA R. PRAKASH - 2017-03-07 05:01:42 == == Comment: #5 - PAVITHRA R. PRAKASH - 2017-03-07 23:19:46 == Hi, Attaching the logs. Network info: root@ltc-firep3:~# hwinfo --network 36: None 00.0: 10700 Loopback [Created at net.126] Unique ID: ZsBS.GQNx7L4uPNA SysFS ID: /class/net/lo Hardware Class: network interface Model: "Loopback network interface" Device File: lo Link detected: yes Config Status: cfg=new, avail=yes, need=no, active=unknown 37: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: 2lHw.ndpeucax6V1 Parent ID: mIXc.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f2 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.2 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f2 HW Address: 98:be:94:03:18:4a Permanent HW Address: 98:be:94:03:18:4a Link detected: no Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #15 (Ethernet controller) 38: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: 7Onn.ndpeucax6V1 Parent ID: sx0U.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f0 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.0 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f0 HW Address: 98:be:94:03:18:48 Permanent HW Address: 98:be:94:03:18:48 Link detected: yes Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #16 (Ethernet controller) 39: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: VwX_.ndpeucax6V1 Parent ID: DUng.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f3 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.3 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f3 HW Address: 98:be:94:03:18:4b Permanent HW Address: 98:be:94:03:18:4b Link detected: no Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #25 (Ethernet controller) 40: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: bZ1s.ndpeucax6V1 Parent ID: J7HY.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f1 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.1 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f1 HW Address: 98:be:94:03:18:49 Permanent HW Address: 98:be:94:03:18:49 Link detected: no Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #4 (Ethernet controller) root@ltc-firep3:~# Thanks, Pavithra == Comment: #6 - PAVITHRA R. PRAKASH - 2017-03-07 23:20:47 == == Comment: #7 - PAVITHRA R. PRAKASH - 2017-03-07 23:21:27 == == Comment: #8 - Urvashi Jawere - 2017-03-08 02:48:15 == I am
[Kernel-packages] [Bug 1676884] Re: kdump-tools uses the wrong crashkernel command line parameter in ppc64le
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Manoj Iyer (manjo) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1676884 Title: kdump-tools uses the wrong crashkernel command line parameter in ppc64le Status in The Ubuntu-power-systems project: New Status in makedumpfile package in Ubuntu: Fix Released Status in makedumpfile package in Debian: Fix Released Bug description: == Comment: #0 - Thiago Jung Bauermann- 2017-03-24 11:44:39 == ---Problem Description--- kdump-tools uses the wrong crashkernel command line parameter in ppc64le: u1704le?? grep crashkernel /boot/grub/grub.cfg linux /boot/vmlinux-4.10.0-13-generic root=UUID=2d6f73c7-b463-4f02-9ec4-8d4afed0635d ro crashkernel=384M-:128M 128M of reserved memory is too small for ppc64le. That happens because /etc/default/grub.d/kdump-tools.cfg links to the wrong file: u1704le?? ls -l /etc/default/grub.d/ total 8.0K lrwxrwxrwx 1 root root 39 Mar 24 13:34 kdump-tools.cfg -> /etc/default/grub.d/kdump-tools.default -rw-r--r-- 1 root root 80 Jan 5 08:07 kdump-tools.default -rw-r--r-- 1 root root 137 Jan 5 08:07 kdump-tools..ppc64el u1704le?? As can be seen, it should be pointing to kdump-tools..ppc64el but isn't. Also, kdump-tools..ppc64el has two dots in it. That doesn't seem right. Possibly just a cosmetic issue, but it would be nice if that was fixed. Contact Information = thiag...@br.ibm.com ---uname output--- Linux u1704le 4.10.0-13-generic #15-Ubuntu SMP Thu Mar 9 20:27:28 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = Any ppc64le machine. In this case, a KVM guest hosted on an 8286-42A. ---Debugger--- A debugger is not configured ---Steps to Reproduce--- sudo apt intall kdump-tools Select 'Yes' when asked whether kdump should be enabled. Userspace tool common name: kdump The userspace tool has the following bit modes: 64 bit Userspace rpm: kdump-tools Userspace tool obtained from project website: na *Additional Instructions for thiag...@br.ibm.com: -Attach ltrace and strace of userspace application. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1676884/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1689365] Re: ibmvscsis: Do not send aborted task response
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Manoj Iyer (manjo) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1689365 Title: ibmvscsis: Do not send aborted task response Status in The Ubuntu-power-systems project: Fix Committed Status in linux package in Ubuntu: Incomplete Status in linux source package in Zesty: Fix Committed Bug description: [Impact] ibmvscsis: Do not send aborted task response The driver is sending a response to the actual scsi op that was aborted by an abort task TM, while LIO is sending a response to the abort task TM. ibmvscsis_tgt does not send the response to the client until release_cmd time. The reason for this was because if we did it at queue_status time, then the client would be free to reuse the tag for that command, but we're still using the tag until the command is released at release_cmd time, so we chose to delay sending the response until then. That then caused this issue, because release_cmd is always called, even if queue_status is not. SCSI spec says that the initiator that sends the abort task TM NEVER gets a response to the aborted op and with the current code it will send a response. Thus this fix will remove that response if the CMD_T_ABORTED && !CMD_T_TAS. Another case with a small timing window is the case where if LIO sends a TMR_DOES_NOT_EXIST, and the release_cmd callback is called for the TMR Abort cmd before the release_cmd for the (attemped) aborted cmd, then we need to ensure that we send the response for the (attempted) abort cmd to the client before we send the response for the TMR Abort cmd. [Test Case] As per comment #11, this requires sending manual abort signals to trigger the bug. [Fix] ibmvscsis: Fix the incorrect req_lim_delta ibmvscsis: Clear left-over abort_cmd pointers ibmvscsis: Do not send aborted task response target: Fix unknown fabric callback queue-full errors [Regression Potential] Patches are confined to ibmvscsi driver and target driver. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1689365/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1661684] Re: ISST-LTE:pVM:roselp4:ubuntu 16.04.2: drop in xmon when running dlpar tests under stress
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1661684 Title: ISST-LTE:pVM:roselp4:ubuntu 16.04.2: drop in xmon when running dlpar tests under stress Status in The Ubuntu-power-systems project: Opinion Status in linux package in Ubuntu: Incomplete Bug description: == Comment: #0 - Ping Tian Han- 2016-12-26 21:59:52 == ---Problem Description--- When testing DLPAR, include slot/cpu/mem, under stress on roselp4, system dropped into xmon: roselp4 login: [ 95.511790] sysrq: SysRq : Changing Loglevel [ 95.511816] sysrq: Loglevel set to 9 [ 289.363833] mlx4_en 0292:60:00.0: removed PHC [ 293.123896] iommu: Removing device 0292:60:00.0 from group 3 [ 303.173744] pci_bus 0292:60: busn_res: [bus 60-ff] is released [ 303.173865] rpadlpar_io: slot PHB 658 removed [ 335.853779] iommu: Removing device 0021:01:00.0 from group 0 [ 345.893764] pci_bus 0021:01: busn_res: [bus 01-ff] is released [ 345.893869] rpadlpar_io: slot PHB 33 removed [ 382.204003] min_free_kbytes is not updated to 16885 because user defined value 551564 is preferred [ 446.143648] cpu 152 (hwid 152) Ready to die... [ 446.464057] cpu 153 (hwid 153) Ready to die... [ 446.473525] cpu 154 (hwid 154) Ready to die... [ 446.474077] cpu 155 (hwid 155) Ready to die... [ 446.483529] cpu 156 (hwid 156) Ready to die... [ 446.493532] cpu 157 (hwid 157) Ready to die... [ 446.494078] cpu 158 (hwid 158) Ready to die... [ 446.503527] cpu 159 (hwid 159) Ready to die... [ 446.664534] cpu 144 (hwid 144) Ready to die... [ 446.964113] cpu 145 (hwid 145) Ready to die... [ 446.973525] cpu 146 (hwid 146) Ready to die... [ 446.974094] cpu 147 (hwid 147) Ready to die... [ 446.983944] cpu 148 (hwid 148) Ready to die... [ 446.984062] cpu 149 (hwid 149) Ready to die... [ 446.993518] cpu 150 (hwid 150) Ready to die... [ 446.993543] Querying DEAD? cpu 150 (150) shows 2 [ 446.994098] cpu 151 (hwid 151) Ready to die... [ 447.133726] cpu 136 (hwid 136) Ready to die... [ 447.403532] cpu 137 (hwid 137) Ready to die... [ 447.403772] cpu 138 (hwid 138) Ready to die... [ 447.403839] cpu 139 (hwid 139) Ready to die... [ 447.403887] cpu 140 (hwid 140) Ready to die... [ 447.403937] cpu 141 (hwid 141) Ready to die... [ 447.403979] cpu 142 (hwid 142) Ready to die... [ 447.404038] cpu 143 (hwid 143) Ready to die... [ 447.513546] cpu 128 (hwid 128) Ready to die... [ 447.693533] cpu 129 (hwid 129) Ready to die... [ 447.693999] cpu 130 (hwid 130) Ready to die... [ 447.703530] cpu 131 (hwid 131) Ready to die... [ 447.704087] Querying DEAD? cpu 132 (132) shows 2 [ 447.704102] cpu 132 (hwid 132) Ready to die... [ 447.713534] cpu 133 (hwid 133) Ready to die... [ 447.714064] Querying DEAD? cpu 134 (134) shows 2 cpu 0x86: Vector: 300 (Data Access) at [c7b0fd40] pc: 1ec3072c lr: 1ec2fee0 sp: 1faf6bd0 msr: 800102801000 dar: 212d6c1a2a20c dsisr: 4200 current = 0xc00474c6d600 paca= 0xc7b6b600 softe: 0irq_happened: 0x01 pid = 0, comm = swapper/134 Linux version 4.8.0-34-generic (buildd@bos01-ppc64el-026) (gcc version 5.4.0 20160609 (Ubuntu/IBM 5.4.0-6ubuntu1~16.04.4) ) #36~16.04.1-Ubuntu SMP Wed Dec 21 18:53:20 UTC 2016 (Ubuntu 4.8.0-34.36~16.04.1-generic 4.8.11) WARNING: exception is not recoverable, can't continue enter ? for help SP (1faf6bd0) is in userspace 86:mon> 86:mon> t SP (1faf6bd0) is in userspace 86:mon> r R00 = 000212d6c1a2a20f R16 = c0ff1c38 R01 = 1faf6bd0 R17 = c00474c9c080 R02 = 1ed1be80 R18 = c00474c9c000 R03 = 1faf6c80 R19 = c13fdf08 R04 = 0018 R20 = c00474c9c080 R05 = 00e0 R21 = c13e8ad0 R06 = 9e04 R22 = c00474c9c000 R07 = 1faf6d30 R23 = c0047a9a1c40 R08 = 1faf6d28 R24 = 0002 R09 = 000212d6c1a2a20c R25 = c0fd4e6c R10 = 1ec1b118 R26 = c0fd4e6c R11 = 1ee7e040 R27 = c14daae0 R12 = 0163c1d8 R28 = R13 = c7b6b600 R29 = 0086 R14 = c14defb0 R30 = c0fd4e68 R15 = 0001 R31 = 1faf6bd0 pc = 1ec3072c cfar= 1ec2fedc lr = 1ec2fee0 msr = 800102801000 cr = 4200 ctr = 1ec48788 xer = 0020 trap = 300 dar = 000212d6c1a2a20c dsisr = 4200 86:mon> Contact Information = Ping Tian Han/pt...@cn.ibm.com ---uname output--- Linux roselp4 4.8.0-34-generic #36~16.04.1-Ubuntu SMP Wed Dec 21 18:53:20
[Kernel-packages] [Bug 1713576] Re: More migrations with constant load
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1713576 Title: More migrations with constant load Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - PUVICHAKRAVARTHY RAMACHANDRAN - 2017-08-06 13:44:45 == ---Problem Description--- Significantly higher number of task migrations when the load is fixed but not balanced across cores. ---uname output--- Linux isvbos3 4.10.0-29-generic #33~16.04.1-Ubuntu SMP Tue Jul 25 18:17:06 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux ---Additional Hardware Info--- Power9 dd2.0 Machine Type = Power9 ---Steps to Reproduce--- Benchmark : Multithreaded - cpu intensive. The system had 2 socket/ 32 cores/ SMT4 mode. When 64 threads was run - the migrations were less over 10s interval. when 80 threads were run - the migrations were very high. Ideally, it should have been very minimal, as the over all load was constant == Comment: #3 - SRIKAR DRONAMRAJU - 2017-08-11 06:56:47 == As suspected (commit : 05b40e0577 : "sched/fair: Prefer sibiling only if local group is under-utilized") https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=05b40e0577 should fix the problem Ran ' perf stat -a -r 5 -e sched:sched_migrate_task /home/srikar/work/ebizzy-0.3/ebizzy -t 35 -S 100' to detect the problem and verify the fix Here is perf stat without fix. Performance counter stats for 'system wide' (5 runs): 7,758 sched:sched_migrate_task ( +- 1.28% ) 100.015658079 seconds time elapsed ( +- 0.00% ) perf stat with fix. Performance counter stats for 'system wide' (5 runs): 415 sched:sched_migrate_task ( +- 11.74% ) 100.016021787 seconds time elapsed ( +- 0.00% ) git describe on upstream kernel says v4.11-rc2 # git describe 05b40e0577 v4.11-rc2-227-g05b40e0 == Comment: #4 - SRIKAR DRONAMRAJU - 2017-08-11 07:05:37 == Attaching the patch that needs to be applied to fix this bug. Verified that patch fixes the problem. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1713576/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1713575] Re: Vlun resize request could fail with cxlflash driver
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1713575 Title: Vlun resize request could fail with cxlflash driver Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - UMA KRISHNAN - 2017-08-28 12:14:12 == ---Problem Description--- Recently a regression in cxlflash driver was identified and a fix has been upstreamed to kernel.org. We would like to get that included in Xenial 16.04.3 SRU (HWE v4.10 kernel). Even though regression patch is the very last one, I have also listed the missing patches in-between, that are minor and will be easier for further cxlflash updates to Xenial. eeac8cda2c957e156093933b860eec09e488fe15 scsi: cxlflash: return -EFAULT if copy_from_user() fails 9ff870417e56b1fb7b15b2cda74de639d3cd8559 scsi: cxlflash: Fix an error handling path in 'cxlflash_disk_attach()' 48a17ad5931c3832eec68411620bc3527021c193 scsi: cxlflash: Remove unnecessary existence check 1a9e394154e34728f58c1f697b993aaaf89a4db2 scsi: cxlflash: Avoid double mutex unlock 07a191f762a7b8d0db13c38036380927116e29bb scsi: cxlflash: Fix vlun resize failure in the shrink path ---Steps to Reproduce--- Vlun resize request could fail after updating to a kernel that includes Commit 565180723294 ("scsi: cxlflash: SISlite updates to support 4 ports") To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1713575/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1714485] Re: Ubuntu 16.04.03: kdump fails with error "kdump-tools[1532]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator" when / file system is xfs.
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => High ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1714485 Title: Ubuntu 16.04.03: kdump fails with error "kdump-tools[1532]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator" when / file system is xfs. Status in The Ubuntu-power-systems project: New Status in makedumpfile package in Ubuntu: New Bug description: == Comment: #0 - PAVITHRA R. PRAKASH <> - 2017-08-31 00:33:37 == ---Problem Description--- Ubuntu 16.04.03: kdump fails with error "kdump-tools[1532]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator" when / file system is xfs. ---Steps to Reproduce--- 1. Install Ubuntu 16.04.03 with / as xfs. 2. Configure kdump. 3. trigger crash. Machine hangs after below log. Attaching console log. [ OK ] Reached target Network is Online. Starting Kernel crash dump capture service... Starting iSCSI initiator daemon (iscsid)... [ 12.263089] kdump-tools[1205]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator [ OK ] Started Kernel crash dump capture service. [ OK ] Started iSCSI initiator daemon (iscsid). Starting Login to default iSCSI targets... [ OK ] Started Login to default iSCSI targets. [ OK ] Reached target Remote File Systems (Pre). 4. After manual reboot /etc/default/kdump-tools is empty. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1714485/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1696375] Re: cpupower suggests installation of non-existent linux-cloud-tools-generic package
** Changed in: ubuntu-power-systems Status: Incomplete => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1696375 Title: cpupower suggests installation of non-existent linux-cloud-tools- generic package Status in The Ubuntu-power-systems project: Confirmed Status in linux package in Ubuntu: Incomplete Bug description: Problem Description === cpupower suggests installation of non-existent linux-cloud-tools-generic package Problem occurs on an IBM POWER 16.04 LTS system. Unknown whether it occurs on other architectures or later releases. When run _without package "linux-tools-$(uname -r)" installed_, cpupower will issue a complaint, suggesting to install (among others), some linux-cloud-tools-*-generic packages: $ cpupower WARNING: cpupower not found for kernel 4.4.0-75 You may need to install the following packages for this specific kernel: linux-tools-4.4.0-75-generic linux-cloud-tools-4.4.0-75-generic You may also want to install one of the following packages to keep up to date: linux-tools-generic linux-cloud-tools-generic However, those linux-cloud-tools-*-generic package appear not to exist --at least not in the default repos. Package linux-cloud-tools-common exists, but none of the *-generic packages are available: $ apt-cache policy "linux-cloud-*" $ cat xxx linux-cloud-tools-common: Installed: (none) Candidate: 4.4.0-78.99 Version table: 4.4.0-78.99 500 500 http://ports.ubuntu.com/ubuntu-ports xenial-proposed/main ppc64el Packages 4.4.0-77.98 500 500 http://us.ports.ubuntu.com/ubuntu-ports xenial-updates/main ppc64el Packages 4.4.0-75.96 500 500 http://us.ports.ubuntu.com/ubuntu-ports xenial-updates/main ppc64el Packages 500 http://ports.ubuntu.com/ubuntu-ports xenial-security/main ppc64el Packages [...] 4.4.0-21.37 500 500 http://us.ports.ubuntu.com/ubuntu-ports xenial/main ppc64el Packages The cpupower command works (at least doesn't give this complaint) if the appropriate linux-tools-$(uname -r) (perhaps via linux-tools-generic) is installed, so the cloud package is maybe not needed (as least for some functionality). I think the fix is to either: a) supply the cloud packages for POWER, or b) remove the non-existent packages from cpupower's complaint message. ---uname output--- Linux fs3 4.4.0-75-generic #96-Ubuntu SMP Thu Apr 20 09:55:30 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = Firestone ---Steps to Reproduce--- Install linux-tools-common but NOT linux-tools-*-generic on a POWER 16.04 LTS system, then run cpupower. Userspace rpm: linux-tools-common To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1696375/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1714571] Re: [P9, POwer NV] Perf PMU event : pm_br_2path and pm_ld_miss_l1 is counted twice when perf stat is done (perf:)
** Also affects: ubuntu-power-systems Importance: Undecided Status: New ** Changed in: ubuntu-power-systems Importance: Undecided => Medium ** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1714571 Title: [P9,POwer NV] Perf PMU event : pm_br_2path and pm_ld_miss_l1 is counted twice when perf stat is done (perf:) Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - Sukadev Bhattiprolu - 2017-09-01 12:56:47 == ---Problem Description--- Problem description : = Perf PMU event : pm_br_2path and pm_ld_miss_l1 is counted twice , it is because two raw point to the same name. [ {, "EventCode": "0x20036", "EventName": "PM_BR_2PATH", "BriefDescription": "Branches that are not strongly biased", "PublicDescription": "" }, {, "EventCode": "0x40036", "EventName": "PM_BR_2PATH", "BriefDescription": "Branches that are not strongly biased", "PublicDescription": "" }, Ex : === [root@zzfp189p1 ~]# perf stat -e pm_br_2path sleep 1 Performance counter stats for 'sleep 1': 30,163 pm_br_2path pm_br_2path (0.00%) 1.002046416 seconds time elapsed [root@zzfp189p1 ~]# perf stat -e pm_ld_miss_l1 sleep 1 Performance counter stats for 'sleep 1': 9,217 pm_ld_miss_l1 pm_ld_miss_l1 (0.00%) 1.001956110 seconds time elapsed ---Steps to Reproduce--- Steps to reproduce : 1. Install perf (linux-tools) 2. Run perf stat -e pm_ld_miss_l1 sleep 1 perf stat -e pm_br_2path sleep 1 == Comment: #1 - Sukadev Bhattiprolu - 2017-09-01 12:58:16 == The fix for this bug was pulled into Arnaldo's tree: https://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git/commit/?h=perf/core=828e291e2982a7c10a62aa32195d0cb501cedaae To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1714571/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1667245] Re: ISST-LTE:pVM:roselp4:ubuntu 17.04: kdump failed after memory DLPAR
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1667245 Title: ISST-LTE:pVM:roselp4:ubuntu 17.04: kdump failed after memory DLPAR Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: ---Problem Description--- After a memory DLPAR removal, kdump doesn't work: Starting Kernel crash dump capture service... [ 67.714593] kdump-tools[3850]: Starting kdump-tools: * running makedumpfile -c -d 31 /proc/vmcore /var/crash/201702230005/dump-incomplete Copying data : [ 2.1 %] -/usr/sbin/kdump-config: line 639: 3897 Bus error makedumpfile $MAKEDUMP_ARGS $vmcore_file $KDUMP_CORETEMP [ 72.314140] kdump-tools[3850]: * kdump-tools: makedumpfile failed, falling back to 'cp' [ 73.693881] kdump-tools[3850]: cp: error reading '/proc/vmcore': Bad address [ 73.704152] kdump-tools[3850]: * kdump-tools: failed to save vmcore in /var/crash/201702230005 [ 73.823643] kdump-tools[3850]: * running makedumpfile --dump-dmesg /proc/vmcore /var/crash/201702230005/dmesg.201702230005 [ 73.973813] kdump-tools[3850]: The kernel version is not supported. [ 73.974078] kdump-tools[3850]: The makedumpfile operation may be incomplete. [ 73.983506] kdump-tools[3850]: The dmesg log is saved to /var/crash/201702230005/dmesg.201702230005. [ 73.983752] kdump-tools[3850]: makedumpfile Completed. [ 73.983998] kdump-tools[3850]: * kdump-tools: saved dmesg content in /var/crash/201702230005 [ 74.104555] kdump-tools[3850]: Thu, 23 Feb 2017 00:05:15 -0600 [ 74.233502] kdump-tools[3850]: Failed to read reboot parameter file: No such file or directory [ 74.233782] kdump-tools[3850]: Rebooting. [ 86.629777] reboot: Restarting system The kdump service should be restarted after the memory DLPAR operation. C ---uname output--- Linux roselp4 4.10.0-8-generic #10-Ubuntu SMP Mon Feb 13 14:00:06 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = lpar ---Debugger--- A debugger is not configured ---Steps to Reproduce--- 1. config kdump on roselp4 2. do a memory DLPAR removal operation 3. trigger kdump To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1667245/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1678745] Re: Ubuntu17.04 KVM: Guest crashed @ xfs_perag_get_tag+0x6c
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) ** Changed in: ubuntu-power-systems Status: New => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1678745 Title: Ubuntu17.04 KVM: Guest crashed @ xfs_perag_get_tag+0x6c Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: Incomplete Bug description: == Comment: #0 - Lata Kuntal- 2017-03-30 09:44:23 == Ubuntu 17.04 KVM guest gusg8 was having ubuntu 16.04.2 and was running stress test IO, Base,TCP and NFS.The guest is having XFS as rootFS and after running few hours of regression test it dropped at xmon. Console logs : root@guskvm:~# virsh console gusg8 --force Connected to domain gusg8 Escape character is ^] 1:mon> r R00 = d288edf4 R16 = 024200ca R01 = c000378cb1f0 R17 = R02 = d2936080 R18 = 0020 R03 = 0001 R19 = c002734d1800 R04 = c000378cb190 R20 = R05 = R21 = R06 = 3c00d03fe056 R22 = c0027e26ccf0 R07 = R23 = R08 = c48492d0 R24 = R09 = 3c00d03fe056 R25 = R10 = 3c00d03fe062 R26 = 00024df4cd49 R11 = d28fa360 R27 = R12 = R28 = d28ac7b0 R13 = cfb80900 R29 = c4849000 R14 = R30 = R15 = c137ad08 R31 = pc = d288ee0c xfs_perag_get_tag+0x6c/0x170 [xfs] cfar= c096a494 perf_trace_mmc_request_start+0x104/0x440 lr = d288edf4 xfs_perag_get_tag+0x54/0x170 [xfs] msr = 80010280b033 cr = 82428424 ctr = c05e4950 xer = 2000 trap = 300 dar = 3c00d03fe062 dsisr = 4000 1:mon> t [c000378cb250] d28ac7b0 xfs_reclaim_inodes_count+0x70/0xa0 [xfs] [c000378cb290] d28c0ea8 xfs_fs_nr_cached_objects+0x28/0x40 [xfs] [c000378cb2b0] c03292d8 super_cache_count+0x68/0x120 [c000378cb2f0] c0271530 shrink_slab.part.14+0x150/0x4f0 [c000378cb430] c0276db8 shrink_node+0x158/0x3f0 [c000378cb4f0] c0277178 do_try_to_free_pages+0x128/0x460 [c000378cb590] c02775ac try_to_free_pages+0xfc/0x280 [c000378cb620] c0260158 __alloc_pages_nodemask+0x758/0xe30 [c000378cb7e0] c02dbb98 alloc_pages_vma+0x108/0x360 [c000378cb880] c029d080 wp_page_copy+0xf0/0x9d0 [c000378cb920] c02a0770 do_wp_page+0x210/0xb20 [c000378cb9b0] c02a656c handle_mm_fault+0x9cc/0x14c0 [c000378cba60] c0b511a0 do_page_fault+0x260/0x7d0 [c000378cbb10] c0008948 handle_page_fault+0x10/0x30 --- Exception: 301 (Data Access) at c010aec4 schedule_tail+0x84/0xb0 [c000378cbe30] c0009844 ret_from_fork+0x4/0x54 --- Exception: c00 (System Call) at 3fffa2b5bf44 1:mon> d || 1:mon> c cpus stopped: 0x0-0x3 1:mon> Kernel host build = root@guskvm:~# uname -r 4.10.0-13-generic root@guskvm:~# == Comment: #1 - Luciano Chavez - 2017-03-30 10:42:15 == At first glance, based on the following assembly from around the failure point: d288edd4 38c1 li r6,1 d288edd8 7f8802a6 mflrr28 d288eddc 78a70020 clrldi r7,r5,32 d288ede0 7c7d1b78 mr r29,r3 d288ede4 7c852378 mr r5,r4 d288ede8 386302c8 addir3,r3,712 d288edec 38810020 addir4,r1,32 d288edf0 4806b571 bl d28fa360# exit_xfs_fs+0x180c/0xfd44 [xfs] d288edf4 e8410018 ld r2,24(r1) d288edf8 2f83 cmpwi cr7,r3,0 d288edfc 409d0104 ble cr7,d288ef00# xfs_perag_get_tag+0x160/0x170 [xfs] d288ee00 7c0004ac sync d288ee04 e9210020 ld r9,32(r1) d288ee08 3949000c addir10,r9,12 d288ee0c 7fc05028 lwarx r30,0,r10 d288ee10 33de0001 addic r30,r30,1 d288ee14 7fc0512d stwcx. r30,0,r10 I believe the crash in fs_perag_get_tag() is after we come back from the radix_tree_gang_lookup_tag() call and are attempting the atomic_inc_return() and struct xfs_perag*pag is R09 = 3c00d03fe056 which is invalid. 85 rcu_read_lock(); 86 found =
[Kernel-packages] [Bug 1706033] Re: [P9, Power NV][ WSP][Ubuntu 16.04.03] : perf hw breakpoint command results in call traces and system goes for reboot.
** Changed in: ubuntu-power-systems Status: In Progress => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1706033 Title: [P9, Power NV][ WSP][Ubuntu 16.04.03] : perf hw breakpoint command results in call traces and system goes for reboot. Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: In Progress Bug description: == Comment: #0 - Shriya R. Kulkarni <> - 2017-06-14 04:38:16 == Problem Description : = While running perftool - testsuite, the perf hw breakpoint fails and it result in call traces , hence system goes for reboot . Machine details : == System : P9 , WSP , Bare metal. OS : Ubuntu 16.04.03 uname -a : Linux ltc-wspoon3 4.10.0-23-generic #25~16.04.1-Ubuntu SMP Fri Jun 9 10:43:34 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Install perf. 2. git clone perftool-testsuite. https://github.com/rfmvh/perftool-testsuite 3. Do make. 4. Test fails at step : -- [ FAIL ] -- perf_stat :: test_hw_breakpoints :: kspace address execution mem:0xc035c020:x (command exitcode + output regexp parsing and call trace is seen as system goes for reboot. Call traces : === ubuntu@ltc-wspoon3:~$ [1602513.518414] Unable to handle kernel paging request for data at address 0xc135d3b8 [1602513.518553] Faulting instruction address: 0xc02869bc [1602513.518694] Oops: Kernel access of bad area, sig: 11 [#1] [1602513.518782] SMP NR_CPUS=2048 [1602513.518784] NUMA [1602513.518842] PowerNV [1602513.518922] Modules linked in: vmx_crypto ofpart ipmi_powernv cmdlinepart ipmi_devintf powernv_flash ipmi_msghandler ibmpowernv opal_prd mtd at24 nvmem_core uio_pdrv_genirq uio autofs4 ast i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops crc32c_vpmsum ttm drm tg3 ahci libahci [1602513.519399] CPU: 27 PID: 4069 Comm: sysctl Not tainted 4.10.0-22-generic #24 [1602513.519524] task: c000203968c42c00 task.stack: c00020396571 [1602513.519624] NIP: c02869bc LR: c03f7348 CTR: c0286990 [1602513.519747] REGS: c000203965713a40 TRAP: 0300 Not tainted (4.10.0-22-generic) [1602513.519876] MSR: 90009033[1602513.519889] CR: 22002448 XER: [1602513.520058] CFAR: c03f7344 DAR: c135d3b8 DSISR: 0040 SOFTE: 1 [1602513.520058] GPR00: c03f7348 c000203965713cc0 c145d100 c134af00 [1602513.520058] GPR04: 4ee50300 c000203965713d20 c000203965713e00 [1602513.520058] GPR08: c135d100 c0b71020 [1602513.520058] GPR12: c0286990 c7b4f300 [1602513.520058] GPR16: [1602513.520058] GPR20: 0001 [1602513.520058] GPR24: 3fffc542f5a0 0400 c000203965713e00 4ee50300 [1602513.520058] GPR28: c134af00 c03fee038800 [1602513.521280] NIP [c02869bc] dirty_ratio_handler+0x2c/0x90 [1602513.521374] LR [c03f7348] proc_sys_call_handler+0x138/0x1c0 [1602513.521481] Call Trace: [1602513.521526] [c000203965713cc0] [c000203965713d00] 0xc000203965713d00 (unreliable) [1602513.521655] [c000203965713d00] [c03f7348] proc_sys_call_handler+0x138/0x1c0 [1602513.521797] [c000203965713d70] [c03436ec] __vfs_read+0x3c/0x70 [1602513.521907] [c000203965713d90] [c034516c] vfs_read+0xbc/0x1b0 [1602513.522016] [c000203965713de0] [c0346dd8] SyS_read+0x68/0x110 [1602513.522112] [c000203965713e30] [c000b184] system_call+0x38/0xe0 [1602513.522243] Instruction dump: [1602513.522303] 6042 3c4c011d 38426770 7c0802a6 6000 7c0802a6 fbc1fff0 fbe1fff8 [1602513.522445] f8010010 f821ffc1 3d22fff0 7c9f2378 4be66da9 6000 3d22fff0 [1602513.522564] ---[ end trace 17c76e13e641d3c6 ]--- [1602513.522657] It goes for reboot : After booting to Ubuntu , I see series of call traces. Ubuntu 16.04.2 LTS ltc-wspoon3 hvc0 ltc-wspoon3 login: [ 3476.626263] Unable to handle kernel paging request for data at address 0xc13ad438 [ 3476.626422] Faulting instruction address: 0xc029a140 [ 3476.626537] Oops: Kernel access of bad area, sig: 11 [#1] [ 3476.626615] SMP NR_CPUS=2048 [ 3476.626616] NUMA [ 3476.626673] PowerNV [ 3476.626746] Modules linked in: ipmi_powernv at24 ipmi_devintf nvmem_core ipmi_msghandler ofpart cmdlinepart powernv_flash mtd opal_prd vmx_crypto ibmpowernv uio_pdrv_genirq uio autofs4 ast i2c_algo_bit ttm crc32c_vpmsum drm_kms_helper syscopyarea
[Kernel-packages] [Bug 1689946] Re: Ubuntu16.04: NVMe 4K+T10 DIF/DIX format returns I/O error on dd with split op
** Changed in: ubuntu-power-systems Status: In Progress => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1689946 Title: Ubuntu16.04: NVMe 4K+T10 DIF/DIX format returns I/O error on dd with split op Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Status in linux source package in Xenial: Fix Released Status in linux source package in Yakkety: Won't Fix Status in linux source package in Zesty: Fix Released Status in linux source package in Artful: Fix Released Bug description: State: Open by: mdate on 19 March 2017 12:33:34 On a Bolt adapter in a system with Ubuntu 16.04, I've formatted the Bolt for T10 and am using it to do a dd with a 2M block size. Here are the commands: nvme format /dev/nvme0n1 --lbaf=1 --pil=0 --ms=0 --pi=2 dd if=/dev/urandom of=/dev/nvme0n1 bs=2M oflag=direct count=1 I get an error on the dd. root@x1623bp1:~# dd if=/dev/urandom of=/dev/nvme0n1 bs=2M oflag=direct count=1 dd: error writing '/dev/nvme0n1': Input/output error 1+0 records in 0+0 records out 0 bytes copied, 0.0525061 s, 0.0 kB/s dmesg shows: [589997.985151] blk_update_request: I/O error, dev nvme0n1, sector 0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1689946/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1685899] Re: [Ubuntu 17.04] - JFS related call traces and system enters xmon when rebooted after installation
** Changed in: ubuntu-power-systems Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1685899 Title: [Ubuntu 17.04] - JFS related call traces and system enters xmon when rebooted after installation Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: Incomplete Bug description: Issue: JFS related call traces and system enters xmon when rebooted after installation Steps to reproduce: - 1 - Install Ubuntu 17.04 the system with - prepboot - /root [JFS filesystem] - swap space 2 -After installation when rebooted it gives out call traces like as below: [3.895246] Unable to handle kernel paging request for data at address 0x [3.895278] Faulting instruction address: 0xd4c5df1c [3.895284] Oops: Kernel access of bad area, sig: 11 [#1] [3.895287] SMP NR_CPUS=2048 [3.895288] NUMA [3.895290] pSeries [3.895292] Modules linked in: ip_tables x_tables autofs4 jfs ibmvscsi crc32c_vpmsum [3.895301] CPU: 30 PID: 923 Comm: ureadahead Not tainted 4.9.0-15-generic #16-Ubuntu [3.895304] task: c00381d3c800 task.stack: c00381fd [3.895307] NIP: d4c5df1c LR: d4c5deb0 CTR: c01279d0 [3.895310] REGS: c00381fd3500 TRAP: 0300 Not tainted (4.9.0-15-generic) [3.895313] MSR: 8280b033[ 3.895322] CR: 48008804 XER: 0001 [3.895324] CFAR: c0008a60 DAR: DSISR: 4000 SOFTE: 1 GPR00: d4c5deb0 c00381fd3780 d4c78c28 c003802f40f0 GPR04: d4c6f6f0 d4c72b58 0563 d4c78c28 GPR08: 00180e97 d4c6a608 GPR12: c01279d0 cfb90e00 GPR16: GPR20: GPR24: 1000 d4c72b38 GPR28: 00180e97 f0e1d5c0 c003812af240 c003802f40b0 NIP [d4c5df1c] __get_metapage+0x204/0x6f0 [jfs] [3.895372] LR [d4c5deb0] __get_metapage+0x198/0x6f0 [jfs] [3.895374] Call Trace: [3.895378] [c00381fd3780] [d4c5de6c] __get_metapage+0x154/0x6f0 [jfs] (unreliable) [3.895384] [c00381fd3870] [d4c4c368] diRead+0x130/0x260 [jfs] [3.895388] [c00381fd3920] [d4c424f4] jfs_iget+0x6c/0x1e0 [jfs] [3.895393] [c00381fd3950] [d4c43adc] jfs_lookup+0xe4/0x100 [jfs] [3.895398] [c00381fd3a80] [c032a120] lookup_slow+0xe0/0x240 [3.895402] [c00381fd3b00] [c032e8a8] walk_component+0x2d8/0x3f0 [3.895406] [c00381fd3b70] [c032eb94] link_path_walk+0x1d4/0x600 [3.895409] [c00381fd3c00] [c0330c1c] path_openat+0xbc/0x480 [3.895413] [c00381fd3c80] [c03328ac] do_filp_open+0xec/0x160 [3.895417] [c00381fd3db0] [c031863c] do_sys_open+0x1cc/0x380 [3.895421] [c00381fd3e30] [c000bd84] system_call+0x38/0xe0 [3.895424] Instruction dump: [3.895426] 7909f00e 7fc9f214 3921 f93f0028 fbdf0030 e93d 71280800 41820460 [3.895433] ebdd0030 41920034 e91d0008 e93f0038 811e 80e70090 39080001 [3.895441] ---[ end trace c2aa9ba09ea05eac ]--- [3.895443] [4.088560] systemd-journald[925]: Received request to flush runtime journal from PID 1 [4.362062] crypto_register_alg 'aes' = 0 [4.362112] crypto_register_alg 'cbc(aes)' = 0 [4.362150] crypto_register_alg 'ctr(aes)' = 0 [4.362191] crypto_register_alg 'xts(aes)' = 0 [4.366949] pseries_rng: Registering IBM pSeries RNG driver When I first connected to the LPAR, it was unresponsive so I restarted it from the HMC and surprisingly it came up to the login prompt and I logged into the shell. I proceeded to install the matching linux- image-4.10.0-15-generic-dbgsym_4.10.0-15.17_ppc64el.ddeb. However, the installation of the matching dbgsym wasn't as helpful as I wanted it to be. objdump, crash tool, or addr2line wouldn't give me the source line correspond to the NIP address. I then restarted the LPAR with xmon enabled and it would drop to xmon immediately after attempting to remount / and at the same location as before at __get_metapage+0x204/0x6f0 [jfs] and again with a dereference of 0x0 as the cause of the data exception 0xd649df54 <__get_metapage+508>:ld r8,8(r29) 0xd649df58 <__get_metapage+512>:ld r9,56(r31)
[Kernel-packages] [Bug 1689365] Re: ibmvscsis: Do not send aborted task response
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1689365 Title: ibmvscsis: Do not send aborted task response Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Incomplete Status in linux source package in Zesty: Fix Released Bug description: [Impact] ibmvscsis: Do not send aborted task response The driver is sending a response to the actual scsi op that was aborted by an abort task TM, while LIO is sending a response to the abort task TM. ibmvscsis_tgt does not send the response to the client until release_cmd time. The reason for this was because if we did it at queue_status time, then the client would be free to reuse the tag for that command, but we're still using the tag until the command is released at release_cmd time, so we chose to delay sending the response until then. That then caused this issue, because release_cmd is always called, even if queue_status is not. SCSI spec says that the initiator that sends the abort task TM NEVER gets a response to the aborted op and with the current code it will send a response. Thus this fix will remove that response if the CMD_T_ABORTED && !CMD_T_TAS. Another case with a small timing window is the case where if LIO sends a TMR_DOES_NOT_EXIST, and the release_cmd callback is called for the TMR Abort cmd before the release_cmd for the (attemped) aborted cmd, then we need to ensure that we send the response for the (attempted) abort cmd to the client before we send the response for the TMR Abort cmd. [Test Case] As per comment #11, this requires sending manual abort signals to trigger the bug. [Fix] ibmvscsis: Fix the incorrect req_lim_delta ibmvscsis: Clear left-over abort_cmd pointers ibmvscsis: Do not send aborted task response target: Fix unknown fabric callback queue-full errors [Regression Potential] Patches are confined to ibmvscsi driver and target driver. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1689365/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1684054] Re: [LTCTest][Opal][FW860.20] HMI recoverable errors failed to recover and system goes to dump state.
** Changed in: ubuntu-power-systems Status: In Progress => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1684054 Title: [LTCTest][Opal][FW860.20] HMI recoverable errors failed to recover and system goes to dump state. Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Status in linux source package in Zesty: Fix Released Bug description: == Comment: #0 - Pridhiviraj Paidipeddi- 2017-04-17 06:08:41 == ---Problem Description--- HMI Recoverable error injection tests leads to system checkstop followed by system dump with ubuntu 17.04 os and kernel 4.10.0-19-generic ppc64le Contact Information = ppaid...@in.ibm.com ---uname output--- #21-Ubuntu SMP Thu Apr 6 17:03:05 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type = PowerNV 8284-22A ---System Hang--- System is in dumping state. after dump finishes system will IPL to OS again. ---Debugger--- A debugger is not configured == Comment: #3 - Pridhiviraj Paidipeddi - 2017-04-17 06:12:51 == # uname -a #21-Ubuntu SMP Thu Apr 6 17:03:05 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux # cat /etc/os-release NAME="Ubuntu" VERSION="17.04 (Zesty Zapus)" ID=ubuntu ID_LIKE=debian PRETTY_NAME="Ubuntu 17.04" VERSION_ID="17.04" HOME_URL="https://www.ubuntu.com/; SUPPORT_URL="https://help.ubuntu.com/; BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/; PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy; VERSION_CODENAME=zesty UBUNTU_CODENAME=zesty root@p8wookie:~# == Comment: #4 - Kevin W. Rudd - 2017-04-17 11:10:22 == == Comment: #5 - MAHESH J. SALGAONKAR - 2017-04-17 13:34:03 == it looks like below commit is a culprit: === commit 2337d207288f163e10bd8d4d7eeb0c1c75046a0c Author: Nicholas Piggin Date: Fri Jan 27 14:24:33 2017 +1000 powerpc/64: CONFIG_RELOCATABLE support for hmi interrupts The branch from hmi_exception_early to hmi_exception_realmode must use a "relocatable-style" branch, because it is branching from unrelocated exception code to beyond __end_interrupts. Signed-off-by: Nicholas Piggin Signed-off-by: Michael Ellerman === With the above commit changes now hmi_exception_realmode() is called using bctrl which ends up messing up TOC (r2) value and further access using new r2 results into unpredictable behaviour. c0025f50 : c0025f50: 3a 01 4c 3c addis r2,r12,314 c0025f54: b0 01 42 38 addir2,r2,432 c0025f58: a6 02 08 7c mflrr0 - With above commit the hmi_exception_early() code jumps to c0025f50 (hmi_exception_realmode+0x0) which then sets up new value for r2. If we revert above commit the code jumps to c0025f58 (hmi_exception_realmode+0x8) and hmi handler works fine. After reverting above patch I don't see this issue anymore. I have rebuilt the ubuntu kernel after reverting above patch and you can find the kernel rpm at: Can you please retry your tests with above kernel and see if issue still persists. == Comment: #6 - MAHESH J. SALGAONKAR - 2017-04-17 23:02:31 == Spoke to Michael Ellerman this morning. He helped me to identify the root cause and a fix patch beow: diff --git a/arch/powerpc/kernel/exceptions-64s.S b/arch/powerpc/kernel/exceptions-64s.S index 857bf7c5b946..7cfeb8768587 100644 --- a/arch/powerpc/kernel/exceptions-64s.S +++ b/arch/powerpc/kernel/exceptions-64s.S @@ -982,7 +982,7 @@ TRAMP_REAL_BEGIN(hmi_exception_early) EXCEPTION_PROLOG_COMMON_2(PACA_EXGEN) EXCEPTION_PROLOG_COMMON_3(0xe60) addir3,r1,STACK_FRAME_OVERHEAD - BRANCH_LINK_TO_FAR(r4, hmi_exception_realmode) + BRANCH_LINK_TO_FAR(r12, hmi_exception_realmode) /* Windup the stack. */ /* Move original HSRR0 and HSRR1 into the respective regs */ ld r9,_MSR(r1) == Comment: #7 - Pridhiviraj Paidipeddi - 2017-04-18 01:52:03 == == Comment: #8 - Pridhiviraj Paidipeddi - 2017-04-18 01:53:57 == Hi Mahesh Tested all the HMI Recoverable errors on the below patched kernel, attached the corresponding executing logs. All tests are working fine. #21 SMP Mon Apr 17 12:58:30 EDT 2017 ppc64le ppc64le ppc64le GNU/Linux Thanks == Comment: #9 - MAHESH J. SALGAONKAR - 2017-04-18
[Kernel-packages] [Bug 1686019] Re: Ubuntu 16.04.3: Qemu fails on P9
** Changed in: ubuntu-power-systems Importance: Undecided => Low -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1686019 Title: Ubuntu 16.04.3: Qemu fails on P9 Status in The Ubuntu-power-systems project: Fix Committed Status in kernel-package package in Ubuntu: Invalid Status in linux package in Ubuntu: Fix Released Status in kernel-package source package in Zesty: New Status in linux source package in Zesty: Fix Committed Bug description: Trying to start KVM on Ubuntu 16.04.3 with QEMU from dgibson 2.10 tree[1], I see the following error when trying to boot a 17.04 image. 8000 DISK : "QEMU QEMU HARDDISK2.5+" Populating /pci@8002000 No NVRAM common partition, re-initializing... Scanning USB Using default console: /vdevice/vty@7100 ted RAM kernel at 40 (17995b0 bytes) C08FF Welcome to Open Firmware Copyright (c) 2004, 2011 IBM Corporation All rights reserved. This program and the accompanying materials are made available under the terms of the BSD License available at http://www.opensource.org/licenses/bsd-license.php Booting from memory... OF stdout device is: /vdevice/vty@7100 Preparing to boot Linux version 4.10.0-19-generic (buildd@bos01-ppc64el-009) (gcc version 6.3.0 20170321 (Ubuntu 6.3.0-10ubuntu1) ) #21-Ubuntu SMP Thu Apr 6 17:03:05 UTC 2017 (Ubuntu 4.10.0-19.21-generic 4.10.8) Detected machine type: 0101 command line: debug initcall_debug udbg-immortal console=/dev/hvc0 Max number of cores passed to firmware: 2048 (NR_CPUS = 2048) Calling ibm,client-architecture-support... done memory layout at init: memory_limit : (16 MB aligned) alloc_bottom : 01bb alloc_top: 1000 alloc_top_hi : 8000 rmo_top : 1000 ram_top : 8000 instantiating rtas at 0x0daf... done prom_hold_cpus: skipped copying OF device tree... Building dt strings... Building dt structure... Device tree strings 0x041c -> 0x041c09fd Device tree struct 0x041d -> 0x041e Quiescing Open Firmware ... Booting Linux via __start() @ 0x0040 ... I tried to add some debug options as "debug initcall_debug udbg-immortal console=/dev/hvc0" but no luck. [1] https://github.com/dgibson/qemu.git branch ppc-for-2.10 ---uname output--- 4.10.0-19 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1686019/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1664545] Re: In Ubuntu17.04 as Kvm guest : While trigger kdump console hung having call traces
Following comment #8, closing bug. ** Changed in: ubuntu-power-systems Status: New => Fix Released ** Changed in: linux (Ubuntu) Status: New => Fix Released ** Changed in: makedumpfile (Ubuntu) Status: New => Fix Released -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1664545 Title: In Ubuntu17.04 as Kvm guest : While trigger kdump console hung having call traces Status in The Ubuntu-power-systems project: Fix Released Status in linux package in Ubuntu: Fix Released Status in makedumpfile package in Ubuntu: Fix Released Bug description: In ubuntu17.04 as KVM guest on ubuntu KVM Host and trying kdump on guest while kdump process console got hung having call traces Reproducible Step: 1- Install Ubuntu17.04 as kvm guest on ubuntu kvm host 2- configure kdump 3- trigger kdump Expected Result : Kdump should capture Actual Result : Kdump console hung having continuous call traces LOG: [0.488534] Freeing unused kernel memory: 4416K (c8e8 - c92d) [0.488725] This architecture does not have kernel memory protection. Loading, please wait... starting version 232 [0.501616] random: udevadm: uninitialized urandom read (16 bytes read) [0.501830] random: udevadm: uninitialized urandom read (16 bytes read) [0.501981] random: udevadm: uninitialized urandom read (16 bytes read) [0.502162] random: udevadm: uninitialized urandom read (16 bytes read) [0.502254] random: udevadm: uninitialized urandom read (16 bytes read) [0.502433] random: udevadm: uninitialized urandom read (16 bytes read) [0.503188] random: udevadm: uninitialized urandom read (16 bytes read) [0.503270] random: udevadm: uninitialized urandom read (16 bytes read) [0.503978] random: systemd-udevd: uninitialized urandom read (16 bytes read) [0.504218] random: systemd-udevd: uninitialized urandom read (16 bytes read) [ 242.663388] INFO: task systemd-udevd:151 blocked for more than 120 seconds. [ 242.663514] Not tainted 4.9.0-15-generic #16-Ubuntu [ 242.663553] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 242.663755] systemd-udevd D0 151145 0x00040002 [ 242.663795] Call Trace: [ 242.663868] [c0001fe4ecd0] [c801c3a0] __switch_to+0x2e0/0x4c0 [ 242.663964] [c0001fe4ed30] [c8b19398] __schedule+0x2f8/0x990 [ 242.664076] [c0001fe4ee10] [c8b19a78] schedule+0x48/0xc0 [ 242.664179] [c0001fe4ee40] [c8b1de54] schedule_timeout+0x274/0x470 [ 242.664334] [c0001fe4ef30] [c8b19010] io_schedule_timeout+0xd0/0x160 [ 242.664502] [c0001fe4ef80] [c8b1a720] bit_wait_io+0x30/0x90 [ 242.664618] [c0001fe4efb0] [c8b1a168] __wait_on_bit+0xf8/0x170 [ 242.664754] [c0001fe4f000] [c824b238] wait_on_page_bit+0x98/0xb0 [ 242.664847] [c0001fe4f060] [c824d60c] do_read_cache_page+0x21c/0x4e0 [ 242.665008] [c0001fe4f0d0] [c859be78] read_dev_sector+0xb8/0x140 [ 242.665126] [c0001fe4f100] [c85a5d88] read_lba.isra.0+0x148/0x250 [ 242.665259] [c0001fe4f170] [c85a652c] efi_partition+0x12c/0x830 [ 242.665363] [c0001fe4f2e0] [c859e768] check_partition+0x158/0x2d0 [ 242.665469] [c0001fe4f360] [c859c760] rescan_partitions+0xe0/0x390 [ 242.665552] [c0001fe4f430] [c8371828] __blkdev_get+0x358/0x490 [ 242.665669] [c0001fe4f4a0] [c8372b50] blkdev_get+0x1a0/0x4a0 [ 242.665784] [c0001fe4f550] [c8599538] device_add_disk+0x4a8/0x500 [ 242.665894] [c0001fe4f600] [d0511cc8] virtblk_probe+0x560/0x928 [virtio_blk] [ 242.665983] [c0001fe4f6c0] [c8687700] virtio_dev_probe+0x1d0/0x350 [ 242.666050] [c0001fe4f700] [c8716f30] driver_probe_device+0x240/0x540 [ 242.666116] [c0001fe4f790] [c871738c] __driver_attach+0x15c/0x160 [ 242.666174] [c0001fe4f810] [c87138ec] bus_for_each_dev+0x8c/0xf0 [ 242.666232] [c0001fe4f860] [c87162e4] driver_attach+0x34/0x50 [ 242.666289] [c0001fe4f880] [c8715a78] bus_add_driver+0x238/0x380 [ 242.666345] [c0001fe4f910] [c871829c] driver_register+0x9c/0x180 [ 242.666403] [c0001fe4f980] [c8686abc] register_virtio_driver+0x4c/0x60 [ 242.666470] [c0001fe4f9a0] [d0512114] init+0x84/0xd4 [virtio_blk] [ 242.666527] [c0001fe4fa10] [c800dde8] do_one_initcall+0x68/0x1d0 [ 242.666584] [c0001fe4fad0] [c8b28e00] do_init_module+0x90/0x244 [ 242.43] [c0001fe4fb60] [c8184794] load_module+0x1614/0x17a0 [ 242.666701] [c0001fe4fd30] [c8184c60] SyS_finit_module+0xf0/0x170 [
[Kernel-packages] [Bug 1680349] Re: Ubuntu 17.04: Kdump fails to capture dump on Firestone NV when machine crashes while running stress-ng.
** Tags added: ppc64el-kdump -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1680349 Title: Ubuntu 17.04: Kdump fails to capture dump on Firestone NV when machine crashes while running stress-ng. Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: New Bug description: == Comment: #0 - PAVITHRA R. PRAKASH <> - 2017-03-10 02:43:10 == ---Problem Description--- Ubuntu 17.04: Kdump fails to capture dump on Firestone NV when machine crashes while running stress-ng. Machine hangs. ---Steps to Reproduce--- 1. Configure kdump. 2. Install stress-ng # apt-get install stress-ng 3. Run stress-ng # stress-ng - a 0 Logs: root@ltc-firep3:~# kdump-config load Modified cmdline:root=UUID=8b0d5b99-6087-4f40-82ea-375c83a4c139 ro quiet splash irqpoll nr_cpus=1 nousb systemd.unit=kdump-tools.service ata_piix.prefer_ms_hyperv=0 elfcorehdr=155200K * loaded kdump kernel root@ltc-firep3:~# kdump-config show DUMP_MODE:kdump USE_KDUMP:1 KDUMP_SYSCTL: kernel.panic_on_oops=1 KDUMP_COREDIR:/var/crash crashkernel addr: /var/lib/kdump/vmlinuz: symbolic link to /boot/vmlinux-4.10.0-11-generic kdump initrd: /var/lib/kdump/initrd.img: symbolic link to /var/lib/kdump/initrd.img-4.10.0-11-generic current state:ready to kdump kexec command: /sbin/kexec -p --command-line="root=UUID=8b0d5b99-6087-4f40-82ea-375c83a4c139 ro quiet splash irqpoll nr_cpus=1 nousb systemd.unit=kdump-tools.service ata_piix.prefer_ms_hyperv=0" --initrd=/var/lib/kdump/initrd.img /var/lib/kdump/vmlinuz root@ltc-firep3:~# stress-ng -a 0 stress-ng: info: [3900] defaulting to a 86400 second run per stressor stress-ng: info: [3900] dispatching hogs: 160 af-alg, 160 affinity, 160 aio, 160 aiol, 160 apparmor, 160 atomic, 160 bigheap, 160 brk, 160 bsearch, 160 cache, 160 cap, 160 chdir, 160 chmod, 160 chown, 160 chroot, 160 clock, 160 clone, 160 context, 160 copy-file, 160 cpu, 160 cpu-online, 160 crypt, 160 daemon, 160 dccp, 160 dentry, 160 dir, 160 dirdeep, 160 dnotify, 160 dup, 160 epoll, 160 eventfd, 160 exec, 160 fallocate, 160 fanotify, 160 fault, 160 fcntl, 160 fiemap, 160 fifo, 160 filename, 160 flock, 160 fork, 160 fp-error, 160 fstat, 160 full, 160 futex, 160 get, 160 getdent, 160 getrandom, 160 handle, 160 hdd, 160 heapsort, 160 hsearch, 160 icache, 160 icmp-flood, 160 inotify, 160 io, 160 iomix, 160 ioprio, 160 itimer, 160 kcmp, 160 key, 160 kill, 160 klog, 160 lease, 160 link, 160 locka, 160 lockbus, 160 lockf, 160 lockofd, 160 longjmp, 160 lsearch, 160 madvise, 160 malloc, 160 matrix, 160 membarrier, 160 memcpy, 160 memfd, 160 mergesort, 160 mincore, 160 mknod, 160 mlock, 160 mmap, 160 mmapfork, 160 mmapmany, 160 mq, 160 mremap, 160 msg, 160 msync, 160 netlink-proc, 160 nice, 160 nop, 160 null, 160 numa, 160 oom-pipe, 160 opcode, 160 open, 160 personality, 160 pipe, 160 poll, 160 procfs, 160 pthread, 160 ptrace, 160 pty, 160 qsort, 160 quota, 160 rdrand, 160 readahead, 160 remap, 160 rename, 160 resources, 160 rlimit, 160 rmap, 160 rtc, 160 schedpolicy, 160 sctp, 160 seal, 160 seccomp, 160 seek, 160 sem, 160 sem-sysv, 160 sendfile, 160 shm, 160 shm-sysv, 160 sigfd, 160 sigfpe, 160 sigpending, 160 sigq, 160 sigsegv, 160 sigsuspend, 160 sleep, 160 sock, 160 sockfd, 160 sockpair, 160 spawn, 160 splice, 160 stack, 160 stackmmap, 160 str, 160 stream, 160 switch, 160 symlink, 160 sync-file, 160 sysfs, 160 sysinfo, 160 tee, 160 timer, 160 timerfd, 160 tlb-shootdown, 160 tmpfs, 160 tsc, 160 tsearch, 160 udp, 160 udp-flood, 160 unshare, 160 urandom, 160 userfaultfd, 160 utime, 160 vecmath, 160 vfork, 160 vforkmany, 160 vm, 160 vm-rw, 160 vm-splice, 160 wait, 160 wcs, 160 xattr, 160 yield, 160 zero, 160 zlib, 160 zombie stress-ng: info: [3900] cache allocate: using built-in defaults as unable to determine cache details stress-ng: info: [3900] cache allocate: default cache size: 2048K stress-ng: info: [3907] stress-ng-atomic: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: info: [3955] stress-ng-exec: running as root, won't run test. stress-ng: info: [3999] stress-ng-icache: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: info: [4040] stress-ng-lockbus: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: info: [4313] stress-ng-numa: system has 2 of a maximum 256 memory NUMA nodes stress-ng: info: [4455] stress-ng-rdrand: this stressor is not implemented on this system: ppc64le Linux 4.10.0-11-generic stress-ng: fail: [4558] stress-ng-rtc: ioctl RTC_ALRM_READ failed, errno=22 (Invalid argument) stress-ng: fail: [4017] stress-ng-key: keyctl KEYCTL_DESCRIBE failed, errno=127 (Key has expired) stress-ng: fail: [4017]
[Kernel-packages] [Bug 1658733] Re: Ubuntu 16.04.2KVM:kdump fails to mount root file system when noirqdistrib is missing as dump kernel parameter
** Tags added: ppc64el-kdump -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to kexec-tools in Ubuntu. https://bugs.launchpad.net/bugs/1658733 Title: Ubuntu 16.04.2KVM:kdump fails to mount root file system when noirqdistrib is missing as dump kernel parameter Status in The Ubuntu-power-systems project: Confirmed Status in kexec-tools package in Ubuntu: Invalid Status in makedumpfile package in Ubuntu: In Progress Status in kexec-tools source package in Trusty: New Status in makedumpfile source package in Trusty: New Status in kexec-tools source package in Xenial: New Status in makedumpfile source package in Xenial: In Progress Status in kexec-tools source package in Zesty: New Status in makedumpfile source package in Zesty: New Status in kexec-tools source package in Artful: Invalid Status in makedumpfile source package in Artful: In Progress Bug description: [Impact] On Power Systems, some interrupts are missed, and dumping the crash will fail. Adding the noirqdistrib kernel parameter to the kdump kernel will fix this. [Test Case] Setting up kdump to target a virtio-scsi device on a Power System. [Regression Potential] The parameter could be interpreted differently on a different platform and kdump would fail. However, it has been verified that no other platform uses such parameter. If another parameter would have been incorrectly removed on the patch, kdump could fail on other systems. == Comment: #0 - Richard M. Scheller - 2016-12-14 16:50:26 == ---Problem Description--- On a KVM guest installed to a multipath root device, the kdump kernel fails to mount the root file system. This error does not occur in a similar guest installed to a single path device. Full console output of the kdump failure is attached. These messages from the output may be relevant: Begin: Loading multipath modules ... Success: loaded module dm-multipath. done. Begin: Loading multipath hardware handlers ... Failure: failed to load module sc si_dh_alua. Failure: failed to load module scsi_dh_rdac. Failure: failed to load module scsi_dh_emc. done. Begin: Starting multipathd ... done. ---uname output--- Linux dotg9 4.8.0-32-generic #34~16.04.1-Ubuntu SMP Tue Dec 13 17:01:57 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux Machine Type = 8247-22L Ubuntu 16.04.1 KVM guest ---Steps to Reproduce--- - Install Ubuntu 16.04.1 to a muiltpath target disk - Install kdump-tools package - Configure kexec-tools to reserve sufficient RAM for the kdump kernel to load (I use 512MB) in /etc/default/grub.d/kexec-tools.cfg - Run update-grub - Reboot - Initiate a system crash using "echo c > /proc/sysrq-trigger" == Comment: #12 - Richard M. Scheller - 2016-12-20 20:37:45 == Here is the log level 8 kdump console log requested in comment 10. == Comment: #21 - Richard M. Scheller - 2017-01-06 11:04:17 == (In reply to comment #19) > Hi, I logged in dotkvm and I couldn't find the guest dotg9. Also, although I > found a dotg9.xml in /kte/xml/ it doesn't look like it uses multipath (it > uses .img files which I didn't found as disks). > > Could you please recreate the guest for further debug? Yes, I recreated the guest with its correct multipath lun configuration. I have also attached the guest XML to this bug. > Besides that could you please let us know: > - is the multipath the system's root? I mean / is installed/mounted on the > multipath device? Yes, the guest has only one disk. That disk is actually a LUN from a fiber channel storage device with two paths on the host side. I have passed through both paths to the guest, so the multipath nature of the target disk is known to the guest. In other words, the guest sees a multipath device and is using it as a multipath device. The root file system is called /dev/mapper/mpatha- part2 on the guest. > - how did you attach the device to the guest? Each FC LUN path on the host is mapped to a virtio-scsi controller on the guest using LUN passthrough. (See the guest XML for details on this.) == Comment: #22 - Mauro Sergio Martins Rodrigues - 2017-01-11 09:31:38 == I managed to get kdump to mount rootfs and perform its tasks by setting KDUMP_CMDLINE_APPEND="nr_cpus=4" parameter in /etc/default/kdump-tools see http://pastebin.hursley.ibm.com/8239 I'm still investigating to figure out what is the reason behind this behavior. Thanks, -- maurosr == Comment: #23 - Mauricio Faria De Oliveira - 2017-01-11 11:56:40 == Mauro, (In reply to comment #22) > I managed to get kdump to mount rootfs and perform its tasks by setting > KDUMP_CMDLINE_APPEND="nr_cpus=4" parameter in /etc/default/kdump-tools see > http://pastebin.hursley.ibm.com/8239 > > I'm still investigating to figure out what is the reason behind this > behavior. > > Thanks, > > -- > maurosr
[Kernel-packages] [Bug 1704730] Re: xfstest fails with corrupt file /mnt/scratch/1 - non-zero size but no extents ( ext4 )
** Tags removed: triage-a ** Tags added: triage-r -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1704730 Title: xfstest fails with corrupt file /mnt/scratch/1 - non-zero size but no extents ( ext4 ) Status in The Ubuntu-power-systems project: Triaged Status in linux package in Ubuntu: Triaged Bug description: xfstests fails non-zero size but no extents on ext4 filesystem Environment -- Kernel Build: 4.12.1-041201-generic Model : 8247-22L Platform: PowerNV ( P8 ) Uname output --- # uname -a Linux ltc-test-ci2 4.12.1-041201-generic #201707121132 SMP Wed Jul 12 17:03:25 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Steps to reproduce: 1. Create a loop device with ext4 filesystem 2. git clone git://git.kernel.org/pub/scm/fs/xfs/xfstests-dev.git; cd xfstests-dev 3. make 4. Create a local.config for running with created loop device 5. Run xfstests-dev test : ./check tests/generic/044 The test 044 fails with following generic/044 - output mismatch (see /root/harish/xfstests-dev/results//generic/044.out.bad) --- tests/generic/044.out 2017-07-13 06:04:36.208323135 -0400 +++ /root/harish/xfstests-dev/results//generic/044.out.bad 2017-07-14 06:24:08.153731112 -0400 @@ -1 +1,1000 @@ QA output created by 044 +corrupt file /mnt/scratch/1 - non-zero size but no extents +corrupt file /mnt/scratch/2 - non-zero size but no extents +corrupt file /mnt/scratch/3 - non-zero size but no extents +corrupt file /mnt/scratch/4 - non-zero size but no extents +corrupt file /mnt/scratch/5 - non-zero size but no extents +corrupt file /mnt/scratch/6 - non-zero size but no extents ... (Run 'diff -u tests/generic/044.out /root/harish/xfstests-dev/results//generic/044.out.bad' to see the entire diff) Ran: generic/044 Failures: generic/044 Failed 1 of 1 tests Dmesg: -- [17244.878673] EXT4-fs (loop0): mounted filesystem with ordered data mode. Opts: (null) [17245.517227] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17245.697100] EXT4-fs (loop0): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17245.710634] run fstests generic/044 at 2017-07-14 06:23:49 [17246.534410] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17246.535534] EXT4-fs (loop2): shut down requested (1) [17246.535625] Aborting journal on device loop2-8. [17247.278467] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17259.888304] EXT4-fs (loop2): shut down requested (2) [17259.995751] Aborting journal on device loop2-8. [17260.113582] EXT4-fs (loop2): recovery complete [17260.113902] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17260.190076] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr [17264.821978] EXT4-fs (loop2): mounted filesystem with ordered data mode. Opts: acl,user_xattr == Comment: #2 - SEETEENA THOUFEEK- 2017-07-17 02:10:52 == Issue does not happen when running the same test with xfs file system. (ie, creating loop device with xfs file system). To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1704730/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1708130] Re: ISST-LTE: Ubuntu16.04.03: PowerNV: 'ppc64_cpu' commands hangs while changing SMT value with Leaf IO and BASE tests
** Changed in: ubuntu-power-systems Status: Triaged => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1708130 Title: ISST-LTE: Ubuntu16.04.03: PowerNV: 'ppc64_cpu' commands hangs while changing SMT value with Leaf IO and BASE tests Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: Triaged Bug description: == Comment: #0 - INDIRA P. JOGA- 2017-07-07 03:56:54 == Description: -- Started Leaf IO and BASE(without smt tests)and then tried to change the SMT value manually where it hangs UBUNTU BUILD: 4.10.0-26-generic Steps to re-create: -- > Installed latest Ubuntu160403 kernel on system lotkvm 4.10.0-26-generic > Leaf microcode: KMIPP113 > Started Leaf IO and BASE tests(without smt tests). root@lotkvm:/home# show.report.py HOSTNAMEKERNEL VERSION DISTRO INFO ---- lotkvm 4.10.0-26-genericUbuntu 16.04.2 LTS \n \l Current Time: Tue Jul 4 00:55:37 2017 Job-IDFOCUS Start-Time DurationFunction --- -- 1 IO 20170704-00:44:45 0.0 hr(s) 10.0 min(s) IO_Focus 2 BASE20170704-00:44:52 0.0 hr(s) 10.0 min(s) Test FOCUS IO BASESUM TOTAL 76 25 101 FAIL 0 4 4 PASS 76 21 97 (%) (100%) (84%) (96%) >Now manually changed the smt value root@lotkvm:/home# ppc64_cpu --smt SMT=8 root@lotkvm:/home# date Tue Jul 4 00:46:01 CDT 2017 root@lotkvm:/home# ppc64_cpu --smt=2 root@lotkvm:/home# ppc64_cpu --smt SMT=2 root@lotkvm:/home# date Tue Jul 4 00:50:01 CDT 2017 root@lotkvm:/home# ppc64_cpu --smt=4 root@lotkvm:/home# ppc64_cpu --smt SMT=4 root@lotkvm:/home# date Tue Jul 4 00:54:38 CDT 2017 root@lotkvm:/home# ppc64_cpu --smt=8 [ 2055.142781] INFO: task jbd2/nvme0n1p6-:22052 blocked for more than 120 seconds. [ 2055.142915] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.142978] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2055.143150] INFO: task kworker/48:0H:21755 blocked for more than 120 seconds. [ 2055.143226] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.143289] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2055.143570] INFO: task kworker/u259:3:22436 blocked for more than 120 seconds. [ 2055.143647] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.143709] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2055.143953] INFO: task kworker/8:188:118516 blocked for more than 120 seconds. [ 2055.144029] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.144091] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2055.144289] INFO: task mkfs.ntfs:95505 blocked for more than 120 seconds. [ 2055.144353] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.144416] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2055.144600] INFO: task ppc64_cpu:80305 blocked for more than 120 seconds. [ 2055.144665] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.144727] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2055.144872] INFO: task rm:80950 blocked for more than 120 seconds. [ 2055.144936] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.144998] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2055.145133] INFO: task rm:80951 blocked for more than 120 seconds. [ 2055.145195] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2055.145257] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2175.974718] INFO: task jbd2/nvme0n1p6-:22052 blocked for more than 120 seconds. [ 2175.974848] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2175.974912] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [ 2175.975068] INFO: task kworker/48:0H:21755 blocked for more than 120 seconds. [ 2175.975144] Not tainted 4.10.0-26-generic #30~16.04.1-Ubuntu [ 2175.975206] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. > root@lotkvm:# ps -eaf | grep ppc* root 48054 12068 0 01:24 pts/000:00:00 grep --color=auto ppc* root 80305 5719 0 00:54 hvc0 00:00:00 ppc64_cpu --smt 8 > ppc64_cpu --smt command hangs here . Not able to change the SMT value from 4 to 8. > Attached dmesg logs == Comment: #2 - VIPIN K.
[Kernel-packages] [Bug 1711104] Re: [P9][Ubuntu 16.04.03] Additional PMU fixes for P9
** Changed in: ubuntu-power-systems Status: In Progress => Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1711104 Title: [P9][Ubuntu 16.04.03] Additional PMU fixes for P9 Status in The Ubuntu-power-systems project: Incomplete Status in linux package in Ubuntu: In Progress Status in linux source package in Zesty: In Progress Status in perforate source package in Zesty: New Bug description: == Comment: #0 - Shriya R. Kulkarni- 2017-08-16 04:07:45 == Please integrate the following PMU fixes for P9. 93fc5ca9a0048c ('powerpc/perf: Cleanup of PM_BR_CMPL vs. PM_BRU_CMPL in Power9 event list') 91e0bd1e62519b ('powerpc/perf: Add PM_LD_MISS_L1 and PM_BR_2PATH to power9 event list') 70a7e720998d5b ('powerpc/perf: Factor out PPMU_ONLY_COUNT_RUN check code from power8') 7aa345d84245a7 ('powerpc/perf: Update default sdar_mode value for power9') 101dd590a7fa37 ('powerpc/perf: Avoid spurious PMU interrupts after idle') bdd21ddb919d28 ('powerpc/perf: Avoid spurious PMU interrupts after idle on Power9') 3f0bd8dad0db73 ('powerpc/perf: Add POWER9 alternate PM_RUN_CYC and PM_RUN_INST_CMPL events') 20dd4c624d2515 ('powerpc/perf: Fix SDAR_MODE value for continous sampling on Power9') 24bedcb7c81137 ('powerpc/perf: Fix branch event code for power9') 8c218578fcbbbd ('powerpc/perf: Fix Power9 test_adder fields') To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1711104/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1708409] Re: kdump service does not start after configure/reboot
** Tags added: ppc64el-kdump -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1708409 Title: kdump service does not start after configure/reboot Status in The Ubuntu-power-systems project: Triaged Status in makedumpfile package in Ubuntu: Confirmed Status in systemd package in Ubuntu: Triaged Status in makedumpfile source package in Artful: New Status in systemd source package in Artful: New Status in makedumpfile source package in Bionic: Confirmed Status in systemd source package in Bionic: Triaged Bug description: == Comment: #0 - Harish Sriram- 2017-08-02 01:45:01 == kdump service does not start after configure/reboot --Problem Description--- kdump service does not start after configure/reboot. It has to be started/loaded manually, everytime after reboot. # kdump-config status current state : Not ready to kdump ---uname output--- Linux ltc-test-ci2 4.11.0-10-generic #15-Ubuntu SMP Thu Jun 29 15:02:54 UTC 2017 ppc64le ppc64le ppc64le GNU/Linux Machine Type/Model = Power 8/8247-22L Additional Info- # cat /proc/cmdline root=UUID=974df602-c0e4-4e67-8853-78ad15884c59 ro console=tty0 console=ttyS0,115200 quiet splash cgroup_enable=memory swapaccount=1 crashkernel=2G-4G:320M,4G-32G:512M,32G-64G:1024M,64G-128G:2048M,128G-:4096M ---Steps to Reproduce--- 1. installed linux-crashdump 2. edited the kdump-tools.cfg crashkernel cmdline to above 3. update-grub 4. reboot Expected: kdump-config to be loaded by default after reboot # kdump-config status current state : Not ready to kdump # service kdump-tools status * kdump-tools.service - Kernel crash dump capture service Loaded: loaded (/lib/systemd/system/kdump-tools.service; enabled; vendor pres Active: inactive (dead) ... https://github.com/systemd/systemd/issues/6334 systemd in artful is not properly picking up the unit files in /etc/systemd/system/default.target.wants To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1708409/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1681909] Re: Ubuntu 17.04: dump is not captured in remote host when kdump over ssh is configured on firestone.
** Tags added: ppc64el-kdump -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1681909 Title: Ubuntu 17.04: dump is not captured in remote host when kdump over ssh is configured on firestone. Status in The Ubuntu-power-systems project: New Status in makedumpfile package in Ubuntu: New Bug description: == Comment: #0 - PAVITHRA R. PRAKASH- 2017-03-07 05:00:29 == ---Problem Description--- Ubuntu 17.04: dump is not captured in remote host when kdump over ssh is configured on firestone. ---Steps to Reproduce--- 1. Configure kdump. 2. Check whether kdump is operational using ?# kdump-config show?. 3. Install ?kernel-debuginfo? and ?kernel-debuginfo-common? rpms. 4. Setup password less ssh connection, generate rsa key. # ssh-keygen -t rsa 5. verify id_rsa and id_rsa.pub are created under /root/.ssh/ 6. Edit /etc/default/kdump-tools and add below entries. SSH="ubuntu@9.114.15.239" SSH_KEY=/root/.ssh/id_rsa 7. Propagate RSA key. # kdump-config propagate 8. Restart kdump service. # kdump-config load 9. Trigger Crash using below commands. # echo "1" > /proc/sys/kernel/sysrq # echo "c" > /proc/sysrq-trigger 10. Verify dump is available in remote server in configured path. Machine details === $ ipmitool -I lanplus -H 9.47.70.3 -U ADMIN -P admin sol activate $ ssh ubuntu@9.47.70.29 PW: shriya101 Attaching logs == Comment: #1 - PAVITHRA R. PRAKASH - 2017-03-07 05:01:42 == == Comment: #5 - PAVITHRA R. PRAKASH - 2017-03-07 23:19:46 == Hi, Attaching the logs. Network info: root@ltc-firep3:~# hwinfo --network 36: None 00.0: 10700 Loopback [Created at net.126] Unique ID: ZsBS.GQNx7L4uPNA SysFS ID: /class/net/lo Hardware Class: network interface Model: "Loopback network interface" Device File: lo Link detected: yes Config Status: cfg=new, avail=yes, need=no, active=unknown 37: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: 2lHw.ndpeucax6V1 Parent ID: mIXc.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f2 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.2 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f2 HW Address: 98:be:94:03:18:4a Permanent HW Address: 98:be:94:03:18:4a Link detected: no Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #15 (Ethernet controller) 38: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: 7Onn.ndpeucax6V1 Parent ID: sx0U.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f0 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.0 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f0 HW Address: 98:be:94:03:18:48 Permanent HW Address: 98:be:94:03:18:48 Link detected: yes Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #16 (Ethernet controller) 39: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: VwX_.ndpeucax6V1 Parent ID: DUng.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f3 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.3 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f3 HW Address: 98:be:94:03:18:4b Permanent HW Address: 98:be:94:03:18:4b Link detected: no Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #25 (Ethernet controller) 40: None 00.0: 10701 Ethernet [Created at net.126] Unique ID: bZ1s.ndpeucax6V1 Parent ID: J7HY.aXC4wIvegH8 SysFS ID: /class/net/enP33p3s0f1 SysFS Device Link: /devices/pci0021:00/0021:00:00.0/0021:01:00.0/0021:02:01.0/0021:03:00.1 Hardware Class: network interface Model: "Ethernet network interface" Driver: "tg3" Driver Modules: "tg3" Device File: enP33p3s0f1 HW Address: 98:be:94:03:18:49 Permanent HW Address: 98:be:94:03:18:49 Link detected: no Config Status: cfg=new, avail=yes, need=no, active=unknown Attached to: #4 (Ethernet controller) root@ltc-firep3:~# Thanks, Pavithra == Comment: #6 - PAVITHRA R. PRAKASH - 2017-03-07 23:20:47 == == Comment: #7 - PAVITHRA R. PRAKASH - 2017-03-07 23:21:27 == == Comment: #8 - Urvashi Jawere - 2017-03-08 02:48:15 == I am able to see some errors in syslog ; auxiliary Mar 7 04:57:44 ltc-firep3
[Kernel-packages] [Bug 1714485] Re: Ubuntu 16.04.03: kdump fails with error "kdump-tools[1532]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator" when / file system is xfs.
** Tags added: ppc64el-kdump -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to makedumpfile in Ubuntu. https://bugs.launchpad.net/bugs/1714485 Title: Ubuntu 16.04.03: kdump fails with error "kdump-tools[1532]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator" when / file system is xfs. Status in The Ubuntu-power-systems project: Triaged Status in makedumpfile package in Ubuntu: In Progress Status in makedumpfile source package in Xenial: New Status in makedumpfile source package in Zesty: New Status in makedumpfile source package in Artful: New Status in makedumpfile source package in Bionic: In Progress Bug description: == Comment: #0 - PAVITHRA R. PRAKASH <> - 2017-08-31 00:33:37 == ---Problem Description--- Ubuntu 16.04.03: kdump fails with error "kdump-tools[1532]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator" when / file system is xfs. ---Steps to Reproduce--- 1. Install Ubuntu 16.04.03 with / as xfs. 2. Configure kdump. 3. trigger crash. Machine hangs after below log. Attaching console log. [ OK ] Reached target Network is Online. Starting Kernel crash dump capture service... Starting iSCSI initiator daemon (iscsid)... [ 12.263089] kdump-tools[1205]: /etc/init.d/kdump-tools: 26: [: -ne: unexpected operator [ OK ] Started Kernel crash dump capture service. [ OK ] Started iSCSI initiator daemon (iscsid). Starting Login to default iSCSI targets... [ OK ] Started Login to default iSCSI targets. [ OK ] Reached target Remote File Systems (Pre). 4. After manual reboot /etc/default/kdump-tools is empty. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/1714485/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1724120] Re: Ubuntu 16.04.3 - call traces occurs when memory-hotplug test is run with 16Gb hugepages configured
Are we waiting on the patches referenced in the bug description and comment #1 to land upstream before backporting? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1724120 Title: Ubuntu 16.04.3 - call traces occurs when memory-hotplug test is run with 16Gb hugepages configured Status in The Ubuntu-power-systems project: Triaged Status in linux package in Ubuntu: Triaged Bug description: Issue: Call traces occurs when memory-hotplug script is run with 16Gb hugepages configured. Environment: ppc64le PowerVM Lpar root@ltctuleta-lp1:~# uname -r 4.4.0-34-generic root@ltctuleta-lp1:~# cat /proc/meminfo | grep -i huge AnonHugePages: 0 kB HugePages_Total: 2 HugePages_Free:2 HugePages_Rsvd:0 HugePages_Surp:0 Hugepagesize: 16777216 kB root@ltctuleta-lp1:~# free -h totalusedfree shared buff/cache available Mem:85G 32G 52G 16M193M 52G Swap: 43G 0B 43G Steps to reproduce: 1 - Download kernel source and enter to the directory- tools/testing/selftests/memory-hotplug/ 2 - Run mem-on-off-test.sh script in it. System gives call traces like: offline_memory_expect_success 639: unexpected fail online-offline 668 [ 57.552964] Unable to handle kernel paging request for data at address 0x0028 [ 57.552977] Faulting instruction address: 0xc029bc04 [ 57.552987] Oops: Kernel access of bad area, sig: 11 [#1] [ 57.552992] SMP NR_CPUS=2048 NUMA pSeries [ 57.553002] Modules linked in: btrfs xor raid6_pq pseries_rng sunrpc autofs4 ses enclosure nouveau bnx2x i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm vxlan ip6_udp_tunnel ipr udp_tunnel rtc_generic mdio libcrc32c [ 57.553050] CPU: 44 PID: 6518 Comm: mem-on-off-test Not tainted 4.4.0-34-generic #53-Ubuntu [ 57.553059] task: c0072773c8e0 ti: c0072778 task.ti: c0072778 [ 57.553067] NIP: c029bc04 LR: c029bbdc CTR: c01107f0 [ 57.553076] REGS: c00727783770 TRAP: 0300 Not tainted (4.4.0-34-generic) [ 57.553083] MSR: 80019033CR: 24242882 XER: 0002 [ 57.553104] CFAR: c0008468 DAR: 0028 DSISR: 4000 SOFTE: 1 GPR00: c029bbdc c007277839f0 c15b5d00 GPR04: 0029d000 0800 fa01 GPR08: fa700020 0008 c185e270 c00e7e50 GPR12: 2200 ce6ea200 0029d000 2200 GPR16: 1000 c15e2200 0a70 GPR20: 0001 0100 0200 c15f16d0 GPR24: c1876510 0001 c1872a00 GPR28: 0029d000 f000 fa70 0029c000 [ 57.553211] NIP [c029bc04] dissolve_free_huge_pages+0x154/0x220 [ 57.553219] LR [c029bbdc] dissolve_free_huge_pages+0x12c/0x220 [ 57.553226] Call Trace: [ 57.553231] [c007277839f0] [c029bbdc] dissolve_free_huge_pages+0x12c/0x220 (unreliable) [ 57.553244] [c00727783a80] [c02dcbc8] __offline_pages.constprop.6+0x3f8/0x900 [ 57.553254] [c00727783bd0] [c06fbb38] memory_subsys_offline+0xa8/0x110 [ 57.553265] [c00727783c00] [c06d6424] device_offline+0x104/0x140 [ 57.553274] [c00727783c40] [c06fba80] store_mem_state+0x180/0x190 [ 57.553283] [c00727783c80] [c06d1e58] dev_attr_store+0x68/0xa0 [ 57.553293] [c00727783cc0] [c0398110] sysfs_kf_write+0x80/0xb0 [ 57.553302] [c00727783d00] [c0397028] kernfs_fop_write+0x188/0x200 [ 57.553312] [c00727783d50] [c02e190c] __vfs_write+0x6c/0xe0 [ 57.553321] [c00727783d90] [c02e2640] vfs_write+0xc0/0x230 [ 57.553329] [c00727783de0] [c02e367c] SyS_write+0x6c/0x110 [ 57.553339] [c00727783e30] [c0009204] system_call+0x38/0xb4 [ 57.553346] Instruction dump: [ 57.553351] 7e831836 4bfff991 e91e0028 e8fe0020 7d32e82a f9070008 f8e8 fabe0020 [ 57.553366] fade0028 79294620 79291764 7d234a14 3908 f9030028 81091458 [ 57.553383] ---[ end trace 617f7bdd75bcfc10 ]--- [ 57.557133] Segmentation fault The following commit IDs were built into a 4.10.0-37-generic #41 test kernel and verified to fix the problem: a525108cf1cc14651602d678da38fa627a76a724 e1073d1e7920946ac4776a619cc40668b9e1401b 40692eb5eea209c2dd55857f44b4e1d7206e91d6 e24a1307ba1f99fc62a0bd61d5e87fcfb6d5503d 79cc38ded1e1ac86e69c90f604efadd50b0b3762 4ae279c2c96ab38a78b954d218790a8f6db714e5 To