[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
** Tags added: cscc

--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1750441

Title:
  Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call

Status in The Ubuntu-power-systems project:
  Fix Released
Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Bionic:
  Fix Released

Bug description:
  Problem Description:

  - PCI passthru Qlogic Fibre Channel adapter from Ubuntu 18.04 KVM host to Ubuntu 18.04 KVM guest.
  - Stress test on Qlogic Fibre Channel on Ubuntu KVM guest caused KVM host crashed in qlt_free_session_done call.
  - Below stack traces from KVM host:

  91:mon> t
  [c000200e4e81fb60] c0081162f044 qlt_free_session_done+0x4ec/0x680 [qla2xxx] (unreliable)
  [c000200e4e81fc90] c012fbb8 process_one_work+0x298/0x5a0
  [c000200e4e81fd20] c012ff58 worker_thread+0x98/0x630
  [c000200e4e81fdc0] c0138ae8 kthread+0x1a8/0x1b0
  [c000200e4e81fe30] c000b528 ret_from_kernel_thread+0x5c/0xb4
  91:mon> e
  cpu 0x91: Vector: 300 (Data Access) at [c000200e4e81f8e0]
      pc: c0081162ed58: qlt_free_session_done+0x200/0x680 [qla2xxx]
      lr: c0081162eca8: qlt_free_session_done+0x150/0x680 [qla2xxx]
      sp: c000200e4e81fb60
     msr: 9280b033
     dar: 20
   dsisr: 4000
    current = 0xc000200e4e7b0e00
    paca    = 0xcfae3b00   softe: 0   irq_happened: 0x01
      pid   = 1119, comm = kworker/145:1
  Linux version 4.15.0-041500rc9-generic (kernel@tangerine) (gcc version 7.2.0 (Ubuntu 7.2.0-6ubuntu1)) #201801212130 SMP Mon Jan 22 03:36:42 UTC 2018
  91:mon> r
  R00 = c0081162eca8       R16 =
  R01 = c000200e4e81fb60   R17 =
  R02 = c0081166ad60       R18 =
  R03 = 0001               R19 =
  R04 = c000200e44f8c7f8   R20 = c000200e618e7d80
  R05 = f087               R21 =
  R06 = c0081165e6c8       R22 = 0001
  R07 = c0081164adb0       R23 = c000200e44f99d24
  R08 =                    R24 = 0402
  R09 =                    R25 =
  R10 =                    R26 = c00fe1270c20
  R11 = c0081163e170       R27 = c000200e44f99000
  R12 = c0cfccf0           R28 = c0081164adb0
  R13 = cfae3b00           R29 = c00fe1270c00
  R14 = c0138948           R30 = c000200e44f8c7f8
  R15 = c000200e4f019440   R31 = c00fe1270cc0
  pc  = c0081162ed58 qlt_free_session_done+0x200/0x680 [qla2xxx]
  cfar= c0081162ed1c qlt_free_session_done+0x1c4/0x680 [qla2xxx]
  lr  = c0081162eca8 qlt_free_session_done+0x150/0x680 [qla2xxx]
  msr = 9280b033   cr  = 28002284
  ctr = c0cfccf0   xer =   trap = 300
  dar = 0020   dsisr = 4000
  91:mon>

  The crash location seems close to this one fixed about two weeks ago:

  https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/commit/drivers/scsi/qla2xxx/qla_os.c?h=next-20180212&id=2ce87cc5b269510de9ca1185ca8a6e10ec78c069

  scsi: qla2xxx: Fix memory corruption during hba reset test

  This patch fixes memory corrpution while performing HBA Reset test.

  Following stack trace is seen:

  [ 466.397219] BUG: unable to handle kernel NULL pointer dereference at 0020
  [ 466.433669] IP: [] qlt_free_session_done+0x260/0x5f0 [qla2xxx]
  [ 466.467731] PGD 0
  [ 466.476718] Oops: [#1] SMP

  - Luciano built and provided the patch with new Qlogic change on Friday last week.

  root@bos1u1p1:~/chavez# ls linux-image*
  linux-image-4.15.0-041500rc9-generic_4.15.0-041500rc9.201801212130_ppc64el.deb
  linux-image-extra-4.15.0-041500rc9-generic_4.15.0-041500rc9.201801212130_ppc64el.deb

  - I configured and ran same test over weekend and test ran good. KVM host did not crash in qlt_free_session_done call like before.
  - So the patch fixed the problem.

  Hi Canonical,

  Please review and consider this a request to pull in commit 2ce87cc5b269510de9ca1185ca8a6e10ec78c069 please. Thanks!

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1750441/+subscriptions

--
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
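For readers comparing the two traces in the bug description above: both report a fault in qlt_free_session_done, and both show a fault address of 0x20 (the xmon `dar` value and the address in the upstream oops line). A minimal Python sketch of that comparison follows. The helper names `parse_symbol` and `same_faulting_function` are hypothetical, introduced only for illustration; the input strings are copied from the traces as they appear in this thread.

```python
import re

# Matches kernel symbol references of the form name+0xOFFSET/0xSIZE.
SYMBOL_RE = re.compile(r"(?P<func>\w+)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)")


def parse_symbol(line):
    """Return (function, offset, size) for the first symbol+offset in line."""
    match = SYMBOL_RE.search(line)
    if match is None:
        raise ValueError("no symbol+offset found in: " + line)
    return match["func"], int(match["off"], 16), int(match["size"], 16)


def same_faulting_function(line_a, line_b):
    """True when both trace lines name the same function.

    Offsets and sizes are ignored because they differ between builds.
    """
    return parse_symbol(line_a)[0] == parse_symbol(line_b)[0]


# Lines copied from the xmon dump and from the upstream commit message.
xmon_pc = "pc: c0081162ed58: qlt_free_session_done+0x200/0x680 [qla2xxx]"
oops_ip = "IP: [] qlt_free_session_done+0x260/0x5f0 [qla2xxx]"

xmon_dar = int("0020", 16)   # "dar = 0020" in the xmon register dump
oops_addr = int("0020", 16)  # "NULL pointer dereference at 0020" in the oops

print(parse_symbol(xmon_pc))
print(parse_symbol(oops_ip))
print(same_faulting_function(xmon_pc, oops_ip), xmon_dar == oops_addr)
```

A matching function name and fault address only suggest that the two crashes share a cause; in this thread the link was confirmed by rerunning the stress test on a kernel built with the commit.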
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
** Changed in: ubuntu-power-systems
       Status: In Progress => Fix Released

Status in The Ubuntu-power-systems project:
  Fix Released
Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Bionic:
  Fix Released
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
This bug was fixed in the package linux - 4.15.0-13.14

---------------
linux (4.15.0-13.14) bionic; urgency=medium

  * linux: 4.15.0-13.14 -proposed tracker (LP: #1756408)

  * devpts: handle bind-mounts (LP: #1755857)
    - SAUCE: devpts: hoist out check for DEVPTS_SUPER_MAGIC
    - SAUCE: devpts: resolve devpts bind-mounts
    - SAUCE: devpts: comment devpts_mntget()
    - SAUCE: selftests: add devpts selftests

  * [bionic][arm64] d-i: add hisi_sas_v3_hw to scsi-modules (LP: #1756103)
    - d-i: add hisi_sas_v3_hw to scsi-modules

  * [Bionic][ARM64] enable ROCE and HNS3 driver support for hip08 SoC
    (LP: #1756097)
    - RDMA/hns: Refactor eq code for hip06
    - RDMA/hns: Add eq support of hip08
    - RDMA/hns: Add detailed comments for mb() call
    - RDMA/hns: Add rq inline data support for hip08 RoCE
    - RDMA/hns: Update the usage of sr_max and rr_max field
    - RDMA/hns: Set access flags of hip08 RoCE
    - RDMA/hns: Filter for zero length of sge in hip08 kernel mode
    - RDMA/hns: Fix QP state judgement before sending work requests
    - RDMA/hns: Assign dest_qp when deregistering mr
    - RDMA/hns: Fix endian problems around imm_data and rkey
    - RDMA/hns: Assign the correct value for tx_cqn
    - RDMA/hns: Create gsi qp in hip08
    - RDMA/hns: Add gsi qp support for modifying qp in hip08
    - RDMA/hns: Fill sq wqe context of ud type in hip08
    - RDMA/hns: Assign zero for pkey_index of wc in hip08
    - RDMA/hns: Update the verbs of polling for completion
    - RDMA/hns: Set the guid for hip08 RoCE device
    - net: hns3: Refactor of the reset interrupt handling logic
    - net: hns3: Add reset service task for handling reset requests
    - net: hns3: Refactors the requested reset & pending reset handling code
    - net: hns3: Add HNS3 VF IMP(Integrated Management Proc) cmd interface
    - net: hns3: Add mailbox support to VF driver
    - net: hns3: Add HNS3 VF HCL(Hardware Compatibility Layer) Support
    - net: hns3: Add HNS3 VF driver to kernel build framework
    - net: hns3: Unified HNS3 {VF|PF} Ethernet Driver for hip08 SoC
    - net: hns3: Add mailbox support to PF driver
    - net: hns3: Change PF to add ring-vect binding & resetQ to mailbox
    - net: hns3: Add mailbox interrupt handling to PF driver
    - net: hns3: add support to query tqps number
    - net: hns3: add support to modify tqps number
    - net: hns3: change the returned tqp number by ethtool -x
    - net: hns3: free the ring_data structrue when change tqps
    - net: hns3: get rss_size_max from configuration but not hardcode
    - net: hns3: add a mask initialization for mac_vlan table
    - net: hns3: add vlan offload config command
    - net: hns3: add ethtool related offload command
    - net: hns3: add handling vlan tag offload in bd
    - net: hns3: cleanup mac auto-negotiation state query
    - net: hns3: fix for getting auto-negotiation state in hclge_get_autoneg
    - net: hns3: add support for set_pauseparam
    - net: hns3: add support to update flow control settings after autoneg
    - net: hns3: add Asym Pause support to phy default features
    - net: hns3: add support for querying advertised pause frame by ethtool ethx
    - net: hns3: Increase the default depth of bucket for TM shaper
    - net: hns3: change TM sched mode to TC-based mode when SRIOV enabled
    - net: hns3: hns3_get_channels() can be static
    - net: hns3: Add ethtool interface for vlan filter
    - net: hns3: Disable VFs change rxvlan offload status
    - net: hns3: Unify the strings display of packet statistics
    - net: hns3: Fix spelling errors
    - net: hns3: Remove repeat statistic of rx_errors
    - net: hns3: Modify the update period of packet statistics
    - net: hns3: Mask the packet statistics query when NIC is down
    - net: hns3: Fix an error of total drop packet statistics
    - net: hns3: Fix a loop index error of tqp statistics query
    - net: hns3: Fix an error macro definition of HNS3_TQP_STAT
    - net: hns3: Remove a useless member of struct hns3_stats
    - net: hns3: Add packet statistics of netdev
    - net: hns3: Fix a response data read error of tqp statistics query
    - net: hns3: fix for updating fc_mode_last_time
    - net: hns3: fix for setting MTU
    - net: hns3: fix for changing MTU
    - net: hns3: add MTU initialization for hardware
    - net: hns3: fix for not setting pause parameters
    - net: hns3: remove redundant semicolon
    - net: hns3: Add more packet size statisctics
    - Revert "net: hns3: Add packet statistics of netdev"
    - net: hns3: report the function type the same line with hns3_nic_get_stats64
    - net: hns3: add ethtool_ops.get_channels support for VF
    - net: hns3: remove TSO config command from VF driver
    - net: hns3: add ethtool_ops.get_coalesce support to PF
    - net: hns3: add ethtool_ops.set_coalesce support to PF
    - net: hns3: refactor interrupt coalescing init function
    - net: hns3: refactor GL update function
    - net: hns3: remove unused
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
** Changed in: linux (Ubuntu Bionic)
       Status: In Progress => Fix Committed

Status in The Ubuntu-power-systems project:
  In Progress
Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
Bionic request submitted:
https://lists.ubuntu.com/archives/kernel-team/2018-March/090767.html

Status in The Ubuntu-power-systems project:
  In Progress
Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Bionic:
  In Progress
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
** Changed in: ubuntu-power-systems
       Status: New => In Progress

Status in The Ubuntu-power-systems project:
  In Progress
Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Bionic:
  In Progress
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
I built a test kernel with commit 2ce87cc5b269510de9ca1185ca8a6e10ec78c069. The test kernel can be downloaded from:
http://kernel.ubuntu.com/~jsalisbury/lp1750441

Can you test this kernel and see if it resolves this bug?

Note, to test this kernel, you need to install both the linux-image and linux-image-extra .deb packages.

Thanks in advance!

Status in The Ubuntu-power-systems project:
  New
Status in linux package in Ubuntu:
  In Progress
Status in linux source package in Bionic:
  In Progress
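The note in this message that both the linux-image and linux-image-extra .deb packages must be installed can be checked before installing. The following is a minimal Python sketch, assuming the usual Debian file naming `<package>_<version>_<arch>.deb`; the function names `kernel_package_kind` and `missing_kinds` are hypothetical, and the file names are the ones listed in the bug description.

```python
REQUIRED_KINDS = {"linux-image", "linux-image-extra"}


def kernel_package_kind(filename):
    """Classify a .deb file name as 'linux-image', 'linux-image-extra', or None."""
    if not filename.endswith(".deb"):
        return None
    # Debian package files are named <package>_<version>_<arch>.deb.
    package = filename.split("_", 1)[0]
    # Test the longer prefix first, since "linux-image-" is a prefix of it.
    if package.startswith("linux-image-extra-"):
        return "linux-image-extra"
    if package.startswith("linux-image-"):
        return "linux-image"
    return None


def missing_kinds(filenames):
    """Return the required package kinds that have no matching file, sorted."""
    found = {kernel_package_kind(name) for name in filenames}
    return sorted(REQUIRED_KINDS - found)


downloaded = [
    "linux-image-4.15.0-041500rc9-generic_4.15.0-041500rc9.201801212130_ppc64el.deb",
    "linux-image-extra-4.15.0-041500rc9-generic_4.15.0-041500rc9.201801212130_ppc64el.deb",
]

print(missing_kinds(downloaded))      # nothing missing when both files are present
print(missing_kinds(downloaded[:1]))  # reports the absent linux-image-extra package
```

This checks file names only. It does not verify that the two packages carry the same version or that the installation succeeded.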
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
** Changed in: linux (Ubuntu) Status: New => In Progress ** Changed in: linux (Ubuntu) Importance: Undecided => High ** Changed in: linux (Ubuntu) Assignee: Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage) => Joseph Salisbury (jsalisbury) ** Also affects: linux (Ubuntu Bionic) Importance: High Assignee: Joseph Salisbury (jsalisbury) Status: In Progress -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1750441 Title: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call Status in The Ubuntu-power-systems project: New Status in linux package in Ubuntu: In Progress Status in linux source package in Bionic: In Progress Bug description: Problem Description: = - PCI passthru Qlogic Fibre Channel adapter from Ubuntu 18.04 KVM host to Ubuntu 18.04 KVM guest. - Stress test on Qlogic Fibre Channel on Ubuntu KVM guest caused KVM host crashed in qlt_free_session_done call. 
- Below stack traces from KVM host: 91:mon> t [c000200e4e81fb60] c0081162f044 qlt_free_session_done+0x4ec/0x680 [qla2xxx] (unreliable) [c000200e4e81fc90] c012fbb8 process_one_work+0x298/0x5a0 [c000200e4e81fd20] c012ff58 worker_thread+0x98/0x630 [c000200e4e81fdc0] c0138ae8 kthread+0x1a8/0x1b0 [c000200e4e81fe30] c000b528 ret_from_kernel_thread+0x5c/0xb4 91:mon> e cpu 0x91: Vector: 300 (Data Access) at [c000200e4e81f8e0] pc: c0081162ed58: qlt_free_session_done+0x200/0x680 [qla2xxx] lr: c0081162eca8: qlt_free_session_done+0x150/0x680 [qla2xxx] sp: c000200e4e81fb60 msr: 9280b033 dar: 20 dsisr: 4000 current = 0xc000200e4e7b0e00 paca= 0xcfae3b00 softe: 0irq_happened: 0x01 pid = 1119, comm = kworker/145:1 Linux version 4.15.0-041500rc9-generic (kernel@tangerine) (gcc version 7.2.0 (Ubuntu 7.2.0-6ubuntu1)) #201801212130 SMP Mon Jan 22 03:36:42 UTC 2018 91:mon> r R00 = c0081162eca8 R16 = R01 = c000200e4e81fb60 R17 = R02 = c0081166ad60 R18 = R03 = 0001 R19 = R04 = c000200e44f8c7f8 R20 = c000200e618e7d80 R05 = f087 R21 = R06 = c0081165e6c8 R22 = 0001 R07 = c0081164adb0 R23 = c000200e44f99d24 R08 = R24 = 0402 R09 = R25 = R10 = R26 = c00fe1270c20 R11 = c0081163e170 R27 = c000200e44f99000 R12 = c0cfccf0 R28 = c0081164adb0 R13 = cfae3b00 R29 = c00fe1270c00 R14 = c0138948 R30 = c000200e44f8c7f8 R15 = c000200e4f019440 R31 = c00fe1270cc0 pc = c0081162ed58 qlt_free_session_done+0x200/0x680 [qla2xxx] cfar= c0081162ed1c qlt_free_session_done+0x1c4/0x680 [qla2xxx] lr = c0081162eca8 qlt_free_session_done+0x150/0x680 [qla2xxx] msr = 9280b033 cr = 28002284 ctr = c0cfccf0 xer = trap = 300 dar = 0020 dsisr = 4000 91:mon> The crash location seems close to this one fixed about two weeks ago: https://git.kernel.org/pub/scm/linux/kernel/git/next/linux- next.git/commit/drivers/scsi/qla2xxx/qla_os.c?h=next-20180212&id=2ce87cc5b269510de9ca1185ca8a6e10ec78c069 scsi: qla2xxx: Fix memory corruption during hba reset test This patch fixes memory corrpution while performing HBA Reset test. 

  Following stack trace is seen:

  [ 466.397219] BUG: unable to handle kernel NULL pointer dereference at 0020
  [ 466.433669] IP: [] qlt_free_session_done+0x260/0x5f0 [qla2xxx]
  [ 466.467731] PGD 0
  [ 466.476718] Oops: [#1] SMP

  - Luciano built and provided a kernel with the new Qlogic patch on Friday last week.

  root@bos1u1p1:~/chavez# ls linux-image*
  linux-image-4.15.0-041500rc9-generic_4.15.0-041500rc9.201801212130_ppc64el.deb
  linux-image-extra-4.15.0-041500rc9-generic_4.15.0-041500rc9.201801212130_ppc64el.deb

  - I configured and ran the same test over the weekend and it ran fine. The KVM host did not crash in the qlt_free_session_done call as it did before.

  - So the patch fixed the problem.

  Hi Canonical,

  Please review and consider this a request to pull in commit 2ce87cc5b269510de9ca1185ca8a6e10ec78c069.

  Thanks!

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1750441/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1750441] Re: Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM guest that caused KVM host crashed in qlt_free_session_done call
** Also affects: ubuntu-power-systems
   Importance: Undecided
       Status: New

** Changed in: ubuntu-power-systems
   Importance: Undecided => High

** Changed in: ubuntu-power-systems
     Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team)

** Tags added: triage-g

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1750441

Title:
  Boston-LC:bos1u1: Stress test on Qlogic Fibre Channel on Ubuntu KVM
  guest that caused KVM host crashed in qlt_free_session_done call

Status in The Ubuntu-power-systems project:
  New
Status in linux package in Ubuntu:
  New