Public bug reported:
Jeff,
We have analyzed this issue and see that there are 3 missing patches
required to fix this problem. Otherwise vport creation >3 (depending on
the customers configuration) will result in a hang for FC or NVMe/FC
adapters.
Please pull these changes into the next hardware release.
These are the missing commits:
959239d [scsi] scsi: core: avoid pre-allocating big SGL for data
5418f2f [scsi] scsi: core: avoid pre-allocating big SGL for protection
information
250f285 [nvme] scsi: lib/sg_pool.c: improve APIs for allocating sg pool
Laurie
---------------------------------------------
CONFIGURATION DETAILS
Host OS with Support Pack - Ubuntu 18.04.3 - HWE ()
Guest/VM OS Details -
System(s) Under Test - IBM x3650 M4
Adapter(s) Under Test - Prism 1-port
IPL Name -
Active Profile ID -
Network Configuration -
SAN Configuration -
OneCapture file attached -
BUG REPRODUCTION DETAILS
Test Case ID or ATID -
Reproducibility of Bug - Always
Last Known Working Build -
Time to Reproduce Bug - 5 mins
Steps To Reproduce Bug -
Create 126 vports on Prism adapter, observe that server goes out of memory and
hung.
root@ubuntu18043:~# free -h
total used free shared buff/cache available
Mem: 15G 1.7G 13G 1.3M 299M 13G
Swap: 2.0G 0B 2.0G
root@ubuntu18043:~# free -h
total used free shared buff/cache available
Mem: 15G 2.1G 12G 1.3M 299M 12G
Swap: 2.0G 0B 2.0G
root@ubuntu18043:~# free -h
total used free shared buff/cache available
Mem: 15G 6.1G 8.7G 1.5M 301M 8.7G
Swap: 2.0G 0B 2.0G
root@ubuntu18043:~# free -h
total used free shared buff/cache available
Mem: 15G 13G 1.4G 1.6M 303M 1.4G
Swap: 2.0G 0B 2.0G
root@ubuntu18043:~# free -h
total used free shared buff/cache available
Mem: 15G 14G 523M 1.7M 303M 528M
Swap: 2.0G 0B 2.0G
root@ubuntu18043:~# free -h
total used free shared buff/cache available
Mem: 15G 14G 147M 96K 48M 5.0M
Swap: 2.0G 38M 2.0G
root@ubuntu18043:~# free -h
total used free shared buff/cache available
Mem: 15G 14G 145M 96K 49M 3.6M
Swap: 2.0G 38M 2.0G
Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.166689] scsi host112: Emulex
LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0
Logical Link Speed: 8000 Mbps PCI resettable
Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.233464] lpfc 0000:11:00.0:
0:(111):1825 Vport Created.
Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.234071] scsi host1: vport-1:0-110
created via shost1 channel 0
Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.698396] scsi host113: Emulex
LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0
Logical Link Speed: 8000 Mbps PCI resettable
Nov 28 05:37:30 ubuntu18043 kernel: [ 1523.862582] lpfc 0000:11:00.0:
0:(112):1825 Vport Created.
Nov 28 05:37:30 ubuntu18043 kernel: [ 1523.863142] scsi host1: vport-1:0-111
created via shost1 channel 0
Nov 28 05:37:31 ubuntu18043 kernel: [ 1525.097335] scsi host114: Emulex
LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0
Logical Link Speed: 8000 Mbps PCI resettable
After creating 111 vports, server ran out of memory.
As server is in hung state, no Crash dump got collected.
** Affects: linux (Ubuntu)
Importance: Undecided
Status: New
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1858840
Title:
Vport creation leads to out of memory and server hung on Ubuntu
18.04.3 on Broadcom FC HBAs
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1858840/+subscriptions
--
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs