Hello Sven,

Thank you. I did enable numaMemoryInterleave, but the issue stays.
In the meantime I switched to version 5.0.0-2 just to see if it's version dependent – it's not. All GPFS filesystems are unmounted when this happens. At shutdown I often need to do a hard reset to force a reboot – o.k., I never waited more than 5 minutes once I saw a hang; maybe it would recover after some more time. 'rmmod mmfs26' doesn't hang every time, maybe at every other shutdown or mmstartup/mmshutdown cycle. While rmmod hangs the system seems slow: commands like 'ps -efH' or 'history' take a long time, some mm commands just block, and a few times the system became completely inaccessible.

I'll reinstall the systems and move back to 4.2.3-8 to see if this is a stable configuration to start from, and to rule out any hardware/BIOS issues. I append output from numactl -H below.
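Also, in case someone wants to reproduce this, a rough sketch of the cycle I described above (illustrative only: the cycle count, sleep and timeout values are arbitrary, and it assumes the standard /usr/lpp/mmfs/bin path):

    #!/bin/bash
    # Cycle the GPFS daemon until rmmod hangs, then capture its kernel stack.
    export PATH=/usr/lpp/mmfs/bin:$PATH
    for i in $(seq 1 20); do
        mmstartup
        sleep 60                       # give the daemon time to settle
        timeout 600 mmshutdown         # mmshutdown normally returns long before this
        pid=$(pgrep -x rmmod)
        if [ -n "$pid" ]; then
            echo "cycle $i: rmmod (pid $pid) is stuck, kernel stack:"
            cat "/proc/$pid/stack"     # shows where rmmod blocks in the kernel
            break
        fi
    done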
Cheers,
Heiner

Test with 5.0.0-2:

[root@xbl-ces-2 ~]# numactl -H
available: 2 nodes (0-1)
node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53
node 0 size: 130942 MB
node 0 free: 60295 MB
node 1 cpus: 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
node 1 size: 131072 MB
node 1 free: 60042 MB
node distances:
node   0   1
  0:  10  21
  1:  21  10

[root@xbl-ces-2 ~]# mmdiag --config | grep numaM
! numaMemoryInterleave yes

# cat /proc/cmdline
BOOT_IMAGE=/vmlinuz-3.10.0-693.17.1.el7.x86_64 root=/dev/mapper/vg_root-lv_root ro crashkernel=auto rd.lvm.lv=vg_root/lv_root console=tty0 console=ttyS0,115200 nosmap

Example output of ps -efH during mmshutdown, when rmmod did hang (rmmod is the last line). This is with 5.0.0-2. As far as I can see, all GPFS processes have already terminated; only the mm helper processes and the hanging rmmod remain:

root 1 0 0 14:30 ? 00:00:10 /usr/lib/systemd/systemd --switched-root --system --deserialize 21
root 1035 1 0 14:30 ? 00:00:02 /usr/lib/systemd/systemd-journald
root 1055 1 0 14:30 ? 00:00:00 /usr/sbin/lvmetad -f
root 1072 1 0 14:30 ? 00:00:11 /usr/lib/systemd/systemd-udevd
root 1478 1 0 14:31 ? 00:00:00 /usr/sbin/sssd -i -f
root 1484 1478 0 14:31 ? 00:00:00 /usr/libexec/sssd/sssd_be --domain D.PSI.CH --uid 0 --gid 0 --debug-to-files
root 1486 1478 0 14:31 ? 00:00:00 /usr/libexec/sssd/sssd_nss --uid 0 --gid 0 --debug-to-files
root 1487 1478 0 14:31 ? 00:00:00 /usr/libexec/sssd/sssd_pam --uid 0 --gid 0 --debug-to-files
root 1479 1 0 14:31 ? 00:00:00 /usr/sbin/rasdaemon -f -r
root 1482 1 0 14:31 ? 00:00:04 /usr/sbin/irqbalance --foreground
dbus 1483 1 0 14:31 ? 00:00:00 /bin/dbus-daemon --system --address=systemd: --nofork --nopidfile --systemd-activation
root 1496 1 0 14:31 ? 00:00:00 /usr/sbin/smartd -n -q never
root 1498 1 0 14:31 ? 00:00:00 /usr/sbin/gssproxy -D
nscd 1507 1 0 14:31 ? 00:00:01 /usr/sbin/nscd
nrpe 1526 1 0 14:31 ? 00:00:00 /usr/sbin/nrpe -c /etc/nagios/nrpe.cfg -d
root 1531 1 0 14:31 ? 00:00:00 /usr/lib/systemd/systemd-logind
root 1533 1 0 14:31 ? 00:00:00 /usr/sbin/rpc.gssd
root 1803 1 0 14:31 ttyS0 00:00:00 /sbin/agetty --keep-baud 115200 38400 9600 ttyS0 vt220
root 1804 1 0 14:31 tty1 00:00:00 /sbin/agetty --noclear tty1 linux
root 2405 1 0 14:32 ? 00:00:00 /sbin/dhclient -q -cf /etc/dhcp/dhclient-ib0.conf -lf /var/lib/dhclient/dhclient--ib0.l
root 2461 1 0 14:32 ? 00:00:00 /usr/sbin/sshd -D
root 11561 2461 0 14:35 ? 00:00:00 sshd: root@pts/0
root 11565 11561 0 14:35 pts/0 00:00:00 -bash
root 16024 11565 0 14:50 pts/0 00:00:05 ps -efH
root 11609 2461 0 14:35 ? 00:00:00 sshd: root@pts/1
root 11644 11609 0 14:35 pts/1 00:00:00 -bash
root 2718 1 0 14:32 ? 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/mmccrmonitor 15 0 no
root 2758 1 0 14:32 ? 00:00:00 /usr/libexec/postfix/master -w
postfix 2785 2758 0 14:32 ? 00:00:00 pickup -l -t unix -u
postfix 2786 2758 0 14:32 ? 00:00:00 qmgr -l -t unix -u
root 3174 1 0 14:32 ? 00:00:00 /usr/sbin/crond -n
ntp 3179 1 0 14:32 ? 00:00:00 /usr/sbin/ntpd -u ntp:ntp -g
root 3915 1 3 14:32 ? 00:00:33 python /usr/lpp/mmfs/bin/mmsysmon.py
root 13618 1 0 14:36 ? 00:00:00 /usr/lpp/mmfs/bin/mmsdrserv 1191 10 10 /var/adm/ras/mmsdrserv.log 8192 yes no
root 15936 1 0 14:49 pts/1 00:00:00 /usr/lpp/mmfs/bin/mmksh /usr/lpp/mmfs/bin/runmmfs
root 15992 15936 0 14:49 pts/1 00:00:00 /sbin/rmmod mmfs26

--
Paul Scherrer Institut
Science IT
Heiner Billich
WHGA 106
CH 5232 Villigen PSI
056 310 36 02
https://www.psi.ch

From: <[email protected]> on behalf of Sven Oehme <[email protected]>
Reply-To: gpfsug main discussion list <[email protected]>
Date: Wednesday 11 July 2018 at 15:47
To: gpfsug main discussion list <[email protected]>
Subject: Re: [gpfsug-discuss] /sbin/rmmod mmfs26 hangs on mmshutdown

Hi,

what does numactl -H report? Also check if this is set to yes:

root@fab3a:~# mmlsconfig numaMemoryInterleave
numaMemoryInterleave yes

Sven

On Wed, Jul 11, 2018 at 6:40 AM Billich Heinrich Rainer (PSI) <[email protected]> wrote:

Hello,

I have two nodes which hang on 'mmshutdown'; in detail, the command '/sbin/rmmod mmfs26' hangs. I get kernel messages which I append below. I wonder if this looks familiar to somebody? Is it a known bug? I can avoid the issue if I reduce pagepool from 128G to 64G, as sketched below.
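For reference, roughly how I apply the workaround (a sketch; use -N instead of -a to restrict it to single nodes, and note the new pagepool size only takes effect when the daemon restarts):

    mmchconfig pagepool=64G            # was 128G
    mmshutdown -a                      # restart the daemons so the change takes effect
    mmstartup -a
    mmdiag --config | grep pagepool    # verify the value the daemon actually uses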
Running 'systemctl stop gpfs' shows the same issue: it forcefully terminates after a while, but 'rmmod' stays stuck. Two functions seem to be involved, cxiReleaseAndForgetPages and put_page; the first is part of GPFS, the second a kernel call.
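A back-of-the-envelope count suggests why this teardown can take so long (a sketch; it assumes 4K pages, while the traces below show that larger compound pages are involved too):

    pagesz=$(getconf PAGESIZE)           # 4096 on this system
    echo $(( 128 * 1024**3 / pagesz ))   # a 128G pagepool pins ~33.5 million pages,
                                         # i.e. that many put_page() calls from one CPU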
The servers have 256G memory and 72 (virtual) cores each. I run 5.0.1-1 on RHEL7.4 with kernel 3.10.0-693.17.1.el7.x86_64. I can try to switch back to 5.0.0.

Thank you & kind regards,
Heiner

Jul 11 14:12:04 node-1.x.y mmremote[1641]: Unloading module mmfs26
Jul 11 14:12:04 node-1.x.y mmsysmon[2440]: [E] Event raised: The Spectrum Scale service process not running on this node. Normal operation cannot be done
Jul 11 14:12:04 node-1.x.y mmsysmon[2440]: [I] Event raised: The Spectrum Scale service process is running
Jul 11 14:12:04 node-1.x.y mmsysmon[2440]: [E] Event raised: The node is not able to form a quorum with the other available nodes.
Jul 11 14:12:38 node-1.x.y sshd[2826]: Connection closed by xxx port 52814 [preauth]
Jul 11 14:12:41 node-1.x.y kernel: NMI watchdog: BUG: soft lockup - CPU#28 stuck for 23s! [rmmod:2695]
Jul 11 14:12:41 node-1.x.y kernel: Modules linked in: mmfs26(OE-) mmfslinux(OE) tracedev(OE) tcp_diag inet_diag rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE) ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_fpga_tools(OE) mlx5_ib(OE) mlx5_core(OE) mlxfw(OE) mlx4_en(OE) mlx4_ib(OE) ib_core(OE) vfat fat ext4 sb_edac edac_core intel_powerclamp coretemp intel_rapl iosf_mbi mbcache jbd2 kvm irqbypass crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd iTCO_wdt iTCO_vendor_support ipmi_ssif pcc_cpufreq hpilo ipmi_si sg hpwdt pcspkr i2c_i801 lpc_ich ipmi_devintf wmi ioatdma shpchp ipmi_msghandler acpi_power_meter binfmt_misc nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs libcrc32c sd_mod crc_t10dif crct10dif_generic mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect
Jul 11 14:12:41 node-1.x.y kernel: sysimgblt fb_sys_fops ttm ixgbe mlx4_core(OE) crct10dif_pclmul mdio mlx_compat(OE) crct10dif_common drm ptp crc32c_intel devlink hpsa pps_core i2c_core scsi_transport_sas dca dm_mirror dm_region_hash dm_log dm_mod [last unloaded: tracedev]
Jul 11 14:12:41 node-1.x.y kernel: CPU: 28 PID: 2695 Comm: rmmod Tainted: G W OEL ------------ 3.10.0-693.17.1.el7.x86_64 #1
Jul 11 14:12:41 node-1.x.y kernel: Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 01/22/2018
Jul 11 14:12:41 node-1.x.y kernel: task: ffff8808c4814f10 ti: ffff881619778000 task.ti: ffff881619778000
Jul 11 14:12:41 node-1.x.y kernel: RIP: 0010:[<ffffffff816a2970>] [<ffffffff816a2970>] put_compound_page+0xc3/0x174
Jul 11 14:12:41 node-1.x.y kernel: RSP: 0018:ffff88161977bd50 EFLAGS: 00000246
Jul 11 14:12:41 node-1.x.y kernel: RAX: 0000000000000283 RBX: 00000000fae3d201 RCX: 0000000000000284
Jul 11 14:12:41 node-1.x.y kernel: RDX: 0000000000000283 RSI: 0000000000000246 RDI: ffffea003d478000
Jul 11 14:12:41 node-1.x.y kernel: RBP: ffff88161977bd68 R08: ffff881ffae3d1e0 R09: 0000000180800059
Jul 11 14:12:41 node-1.x.y kernel: R10: 00000000fae3d201 R11: ffffea007feb8f40 R12: 00000000fae3d201
Jul 11 14:12:41 node-1.x.y kernel: R13: ffff88161977bd40 R14: 0000000000000000 R15: ffff88161977bd40
Jul 11 14:12:41 node-1.x.y kernel: FS: 00007f81a1db0740(0000) GS:ffff883ffee80000(0000) knlGS:0000000000000000
Jul 11 14:12:41 node-1.x.y kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 11 14:12:41 node-1.x.y kernel: CR2: 00007fa96e38f980 CR3: 0000000c36b2c000 CR4: 00000000001607e0
Jul 11 14:12:41 node-1.x.y kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jul 11 14:12:41 node-1.x.y kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jul 11 14:12:41 node-1.x.y kernel: Call Trace:
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffff81192275>] put_page+0x45/0x50
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffffc08e3562>] cxiReleaseAndForgetPages+0xb2/0x1c0 [mmfslinux]
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffffc08e3ae5>] cxiDeallocPageList+0x45/0x110 [mmfslinux]
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffff811e0b02>] ? kmem_cache_free+0x1e2/0x200
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffffc08e3cda>] cxiFreeSharedMemory+0x12a/0x130 [mmfslinux]
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffffc0c70c12>] kxFreeAllSharedMemory+0xe2/0x160 [mmfs26]
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffffc0c5bd15>] mmfs+0xc85/0xca0 [mmfs26]
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffffc08c8f16>] gpfs_clean+0x26/0x30 [mmfslinux]
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffffc0da5565>] cleanup_module+0x25/0x30 [mmfs26]
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffff8110044b>] SyS_delete_module+0x19b/0x300
Jul 11 14:12:41 node-1.x.y kernel: [<ffffffff816b89fd>] system_call_fastpath+0x16/0x1b
Jul 11 14:12:41 node-1.x.y kernel: Code: d1 00 00 00 4c 89 e7 e8 3a ff ff ff e9 c4 00 00 00 4c 39 e3 74 c1 41 8b 54 24 1c 85 d2 74 b8 8d 4a 01 89 d0 f0 41 0f b1 4c 24 1c <39> c2 74 04 89 c2 eb e8 e8 f3 f0 ae ff 49 89 c5 f0 41 0f ba 2c
Jul 11 14:13:23 node-1.x.y systemd[1]: gpfs.service stopping timed out. Terminating.
Jul 11 14:13:27 node-1.x.y kernel: NMI watchdog: BUG: soft lockup - CPU#28 stuck for 21s! [rmmod:2695]
Jul 11 14:13:27 node-1.x.y kernel: Modules linked in: mmfs26(OE-) mmfslinux(OE) tracedev(OE) tcp_diag inet_diag rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE) ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_fpga_tools(OE) mlx5_ib(OE) mlx5_core(OE) mlxfw(OE) mlx4_en(OE) mlx4_ib(OE) ib_core(OE) vfat fat ext4 sb_edac edac_core intel_powerclamp coretemp intel_rapl iosf_mbi mbcache jbd2 kvm irqbypass crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd iTCO_wdt iTCO_vendor_support ipmi_ssif pcc_cpufreq hpilo ipmi_si sg hpwdt pcspkr i2c_i801 lpc_ich ipmi_devintf wmi ioatdma shpchp ipmi_msghandler
Jul 11 14:13:27 node-1.x.y kernel: INFO: rcu_sched detected stalls on CPUs/tasks: { 28 } (detected by 17, t=60002 jiffies, g=267734, c=267733, q=36089)
Jul 11 14:13:27 node-1.x.y kernel: Task dump for CPU 28:
Jul 11 14:13:27 node-1.x.y kernel: rmmod R running task 0 2695 2642 0x00000008
Jul 11 14:13:27 node-1.x.y kernel: Call Trace:
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff811dea1c>] ? __free_slab+0xdc/0x200
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff816a28ad>] ? __put_compound_page+0x22/0x22
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff81192275>] ? put_page+0x45/0x50
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08e3562>] ? cxiReleaseAndForgetPages+0xb2/0x1c0 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08e3ae5>] ? cxiDeallocPageList+0x45/0x110 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08e3cda>] ? cxiFreeSharedMemory+0x12a/0x130 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc0c70c12>] ? kxFreeAllSharedMemory+0xe2/0x160 [mmfs26]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc0c5bd15>] ? mmfs+0xc85/0xca0 [mmfs26]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08c8f16>] ? gpfs_clean+0x26/0x30 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc0da5565>] ? cleanup_module+0x25/0x30 [mmfs26]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff8110044b>] ? SyS_delete_module+0x19b/0x300
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff816b89fd>] ? system_call_fastpath+0x16/0x1b
Jul 11 14:13:27 node-1.x.y kernel: acpi_power_meter binfmt_misc nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables xfs libcrc32c sd_mod crc_t10dif crct10dif_generic mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm ixgbe mlx4_core(OE) crct10dif_pclmul mdio mlx_compat(OE) crct10dif_common drm ptp crc32c_intel devlink hpsa pps_core i2c_core scsi_transport_sas dca dm_mirror dm_region_hash dm_log dm_mod [last unloaded: tracedev]
Jul 11 14:13:27 node-1.x.y kernel: CPU: 28 PID: 2695 Comm: rmmod Tainted: G W OEL ------------ 3.10.0-693.17.1.el7.x86_64 #1
Jul 11 14:13:27 node-1.x.y kernel: Hardware name: HP ProLiant DL380 Gen9/ProLiant DL380 Gen9, BIOS P89 01/22/2018
Jul 11 14:13:27 node-1.x.y kernel: task: ffff8808c4814f10 ti: ffff881619778000 task.ti: ffff881619778000
Jul 11 14:13:27 node-1.x.y kernel: RIP: 0010:[<ffffffff816a28ad>] [<ffffffff816a28ad>] __put_compound_page+0x22/0x22
Jul 11 14:13:27 node-1.x.y kernel: RSP: 0018:ffff88161977bd70 EFLAGS: 00000282
Jul 11 14:13:27 node-1.x.y kernel: RAX: 002fffff00008010 RBX: 0000000000000135 RCX: 00000000000001c1
Jul 11 14:13:27 node-1.x.y kernel: RDX: ffff8814adbbf000 RSI: 0000000000000246 RDI: ffffea00650e7040
Jul 11 14:13:27 node-1.x.y kernel: RBP: ffff88161977bd78 R08: ffff881ffae3df60 R09: 0000000180800052
Jul 11 14:13:27 node-1.x.y kernel: R10: 00000000fae3db01 R11: ffffea007feb8f40 R12: ffff881ffae3df60
Jul 11 14:13:27 node-1.x.y kernel: R13: 0000000180800052 R14: 00000000fae3db01 R15: ffffea007feb8f40
Jul 11 14:13:27 node-1.x.y kernel: FS: 00007f81a1db0740(0000) GS:ffff883ffee80000(0000) knlGS:0000000000000000
Jul 11 14:13:27 node-1.x.y kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 11 14:13:27 node-1.x.y kernel: CR2: 00007fa96e38f980 CR3: 0000000c36b2c000 CR4: 00000000001607e0
Jul 11 14:13:27 node-1.x.y kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jul 11 14:13:27 node-1.x.y kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Jul 11 14:13:27 node-1.x.y kernel: Call Trace:
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff81192275>] ? put_page+0x45/0x50
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08e3562>] cxiReleaseAndForgetPages+0xb2/0x1c0 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08e3ae5>] cxiDeallocPageList+0x45/0x110 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08e3cda>] cxiFreeSharedMemory+0x12a/0x130 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc0c70c12>] kxFreeAllSharedMemory+0xe2/0x160 [mmfs26]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc0c5bd15>] mmfs+0xc85/0xca0 [mmfs26]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc08c8f16>] gpfs_clean+0x26/0x30 [mmfslinux]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffffc0da5565>] cleanup_module+0x25/0x30 [mmfs26]
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff8110044b>] SyS_delete_module+0x19b/0x300
Jul 11 14:13:27 node-1.x.y kernel: [<ffffffff816b89fd>] system_call_fastpath+0x16/0x1b
Jul 11 14:13:27 node-1.x.y kernel: Code: c0 0f 95 c0 0f b6 c0 5d c3 0f 1f 44 00 00 55 48 89 e5 53 48 8b 07 48 89 fb a8 20 74 05 e8 0c f8 ae ff 48 89 df ff 53 60 5b 5d c3 <0f> 1f 44 00 00 55 48 89 e5 41 55 41 54 53 48 8b 07 48 89 fb f6

--
Paul Scherrer Institut
Science IT
Heiner Billich
WHGA 106
CH 5232 Villigen PSI
056 310 36 02
https://www.psi.ch
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at spectrumscale.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss
