On 17/02/2026 08:39, Mathias Dietz wrote:
Hi Jonathan,
Thank you for your question regarding the swapped_warn event.
In IBM Spectrum Scale, different categories of health events serve
different purposes:
*State-change events (such as degraded, failed, etc.)* indicate actual
problems or failures. These events typically resolve automatically; only
a small number require manual resolution using the mmhealth event
resolve command.
*TIP events* provide recommendations or highlight minor issues,
best‑practice deviations, or configuration optimizations. These events
do not indicate a failure. If desired, TIP events can be permanently
hidden using mmhealth event hide, and once hidden they will not appear
again.
The *swapped_warn *event falls into the *TIP event category*.
You are correct that a small amount of swap usage is not necessarily
problematic, especially when plenty of RAM is available and the system
is not actively swapping. However, we have seen multiple real-world
cases where even moderate swap usage negatively impacted system
responsiveness and overall Scale performance.
> > Because of this, the TIP is intended to help users who want to extract
maximum performance from their systems. If swap usage is not a concern
in your environment, you can safely hide the TIP and continue operating
normally.
I agree with everything you have said, but the problem is that GPFS is
warning about swap usage even though no swap usage is occurring.
That there is usage of swap space is not, and has not been, for over
quarter of a century now, a valid measure of whether the Linux kernel is
paging memory in/out of disk.
You can determine if the kernel is actually paging memory to and from
disk with the vmstat command. Checking the nodes reporting the
swapped_warn tip shows that they have not paged a single memory page
since the last reboot, despite swap usage.
As mentioned earlier, the Linux kernel will preemptively write memory
pages to swap space as a precaution, so swap usage does not reflect what
the GPFS developers think it does.
I don't want to hide the tip because it would be useful if it actually
did what it said on the tin. The problem is that, at the moment, due to
faulty assumptions, the tip is as useful as a chocolate teapot.
The TL;DR is that GPFS needs to switch to using a valid measure of paging.
JAB.
--
Jonathan A. Buzzard Tel: +44141-5483420
HPC System Administrator, ARCHIE-WeSt.
University of Strathclyde, John Anderson Building, Glasgow. G4 0NG
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at gpfsug.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org