Hi, This is a settings, we had the exact same issue in the past and when we change the following parameters it has never killed the CES nodes anymore.
maxFilesToCache=1000000 maxStatCache=100000 Thanks Christian On Wed, 6 Sept 2023 at 20:59, Christoph Martin <[email protected]> wrote: > Hi all, > > on a three node GPFS cluster with CES enabled and AFM-DR mirroring to a > second cluster we see frequent OOM killer events due to a constantly > growing mmfsd. > The machines have 256G memory. The pagepool is configured to 16G. > The GPFS version is 5.1.6-1. > After a restart mmfsd rapidly grows to about 100G usage and grows over > some days up to 250G virtual and 220G physical memory usage. > OOMkiller tries kill process like pmcollector or others and sometime > kills mmfsd. > > Does anybody see a similar behavior? > Any guess what could help with this problem? > > Regards > Christoph Martin > > -- > Christoph Martin > Zentrum für Datenverarbeitung (ZDV) > Leiter Unix & Cloud > > Johannes Gutenberg-Universität Mainz > Anselm Franz von Bentzel-Weg 12, 55128 Mainz > Tel: +49 6131 39 26337 > [email protected] > www.zdv.uni-mainz.de > > _______________________________________________ > gpfsug-discuss mailing list > gpfsug-discuss at gpfsug.org > http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org > -- Med Vänliga Hälsningar Christian Petersson E-Post: [email protected] Mobil: 070-3251577
_______________________________________________ gpfsug-discuss mailing list gpfsug-discuss at gpfsug.org http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org
