On Mon, 2019-10-07 at 19:22 +0300, Tomer Perry wrote:

[SNIP]

> 
> So, do you experience large number of node expels/crashes etc. that
> might be related to that ( otherwise, it might be some other bug that
> needs to be fixed...). 
> 

Not as far as I can determine. The logs show only 58 expels in the last
six months and around 2/3rds of those where on essentially dormant
nodes that where being used for development work on fixing issues with
the xcat node deployment for the compute nodes (triggering an rinstall
on a node that was up with GPFS mounted but actually doing nothing).

I have done an mmcheckquota which didn't take long to complete and now
I the "in doubt" is a more reasonable sub 10GB. I shall monitor what
happens more closely in future.


JAB.

-- 
Jonathan A. Buzzard                         Tel: +44141-5483420
HPC System Administrator, ARCHIE-WeSt.
University of Strathclyde, John Anderson Building, Glasgow. G4 0NG



_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

Reply via email to