[gpfsug-discuss] How to properly debug CES / Ganesha?

Leonardo Sala Fri, 25 Aug 2023 07:48:24 -0700

Hallo,

since some time we do have seemingly random issues with a particularcustomer accessing data over Ganesha / CES (5.1.8). What happens is thatthe CES server owning their IP gets a very high cpu load, and everyoperation on the NFS clients become sluggish. It does seem not relatedto throughput, and looking at the metrics [*] I do not see a correlationwith e.g. increased NFS ops. I see no events in GPFS, and nothingsuspicious in the ganesha and gpfs log files.

What would be a good procedure to identify the misbehaving client (Isuspect NFS, as it seems there is only 1 idle SMB client)? I have putnow LOGLEVEL=INFO in ganesha to see if I catch anything interesting, butI would be curious on how this kind of apparently random issues could bebetter debugged and restricted to a client


Thanks a lot!

regards

leo

[*]

for i in read write; do for j in ops queue lat req err; do mmperfmonquery "ces-server|NFSIO|/export/path|NFSv41|nfs_${i}_$j"2023-08-25-14:40:00 2023-08-25-15:05:00 -b60; done; done



--
Paul Scherrer Institut
Dr. Leonardo Sala
Group Leader Data Analysis and Research Infrastructure
Group Leader Data Curation a.i.
Deputy Department Head Science IT Infrastructure and Services department
Science IT Infrastructure and Services department (AWI)
WHGA/036
Forschungstrasse 111
5232 Villigen PSI
Switzerland

Phone: +41 56 310 3369
[email protected]
www.psi.ch

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at gpfsug.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss_gpfsug.org

[gpfsug-discuss] How to properly debug CES / Ganesha?

Reply via email to