I'm running Lustre 1.6.4.1 with patches 14006, 14007, and 14008 applied.
They all relate to NFS. On the system serving the NFS mounts I
frequently see this:

LustreError: 11-0: an error occurred while communicating with [EMAIL PROTECTED] The mds_getattr_lock operation failed with -13
LustreError: 28508:0:(llite_nfs.c:243:ll_get_parent()) failure -13 inode 22878046 get parent

Periodically (every 8 hours or so) the server also crashes under load.
The following errors appear on the MDS and OSSs:

LustreError: 138-a: data-MDT0000: A client on nid [EMAIL PROTECTED] was evicted due to a lock blocking callback to [EMAIL PROTECTED] timed out: rc -107
Lustre: MGS: haven't heard from client c0197fd1-42b6-5517-49f4-43470769cc6d (at [EMAIL PROTECTED]) in 238 seconds. I think it's dead, and I am evicting it.
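
For reference, the negative numbers in these messages look like negated Linux errno values, which on Linux would make -13 EACCES (permission denied) and -107 ENOTCONN (transport endpoint is not connected). Below is a minimal Python sketch for decoding them, assuming that convention holds here. The helper name decode_rc is hypothetical and not part of Lustre, and the lookup uses the errno table of whatever machine runs it, so it should be run on Linux.

```python
import errno
import os


def decode_rc(rc):
    """Map an errno-style return code (e.g. -13) to (symbolic name, description).

    Assumption: the code is a negated errno value; this helper is illustrative only.
    """
    code = abs(rc)
    # errno.errorcode maps numeric values to names for the local platform.
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)


# The two return codes that appear in the log messages above.
for rc in (-13, -107):
    name, text = decode_rc(rc)
    print(f"rc {rc}: {name} ({text})")
```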


Any ideas?

-Aaron

Aaron Knister
Associate Systems Analyst
Center for Ocean-Land-Atmosphere Studies

(301) 595-7000
[EMAIL PROTECTED]




_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss
