I'm running Lustre 1.6.4.1 with patches 14006, 14007, and 14008 applied, all of which relate to NFS. On the system serving the NFS mounts I frequently see this:
LustreError: 11-0: an error occurred while communicating with [EMAIL PROTECTED] The mds_getattr_lock operation failed with -13
LustreError: 28508:0:(llite_nfs.c:243:ll_get_parent()) failure -13 inode 22878046 get parent

Periodically (every 8 hours or so) the server also crashes under load. The following errors are found on the MDS and OSSs:

LustreError: 138-a: data-MDT0000: A client on nid [EMAIL PROTECTED] was evicted due to a lock blocking callback to [EMAIL PROTECTED] timed out: rc -107
Lustre: MGS: haven't heard from client c0197fd1-42b6-5517-49f4-43470769cc6d (at [EMAIL PROTECTED]) in 238 seconds. I think it's dead, and I am evicting it.

Any ideas?

-Aaron

Aaron Knister
Associate Systems Analyst
Center for Ocean-Land-Atmosphere Studies
(301) 595-7000
[EMAIL PROTECTED]
