I have a client (one of our login nodes) that was evicted by one of  
the OST's but not both of them.  So some files are accessible others  
are not.  Strange thing is that both the OST's live on the same OSS.

The errors in dmesg are:

LustreError: 11-0: an error occurred while communicating with  
[EMAIL PROTECTED] The obd_ping operation failed with -107
Lustre: nobackup-OST0001-osc-000001007d548400: Connection to service  
nobackup-OST0001 via nid [EMAIL PROTECTED] was lost; in progress  
operations using this service will wait for recovery to complete.
LustreError: 167-0: This client was evicted by nobackup-OST0001; in  
progress operations using this service will fail.
LustreError: 29595:0:(file.c:1052:ll_glimpse_size()) obd_enqueue  
returned rc -5, returning -EIO
LustreError: 29629:0:(file.c:1052:ll_glimpse_size()) obd_enqueue  
returned rc -5, returning -EIO


OST0000 also lives at 141.212.30.181, so its strange that only one  
will kill it off.  Is there a way to ask lustre to restore this?  Up  
till this point, the client would recover quickly, but this time its  
just waiting.

Brock Palen
Center for Advanced Computing
[EMAIL PROTECTED]
(734)936-1985


_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Reply via email to