I have a client (one of our login nodes) that was evicted by one of the OSTs but not both of them, so some files are accessible and others are not. The strange thing is that both OSTs live on the same OSS.
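For anyone wanting to check which targets evicted a client, here is a minimal sketch that scans dmesg text for the eviction message. It assumes the "LustreError: 167-0" wording seen on this client; the function name and the sample string are only illustrative, and other Lustre versions may word the message differently.

```python
import re

# Assumption: the eviction message looks like the one logged on this client:
#   LustreError: 167-0: This client was evicted by <target>; ...
EVICTED = re.compile(r"LustreError: 167-0: This client was evicted by (\S+?);")

def evicted_targets(dmesg_text):
    """Return the sorted, de-duplicated target names that evicted this client."""
    return sorted(set(EVICTED.findall(dmesg_text)))

# Illustrative sample in the assumed message format.
sample = (
    "LustreError: 167-0: This client was evicted by nobackup-OST0001; "
    "in progress operations using this service will fail.\n"
)
print(evicted_targets(sample))
```

Passing the captured text of dmesg to evicted_targets() lists each evicting target once, which makes it easy to see that only one of the two OSTs is involved.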
The errors in dmesg are:

LustreError: 11-0: an error occurred while communicating with [EMAIL PROTECTED] The obd_ping operation failed with -107
Lustre: nobackup-OST0001-osc-000001007d548400: Connection to service nobackup-OST0001 via nid [EMAIL PROTECTED] was lost; in progress operations using this service will wait for recovery to complete.
LustreError: 167-0: This client was evicted by nobackup-OST0001; in progress operations using this service will fail.
LustreError: 29595:0:(file.c:1052:ll_glimpse_size()) obd_enqueue returned rc -5, returning -EIO
LustreError: 29629:0:(file.c:1052:ll_glimpse_size()) obd_enqueue returned rc -5, returning -EIO

OST0000 also lives at 141.212.30.181, so it's strange that only one of the two OSTs evicted the client.

Is there a way to ask Lustre to restore this connection? Up until this point, the client would recover quickly, but this time it's just waiting.

Brock Palen
Center for Advanced Computing
[EMAIL PROTECTED]
(734)936-1985
