Hi,

Just quickly looking at the log you've posted, it looks like the client is timing out on its requests to the MDS because the network is overloaded.
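In case it helps with reading the log, here is a minimal Python sketch that maps the return codes to their errno names. It assumes the rc values Lustre prints are negated Linux errno numbers, and it hardcodes only the codes that appear in the quoted log. The names LINUX_ERRNO, RC_PATTERN and decode_return_codes are made up for this example and are not part of any Lustre tool.

```python
import re

# Linux errno numbers for the codes seen in the quoted log. They are
# hardcoded (assumption: the log comes from a Linux x86 host) so the
# sketch gives the same answer on whatever machine it is run on.
LINUX_ERRNO = {
    4: "EINTR",        # interrupted system call
    43: "EIDRM",       # identifier removed
    107: "ENOTCONN",   # transport endpoint is not connected
    108: "ESHUTDOWN",  # cannot send after transport endpoint shutdown
    110: "ETIMEDOUT",  # connection timed out
    116: "ESTALE",     # stale file handle
}

# Covers the spellings used in the log: "rc -110", "rc = -108",
# "failed with -107", "errno: -43", "ldlm_cli_enqueue: -108", "failure -108".
RC_PATTERN = re.compile(
    r"\b(?:rc\s*=?|failed with|errno:|ldlm_cli_enqueue:|failure)\s*-(\d+)"
)


def decode_return_codes(line):
    """Return a list of (code, errno_name) pairs found in one log line."""
    pairs = []
    for digits in RC_PATTERN.findall(line):
        number = int(digits)
        pairs.append((-number, LINUX_ERRNO.get(number, "UNKNOWN")))
    return pairs


if __name__ == "__main__":
    sample = [
        "LustreError: 4994:0:(dir.c:384:ll_readdir_18()) error reading dir "
        "575283686/935610515 page 0: rc -110",
        "The mds_readpage operation failed with -107",
        "inode 17928860 mdc close failed: rc = -108",
    ]
    for entry in sample:
        print(decode_return_codes(entry))
```

Read this way, the readdir failures are timeouts (-110), and the -107 and -108 codes that follow line up with the connection to the MDT being lost and the client being evicted.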
-cf

On 10/27/2011 10:08 AM, David Noriega wrote:
> I get these errors, any ideas? Running Lustre 1.8.4. This client is
> also the server where we nfs export the filesystem.
>
> LustreError: 4994:0:(dir.c:384:ll_readdir_18()) error reading dir
> 575283686/935610515 page 0: rc -110
> LustreError: 11-0: an error occurred while communicating with
> 192.168.5.104@tcp. The mds_readpage operation failed with -107
> LustreError: 28410:0:(dir.c:384:ll_readdir_18()) error reading dir
> 579577179/4015460576 page 0: rc -110
> LustreError: Skipped 12 previous similar messages
> Lustre: lustre-MDT0000-mdc-ffff810338e81400: Connection to service
> lustre-MDT0000 via nid 192.168.5.104@tcp was lost; in progress
> operations using this service will wait for recovery to complete.
> LustreError: 167-0: This client was evicted by lustre-MDT0000; in
> progress operations using this service will fail.
> LustreError: 25118:0:(client.c:858:ptlrpc_import_delay_req()) @@@
> IMP_INVALID req@ffff8101f87d8c00 x1383759180968916/t0
> o35->[email protected]@tcp:23/10 lens 408/1128 e 0 to
> 1 dl 0 ref 1 fl Rpc:/0/0 rc 0/0
> LustreError: 25118:0:(file.c:116:ll_close_inode_openhandle()) inode
> 17928860 mdc close failed: rc = -108
> LustreError: 25118:0:(mdc_locks.c:646:mdc_enqueue()) ldlm_cli_enqueue: -108
> LustreError: 9199:0:(file.c:116:ll_close_inode_openhandle()) inode
> 579577179 mdc close failed: rc = -108
> LustreError: 9199:0:(file.c:116:ll_close_inode_openhandle()) Skipped 1
> previous similar message
> Lustre: lustre-MDT0000-mdc-ffff810338e81400: Connection restored to
> service lustre-MDT0000 using nid 192.168.5.104@tcp.
> nfsd: non-standard errno: -43
> nfsd: non-standard errno: -43
> LustreError: 4994:0:(dir.c:384:ll_readdir_18()) error reading dir
> 575283686/935610515 page 0: rc -110
> LustreError: 4994:0:(dir.c:384:ll_readdir_18()) Skipped 29 previous
> similar messages
> LustreError: 11-0: an error occurred while communicating with
> 192.168.5.104@tcp. The mds_readpage operation failed with -107
> Lustre: lustre-MDT0000-mdc-ffff810338e81400: Connection to service
> lustre-MDT0000 via nid 192.168.5.104@tcp was lost; in progress
> operations using this service will wait for recovery to complete.
> LustreError: 167-0: This client was evicted by lustre-MDT0000; in
> progress operations using this service will fail.
> LustreError: 4994:0:(client.c:858:ptlrpc_import_delay_req()) @@@
> IMP_INVALID req@ffff8102a576c000 x1383759180969003/t0
> o37->[email protected]@tcp:23/10 lens 408/600 e 0 to 1
> dl 0 ref 1 fl Rpc:/0/0 rc 0/0
> LustreError: 4994:0:(client.c:858:ptlrpc_import_delay_req()) Skipped
> 34 previous similar messages
> nfsd: non-standard errno: -108
> nfsd: non-standard errno: -4
> nfsd: non-standard errno: -4
> nfsd: non-standard errno: -108
> LustreError: 25118:0:(file.c:116:ll_close_inode_openhandle()) inode
> 17928860 mdc close failed: rc = -4
> LustreError: 25118:0:(file.c:116:ll_close_inode_openhandle()) Skipped
> 1 previous similar message
> LustreError: 25118:0:(mdc_locks.c:646:mdc_enqueue()) ldlm_cli_enqueue: -108
> LustreError: 25118:0:(mdc_locks.c:646:mdc_enqueue()) Skipped 4
> previous similar messages
> LustreError: 28407:0:(file.c:3280:ll_inode_revalidate_fini()) failure
> -108 inode 558497795
> LustreError: 28407:0:(file.c:3280:ll_inode_revalidate_fini()) Skipped
> 3 previous similar messages
> nfsd: non-standard errno: -108
> Lustre: lustre-MDT0000-mdc-ffff810338e81400: Connection restored to
> service lustre-MDT0000 using nid 192.168.5.104@tcp.
> LustreError: 11-0: an error occurred while communicating with
> 192.168.5.104@tcp. The mds_close operation failed with -116
> LustreError: Skipped 1 previous similar message
> LustreError: 28407:0:(file.c:116:ll_close_inode_openhandle()) inode
> 558497794 mdc close failed: rc = -116
> LustreError: 28407:0:(file.c:116:ll_close_inode_openhandle()) Skipped
> 4 previous similar messages
> LustreError: 11-0: an error occurred while communicating with
> 192.168.5.104@tcp. The mds_close operation failed with -116
>
>
_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss
