Hi,

I have another Lustre error I'm trying to debug. Again, the system is a
2.6.15 kernel with Lustre 1.4.6.

When I run tar -cvf /local/foo.tar /lustre/user/job0815/ it works for
a while and then I get:

OST Errors:
--------------------

[3458604.789746] LustreError: 4193:0:(filter_io.c:373:filter_preprw_read()) io 
error -5
[3458606.258634] LustreError: 4213:0:(filter_io.c:373:filter_preprw_read()) io 
error -5
[3458606.267275] LustreError: 4213:0:(filter_io.c:373:filter_preprw_read()) 
previously skipped 2 similar messages
[3458609.444835] LustreError: 4193:0:(filter_io.c:373:filter_preprw_read()) io 
error -5
[3458609.453534] LustreError: 4193:0:(filter_io.c:373:filter_preprw_read()) 
previously skipped 5 similar messages
[3458616.005135] LustreError: 4189:0:(filter_io.c:373:filter_preprw_read()) io 
error -5
[3458616.013779] LustreError: 4189:0:(filter_io.c:373:filter_preprw_read()) 
previously skipped 7 similar messages

Client Syslog:
-----------------------------

Nov 28 17:28:53 ni-01-01 kernel: [533212.314689] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x462629/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:54 ni-01-01 kernel: [533213.109922] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x463031/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:54 ni-01-01 kernel: [533213.176032] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x463059/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:55 ni-01-01 kernel: [533213.795154] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x463220/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:55 ni-01-01 kernel: [533214.209407] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x463335/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:55 ni-01-01 kernel: [533214.465867] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x463449/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:55 ni-01-01 kernel: [533214.633638] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x463503/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:56 ni-01-01 kernel: [533214.978952] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x463679/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:58 ni-01-01 kernel: [533216.982179] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x464427/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:58 ni-01-01 kernel: [533217.005021] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) previously skipped 1 similar 
messages
Nov 28 17:28:59 ni-01-01 kernel: [533218.202467] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x465117/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:28:59 ni-01-01 kernel: [533218.225331] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) previously skipped 1 similar 
messages
Nov 28 17:29:02 ni-01-01 kernel: [533221.130829] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) @@@ type == PTL_RPC_MSG_ERR, err 
== -5 [EMAIL PROTECTED] x466182/t0 o3->[EMAIL PROTECTED]:28 lens 328/280 ref 2 
fl Rpc:R/0/0 rc 0/-5
Nov 28 17:29:02 ni-01-01 kernel: [533221.153689] LustreError: 
10462:0:(client.c:577:ptlrpc_check_status()) previously skipped 2 similar 
messages


After the error, stopping tar and running "cat file_with_error >/dev/null"
on the same host works fine. So I assume the cause is neither a failing
physical drive nor a filesystem error on the medium.
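
For anyone who wants to repeat that re-read check over the whole directory
instead of a single file, here is a rough sketch in Python. The function
name find_unreadable and the 1 MiB chunk size are my own choices for
illustration and have nothing to do with Lustre itself. It only reads each
file to the end and records the errno name of any failure (the -5 in the
logs above is EIO on Linux).

```python
import errno
import os
import sys


def find_unreadable(root, chunk_size=1 << 20):
    """Read every regular file under root; return (path, errno name) per failure."""
    failures = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.isfile(path):
                continue  # skip dangling symlinks, sockets, and the like
            try:
                with open(path, "rb") as handle:
                    # Discard the data; only read errors are of interest.
                    while handle.read(chunk_size):
                        pass
            except OSError as exc:
                code = errno.errorcode.get(exc.errno, str(exc.errno))
                failures.append((path, code))
    return failures


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python reread.py /lustre/user/job0815/
    for path, code in find_unreadable(sys.argv[1]):
        print(f"{code}\t{path}")
```

Note that this reads files one at a time, so a clean run does not rule out
a problem that only shows up under the access pattern tar produces.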

Any tips?

MfG
        Goswin
