Dear All,
I am seeing following errors on MDS:
Nov 1 12:14:13 mds01 kernel: LustreError: 17076:0:(mds_open.c:
1474:mds_close()) Skipped 139 previous similar messages
Nov 1 12:14:27 mds01 kernel: LustreError: 17088:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 26997837: cookie
0x47a2a9d95b67cfb6 [EMAIL PROTECTED] x32950/t0 o35->451db1a1-8c58-
[EMAIL PROTECTED]:-1 lens 296/560 ref 0
fl Interpret:/0/0 rc 0/0
Nov 1 12:14:32 mds01 kernel: LustreError: 17089:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 28697676: cookie
0x47a2a9d964ee6f0e [EMAIL PROTECTED] x28502/t0 o35->e838fcbc-4b8c-
[EMAIL PROTECTED]:-1 lens 296/560 ref 0
fl Interpret:/0/0 rc 0/0
Nov 1 12:14:32 mds01 kernel: LustreError: 17089:0:(mds_open.c:
1474:mds_close()) Skipped 4 previous similar messages
Nov 1 12:14:47 mds01 kernel: LustreError: 17061:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 28697676: cookie
0x47a2a9d964ee7ade [EMAIL PROTECTED] x113640/t0 o35->d774ea81-
[EMAIL PROTECTED]:-1 lens 296/560
ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:14:47 mds01 kernel: LustreError: 17061:0:(mds_open.c:
1474:mds_close()) Skipped 2 previous similar messages
Nov 1 12:15:47 mds01 kernel: Lustre: ddn-home-MDT0000: haven't heard
from client 9e6c2d9a-1649-3c61-0fda-b5052af0e09f (at [EMAIL PROTECTED])
in 227 seconds. I think it's dead, and I am evicting it.
Nov 1 12:15:47 mds01 kernel: Lustre: Skipped 33 previous similar
messagesNov 1 12:15:55 mds01 kernel: LustreError: 17076:0:
(mds_open.c:1474:mds_close()) @@@ no handle for file close ino
25301776: cookie 0x47a2a9d95b4c6238 [EMAIL PROTECTED] x211014/t0
o35->[EMAIL PROTECTED]:-1
lens 296/560 ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:15:55 mds01 kernel: LustreError: 17076:0:(mds_open.c:
1474:mds_close()) Skipped 24 previous similar messagesNov 1 12:15:55
mds01 kernel: LustreError: 17076:0:(ldlm_lib.c:
1437:target_send_reply_msg()) @@@ processing error (-116)
[EMAIL PROTECTED] x211014/t0 o35->31eec1e1-1f7d-a43b-
[EMAIL PROTECTED]:-1 lens 296/560 ref 0 fl
Interpret:/0/0 rc -116/0
Nov 1 12:15:55 mds01 kernel: LustreError: 17076:0:(ldlm_lib.c:
1437:target_send_reply_msg()) Skipped 39 previous similar
messagesNov 1 12:17:00 mds01 kernel: LustreError: 16649:0:
(mds_open.c:1474:mds_close()) @@@ no handle for file close ino
28968880: cookie 0x47a2a9d95bcda4b2 [EMAIL PROTECTED] x58156/t0
o35->[EMAIL PROTECTED]:-1
lens 296/560 ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:17:00 mds01 kernel: LustreError: 16649:0:(mds_open.c:
1474:mds_close()) Skipped 2 previous similar messages
Nov 1 12:18:00 mds01 kernel: LustreError: 17066:0:(mds_open.c:
1474:mds_close()) @@@ no handle for file close ino 28968880: cookie
0x47a2a9d95bcd8dd6 [EMAIL PROTECTED] x42691/t0 o35->0a5afd52-
[EMAIL PROTECTED]:-1 lens 296/560
ref 0 fl Interpret:/0/0 rc 0/0
Nov 1 12:18:00 mds01 kernel: LustreError: 17066:0:(mds_open.c:
1474:mds_close()) Skipped 1 previous similar message
and OSS are showing following errors:
Nov 1 12:21:35 storage10.beowulf.cluster kernel: LustreError:
23337:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
[EMAIL PROTECTED] x434527/t0 o101->[EMAIL PROTECTED]@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:21:35 storage10.beowulf.cluster kernel: LustreError:
23337:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Nov 1 12:21:59 storage08.beowulf.cluster kernel: LustreError:
22609:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
[EMAIL PROTECTED] x755145/t0 o101->[EMAIL PROTECTED]@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:21:59 storage08.beowulf.cluster kernel: LustreError:
22609:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Nov 1 12:23:31 storage09.beowulf.cluster kernel: LustreError:
23045:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
[EMAIL PROTECTED] x511984/t0 o101->[EMAIL PROTECTED]@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:23:31 storage09.beowulf.cluster kernel: LustreError:
23045:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Nov 1 12:24:32 storage07.beowulf.cluster kernel: LustreError:
22220:0:(client.c:519:ptlrpc_import_delay_req()) @@@ IMP_INVALID
[EMAIL PROTECTED] x1064767/t0 o101->[EMAIL PROTECTED]@tcp_0:26
lens 232/240 ref 1 fl Rpc:/0/0 rc 0/0
Nov 1 12:24:32 storage07.beowulf.cluster kernel: LustreError:
22220:0:(client.c:519:ptlrpc_import_delay_req()) Skipped 119 previous
similar messages
Does anybody has an idea what can be the reason of this errors?
My system consist of 4 OSS, 24 OST, 1 MDS, 585 clients
Lustre version is 1.6.3
Kernel version on the whole cluster is 2.6.9-55.0.9.EL_lustre.1.6.3smp
Thanks for you help!
Mr Wojciech Turek
Assistant System Manager
University of Cambridge
High Performance Computing service
email: [EMAIL PROTECTED]
tel. +441223763517
_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss