The problem I was refering to: With the new filesystem we just created I am getting the following problem,
clients loose connection to the MGS and the MGS says it evicted them, machines are on the same network and there is no errors on the interfaces. The MGS says: Lustre: MGS: haven't heard from client e8eb1779-5cea-9cc7- b5ae-4c5ccf54f5ca (at [EMAIL PROTECTED]) in 240 seconds. I think it's dead, and I am evicting it. LustreError: 9103:0:(mgs_handler.c:538:mgs_handle()) lustre_mgs: operation 400 on unconnected MGS LustreError: 9103:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@ processing error (-107) [EMAIL PROTECTED] x24929/t0 o400-><?>@<?>: 0/0 lens 128/0 e 0 to 0 dl 1218142953 ref 1 fl Interpret:/0/0 rc -107/0 The "operation 400 on unconnected MGS" is the only new message I am not familiar with. Once the client losses connection with the MGS I will see the OST's start booting the client also. Servers are 1.6.5.1 clients are patch-less 1.6.4.1 on RHEL4. Any insight would be great. Brock Palen www.umich.edu/~brockp Center for Advanced Computing [EMAIL PROTECTED] (734)936-1985 _______________________________________________ Lustre-discuss mailing list [email protected] http://lists.lustre.org/mailman/listinfo/lustre-discuss
