The problem I was refering to:

With the new filesystem we just created I am getting the following  
problem,

clients loose connection to the MGS and the MGS says it evicted  
them,  machines are on the same network and there is no errors on the  
interfaces.  The MGS  says:

Lustre: MGS: haven't heard from client e8eb1779-5cea-9cc7- 
b5ae-4c5ccf54f5ca (at [EMAIL PROTECTED]) in 240 seconds. I think it's  
dead, and I am evicting it.
LustreError: 9103:0:(mgs_handler.c:538:mgs_handle()) lustre_mgs:  
operation 400 on unconnected MGS
LustreError: 9103:0:(ldlm_lib.c:1536:target_send_reply_msg()) @@@  
processing error (-107)  [EMAIL PROTECTED] x24929/t0 o400-><?>@<?>: 
0/0 lens 128/0 e 0 to 0 dl 1218142953 ref 1 fl Interpret:/0/0 rc -107/0


The "operation 400 on unconnected MGS"  is the only new message I am  
not familiar with.  Once the client losses connection with the MGS I  
will see the OST's start booting the client also.


Servers are 1.6.5.1  clients are patch-less 1.6.4.1  on RHEL4.

Any insight would be great.

Brock Palen
www.umich.edu/~brockp
Center for Advanced Computing
[EMAIL PROTECTED]
(734)936-1985



_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Reply via email to