On Thu, 2008-08-07 at 12:06 -0400, Brock Palen wrote: > In doing some testing with our new hardware I did the following: > > I rebooted the active MDS server, it failed over to the second one as > expected. While this was happening a client was reset. > > When the MDS came up on the new server by heartbeat it went into > recovery as expected. The MDS now has been in recovery for 1.5 > hours. I don't think this is normal. > > What would cause this? I know by having a client go down (the reset > above) while the MDS is down but before recovery will cause recovery > to time out but 1.5 hours is unacceptable time to wait for the file > system to come back. > > This is a stock 1.6.5.1 install.
Hrm. Can you provide the syslog from the backup MDS from the time it was mounted until present? b.
signature.asc
Description: This is a digitally signed message part
_______________________________________________ Lustre-discuss mailing list [email protected] http://lists.lustre.org/mailman/listinfo/lustre-discuss
