Hi,

yet another question prompted by our recent bad luck with our MDS: when a restarted MDS goes into recovery, it reports an ETA in /proc/fs/lustre/mds/Name/recovery_status.

How is this time calculated? I'm asking because in our recent cases, recovery started with ETAs of 15000 - 21000 sec. We are using Lustre 1.6.5.1 with adaptive timeouts enabled, and I wonder how the adaptive timeout min/max values affect the recovery time.

Our old test cluster does not use adaptive timeouts, only the notorious static timeout of 1000s. Recovery of that MDS took ~3000s the last time.

Regards,
Thomas
-- 
Thomas Roth
GSI, IT
Planckstr. 1, 64291 Darmstadt, Germany
http://www.gsi.de/informationen/wti/it/index.html
