Looks like that didn't fix it. One of the login nodes repeated the behavior. The strange thing is that the MDS does not show anything about the NID of the client. The client just says it lost connection with it, but the MDS never says it has not heard from the client and is kicking it out.
Very strange. Brock Palen www.umich.edu/~brockp Center for Advanced Computing [EMAIL PROTECTED] (734)936-1985 On Sep 4, 2008, at 11:34 PM, Brock Palen wrote: > >>> >>> Is this enough information? >> >> Probably. If you are running 1.6.5, try disabling statahead on >> all of >> your clients... >> >> # echo 0 > /proc/fs/lustre/.../statahead_max > > I thought statahead was fixed in 1.6.5 ? Main reason we upgraded. > Login nodes already are showing that behavior again. > I will try it out > >> >> Of course, this setting goes back to it's default of 32 on a reboot. >> >> b. >> >> _______________________________________________ >> Lustre-discuss mailing list >> Lustre-discuss@lists.lustre.org >> http://lists.lustre.org/mailman/listinfo/lustre-discuss > > _______________________________________________ > Lustre-discuss mailing list > Lustre-discuss@lists.lustre.org > http://lists.lustre.org/mailman/listinfo/lustre-discuss > > _______________________________________________ Lustre-discuss mailing list Lustre-discuss@lists.lustre.org http://lists.lustre.org/mailman/listinfo/lustre-discuss