On Wed, Aug 6, 2008 at 10:17 AM, Brian J. Murrell <[EMAIL PROTECTED]> wrote:
> But this kind of eviction is simply due to clients that are unresponsive
> from the POV of the MDS.  They are neither making filesystem RPC nor are
> they "ping"ing (keepalives) so the MDS assumes they have died and evicts
> them to get back the locks it could be holding and not having that dead
> client holding up other, living clients.
>
> So you need to investigate why the clients are dying or appear to be
> dead (i.e. going silent) to the MDS.

Is there anything in /proc or /sys I can look at to see whatever
"keepalive" parameters are setup?

The systems aren't dying.

I need to know how to least obtrusively force the clients to keep
pinging, or tell the MDS to give them a longer time before timeout.

I don't see why this only effects the RHEL5 clients.  Maybe that's a hint.

Thanks,

Chris
_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Reply via email to