When the server stops responding, what processes are running and what are they doing at the time?

On Wed, Dec 10, 2008 at 6:16 AM, Eric Chris Garrison <[EMAIL PROTECTED]> wrote:

> Hello,
>
> A couple of months ago, I upgraded our OpenAFS servers to 1.4.7. Three
> weeks ago, a problem where the main metadata server (1st of 3) would stop
> responding to AFS requests properly and within a couple of hours, all
> clients become unable to get files, vos commands stop responding, etc. If
> the machine is rebooted, the problem goes away until the next restart. Just
> restarting openafs-server does not fix the problem, however.
>
> Oddly, when I did a manual "bos restart <server> -all" it didn't reproduce
> the problem. I was thinking that this meant the problem wasn't the bos
> restart at all... but when I changed the day on which the bos restart
> happened, the problem changed days with it.
>
> Sorry for the vagueness, but no one has been online to observe this
> starting, we're just doing forensics on the aftermath.
>
> I'd appreciate any suggestions on why this might be happening and things to
> check.
>
> Thank you,
>
> Chris
> --
> Eric Chris Garrison | Principal Mass Storage Specialist
> [EMAIL PROTECTED] | Indiana University - Research Storage
> _______________________________________________
> OpenAFS-info mailing list
> [email protected]
> https://lists.openafs.org/mailman/listinfo/openafs-info

--
Derrick
