On Tuesday, August 09, 2005 10:58:22 AM -0500 Rich Sudlow <[EMAIL PROTECTED]> wrote:
We've been having problems with our cell for the last couple years with AFS clients after fileservers are taken out of service. Before that things seemed to work ok when doing fileserver moves and rebuilding. All data was moved off the fileserver but the clients still seem to have some need to talk to it. In the past the AFS admins have left the fileservers up and empty for a number of days to try to resolve this issue - but it doesn't resolve the issue.
That's because there is no "issue" here. What you've just described is the result of the cache manager's normal checkservers loop, in which it pings _every server it has ever had to talk to_ every 5 minutes or so, to see if it is still up (or down, as the case may be). This is also why 'fs checkservers' is reporting the server down -- it reports on every server that client has contacted since startup.
This behavior is normal and is unrelated to the problem you were actually seeing, which was apparently about an unexpectedly missing rep site. The 'fs checkv' that Kim Kimball suggested was presumably effective because your cache manager picked a different site next time around.
I'd get that release problem fixed, and see if that doesn't make most of your troubles go away. Under normal conditions, it should be sufficient to leave an emptied fileserver up for two hours after the last volume is moved off.
-- Jeffrey T. Hutzelman (N3NHS) <[EMAIL PROTECTED]> Sr. Research Systems Programmer School of Computer Science - Research Computing Facility Carnegie Mellon University - Pittsburgh, PA _______________________________________________ OpenAFS-info mailing list [email protected] https://lists.openafs.org/mailman/listinfo/openafs-info
