Re: [OpenAFS] Server crash

Rob Banz Fri, 07 Dec 2007 11:10:28 -0800



Look at the FileLog and see what failed to attach.

This is one reason I dislike that optimization.

For the most part, its been a win for me. With a decent filesystem onthe back-end, I haven't had volume attachment problems running with afast-restart fileserver. I'd say if I had seen an issue where I didhave a multitude of volumes that needed salvaging, its not too hard toeither write a little script to troll your FileLog and run salvager onthe appropriate volumes -- or stop the fileserver and salvage thewhole partition.

In the environment I was responsible for, the only time I was havingto implement drastic measures (kill -9'ing the fileserver) was in theinstance of those dreaded clogged RX calls due to (usually) connectiontable lockups -- and I never had a problem with using the fast-restartfileserver, and it brought us back into service in a few minutesrather than the hour+ that a salvage would cause. Even in the coupleinstances where we did have storage go offline, at least since we usedZFS, everything would come up fine in the fast-restart environment...I think your success or failure with it is very dependent on thebehavior of your backing filesystem and how it orders transactions...


-rob
_______________________________________________
OpenAFS-info mailing list
[email protected]
https://lists.openafs.org/mailman/listinfo/openafs-info

Re: [OpenAFS] Server crash

Reply via email to