Richard Pieri wrote: > Edward Ned Harvey (blu) wrote: >> An active system will notice mysqld died, recognize that it's not >> supposed to do that right now, and restart it. > > Which is a stupid way to run in production. There's a reason why the > daemon died. That reason needs to be identified so that corrective steps > can be taken. Blind restarts can obfuscate this information, can cause > damage to data, and can exacerbate existing damage.
Not to say your points are invalid, but Netflix would disagree with you. They created a testing tool that intentionally kills random services on their production systems just to test that automated recovery works correctly. -Tom -- Tom Metro The Perl Shop, Newton, MA, USA "Predictable On-demand Perl Consulting." http://www.theperlshop.com/ _______________________________________________ Discuss mailing list Discuss@blu.org http://lists.blu.org/mailman/listinfo/discuss