> From: [email protected] [mailto:discuss- > [email protected]] On Behalf Of Tom Metro > > Richard Pieri wrote: > > Edward Ned Harvey (blu) wrote: > >> An active system will notice mysqld died, recognize that it's not > >> supposed to do that right now, and restart it. > > > > Which is a stupid way to run in production. There's a reason why the > > daemon died. That reason needs to be identified so that corrective steps > > can be taken. Blind restarts can obfuscate this information, can cause > > damage to data, and can exacerbate existing damage. > > Not to say your points are invalid, but Netflix would disagree with you. > They created a testing tool that intentionally kills random services on > their production systems just to test that automated recovery works > correctly.
I would rather receive notification that a production service was *restarted* rather than *is down* Richard wants to say that's stupid. I not only disagree, I think Richard's position is insulting and ignorantly one-sided. _______________________________________________ Discuss mailing list [email protected] http://lists.blu.org/mailman/listinfo/discuss
