[dwm] Re: Crash-only software

Marcin Cieslak Tue, 03 Feb 2009 13:34:01 -0800

markus schnalke wrote:

This is just a thought, because I stumbled upon the concept and think
it's quite an interesting approach.


See: http://en.wikipedia.org/wiki/Crash-only_software

I don't like this approach. I have always preferred software that "fails fast". As soon as something is wrong - just abort with debugging information about what went wrong.
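
To make "fail fast" concrete, here is a minimal sketch in Python. The names check, InvariantError and resize are made up for illustration and come neither from dwm nor from the paper; the only point is that the program stops at the first violated assumption and reports the values involved.

```python
import sys
import traceback


class InvariantError(Exception):
    """Raised when internal state violates an expected invariant."""


def check(condition, message, **context):
    # Fail fast: stop at the first sign of trouble and keep the
    # offending values in the error message for debugging.
    if not condition:
        details = ", ".join(f"{k}={v!r}" for k, v in sorted(context.items()))
        raise InvariantError(f"{message} ({details})" if details else message)


def resize(width, height):
    # Hypothetical operation that refuses to continue with bad input.
    check(width > 0 and height > 0, "window size must be positive",
          width=width, height=height)
    return width * height


def main(argv):
    try:
        resize(int(argv[1]), int(argv[2]))
    except InvariantError:
        # Abort with a full traceback instead of limping on.
        traceback.print_exc(file=sys.stderr)
        return 1
    return 0
```

Calling main(["prog", "0", "5"]) returns a non-zero exit status and prints the traceback with the rejected width and height.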

I see some issues with the approach described in the paper. It assumes that the state saved is okay - I think that crashes occur _because_ internal state is inconsistent or wrong. Sure, you can dump internal state regularly for recovery - but it's like with backups - you never know which one is really clean and okay until you try to restore.
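
A rough sketch of the backup problem, in Python and under invented assumptions: the state is a dict with a hypothetical "clients" list and "focused" index, and the snapshot carries a SHA-256 checksum. The checksum only proves that the bytes are the ones that were written; a state that was already wrong when it was dumped passes that check and is only caught if the restore also validates invariants.

```python
import hashlib
import json


def dump_state(state):
    # Serialize the state and attach a checksum of the serialized bytes.
    payload = json.dumps(state, sort_keys=True)
    digest = hashlib.sha256(payload.encode("utf-8")).hexdigest()
    return json.dumps({"sha256": digest, "payload": payload})


def validate(state):
    # Hypothetical invariant: the focused index must point at a client.
    clients = state.get("clients")
    focused = state.get("focused")
    return (isinstance(clients, list) and isinstance(focused, int)
            and 0 <= focused < len(clients))


def restore_state(blob):
    record = json.loads(blob)
    payload = record["payload"]
    # Step 1: detect damage that happened after the dump.
    if hashlib.sha256(payload.encode("utf-8")).hexdigest() != record["sha256"]:
        raise ValueError("snapshot corrupted after it was written")
    state = json.loads(payload)
    # Step 2: detect state that was already wrong when it was dumped.
    if not validate(state):
        raise ValueError("snapshot intact but state is inconsistent")
    return state
```

Even with both steps, validate can only check the invariants somebody thought to write down, so a restore that succeeds is still not proof that the snapshot was clean.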

Software bugs will sometimes create incorrect data. This may go unnoticed for quite some time.

I think that the authors unnecessarily assume that software components are "black boxes" that need to be kept up at all costs. This is not the right approach for availability, I think. Most issues will occur when the component is upgraded and needs to use/migrate old data, or sometimes to cooperate with components that are not yet upgraded. If something goes wrong, the rollback becomes an issue too - if I have a new, badly-behaving component that dumped its state in a new format, how do I go back?
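
The rollback question can be sketched like this, again in Python and with invented names (format_version, load_versioned): if every dump carries a format version, the old component can at least refuse a state file written by the newer one instead of misreading it. It does not answer how to go back; it only makes the failure visible.

```python
import json

# Versions this (older) build knows how to read; an assumption for the sketch.
SUPPORTED_VERSIONS = frozenset({1})


def dump_versioned(state, version):
    # Every dump records which format it was written in.
    return json.dumps({"format_version": version, "state": state})


def load_versioned(blob, supported=SUPPORTED_VERSIONS):
    record = json.loads(blob)
    version = record.get("format_version")
    if version not in supported:
        # Refuse rather than guess at a format this build has never seen.
        raise RuntimeError(
            f"state format {version!r} is not supported by this build")
    return record["state"]
```

After a rollback, load_versioned(dump_versioned(state, 2)) raises in the old build, so the state written by the new component has to be converted or discarded by hand.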


Sweeping problems under the carpet is not going to help much...

--Marcin
