Re: [caiman-discuss] Install engine code review

Keith Mitchell Fri, 24 Sep 2010 09:25:53 -0700

Multiprocessing uses a subprocess / fork / exec / wait frameworkwhich gives us 100% control over the process running. It would alsoallow the engine cancel checkpoints instead of relying on thecheckpoints themselves to implement a cancel method.
That's one item we need to discuss and get other people's opinion.With the current model of having the enginesuggest the checkpoint should quit and trust that the checkpointswill behave and quit at it's earliest convinience,the engine really has no control over whether the checkpoint. Thegood thing about doing it this way is that the checkpointcan come to a "good" stopping point and quit. The disadvantage is ofcourse that the engine has no control
over what a checkpoint does.
With the MP module, and the engine controlling the checkpoint, oneapproach could be to roll back to the last successful dataset snapshotand end execution there. I'm also not suggesting removing theability of the individual checkpoints to control execution. MP hassimilar IPC controls that can be used to tell the engine, "Hey!Something's wrong! Can you stop?"

We could certainly have separate cancel() and kill() methods if we usedMP, which would allow for both, or add a timeout to cancel() which, ifthe checkpoint didn't cease by that time, the engine could forcibly shutit down. I definitely think the MP module is worth exploring. I thinkthe only potential for hangup is finding out how easy it is tocoordinate DOC updates across the separate processes, but that seemslike it would be surmountable.


- Keith
_______________________________________________
caiman-discuss mailing list
caiman-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/caiman-discuss

Re: [caiman-discuss] Install engine code review

Reply via email to