Re: Developing Mars lander software

Tolga Cakiroglu Tue, 18 Feb 2014 21:56:26 -0800

On Wednesday, 19 February 2014 at 01:09:43 UTC, Xinok wrote:

On Wednesday, 19 February 2014 at 00:16:03 UTC, Tolga Cakirogluwrote:
TL;DR the link though, how are they detecting that a CPUfails? An information must be passes outside of CPU to dothis. The only solution comes to my mind is that main CPUchanges a variable on an external memory at every step, andback up CPU checks it continuously to catch a failureimmediately. But this would require about 50% of CPU's poweralready.
While thinking about this kind of back up systems, knowing andreading that some people are really doing is really great.
I'm assuming this has something to do with it:
https://en.wikipedia.org/wiki/Heartbeat_%28computing%29
In clustered servers, the active node sends a continuous signalindicating it's still alive. This signal is referred to as aheartbeat. There's a standby node waiting to take over shouldit stop receiving this signal.

I think only knowing that it has failed is not enough. Becausethe process is landing, and other CPU should know where theprocess is left. With that heatbeat signal, only option is thatall sensor information must be sent both CPUs continuously andsensor values should be enough about what next step to be taken.Then I think it can continue the process flawlessly.

Re: Developing Mars lander software

Reply via email to