Re: Developing Mars lander software

Tolga Cakiroglu Tue, 18 Feb 2014 16:22:25 -0800

On Tuesday, 18 February 2014 at 23:05:21 UTC, Walter Bright wrote:

http://cacm.acm.org/magazines/2014/2/171689-mars-code/fulltext
Some interesting tidbits:
"We later revised it to require that the flight software as awhole, and each module within it, had to reach a minimalassertion density of 2%. There is compelling evidence thathigher assertion densities correlate with lower residual defectdensities."
This has been my experience with asserts, too.
"A failing assertion is now tied in with the fault-protectionsystem and by default places the spacecraft into a predefinedsafe state where the cause of the failure can be diagnosedcarefully before normal operation is resumed."
Nice to see confirmation of that.
"Running the same landing software on two CPUs in paralleloffers little protection against software defects. Twodifferent versions of the entry-descent-and-landing code weretherefore developed, with the version running on the backup CPUa simplified version of the primary version running on the mainCPU. In the case where the main CPU would have unexpectedlyfailed during the landing sequence, the backup CPU wasprogrammed to take control and continue the sequence followingthe simplified procedure."
An example of using dual systems for reliability.

TL;DR the link though, how are they detecting that a CPU fails?An information must be passes outside of CPU to do this. The onlysolution comes to my mind is that main CPU changes a variable onan external memory at every step, and back up CPU checks itcontinuously to catch a failure immediately. But this wouldrequire about 50% of CPU's power already.

While thinking about this kind of back up systems, knowing andreading that some people are really doing is really great.

Re: Developing Mars lander software

Reply via email to