On 7/9/06 14:37, "Jimi Xenidis" <[EMAIL PROTECTED]> wrote: >> these assumptions are likely not true and the CPU has gone >> down taking some locks with it. > > Hypervisors should increase the availability of the machine as a > whole, PPC machines tend to have many HA features that when unhandled > (mostly ECC) can cause a CPU to go down.
Unhandled errors should make the *machine* fail-stop, not just one CPU. High availability is important, but getting the correct answer from your computations tends to rank higher. If hardware detects an unrecoverable error that cannot be handled sanely in software either, I think most people would agree it's time for a reboot. -- Keir _______________________________________________ Xen-ppc-devel mailing list Xen-ppc-devel@lists.xensource.com http://lists.xensource.com/xen-ppc-devel