On 7/9/06 14:37, "Jimi Xenidis" <[EMAIL PROTECTED]> wrote:
>> these assumptions are likely not true and the CPU has gone
>> down taking some locks with it.
> Hypervisors should increase the availability of the machine as a
> whole, PPC machines tend to have many HA features that when unhandled
> (mostly ECC) can cause a CPU to go down.
Unhandled errors should make the *machine* fail-stop, not just one CPU. High
availability is important, but getting the correct answer from your
computations tends to rank higher. If the hardware detects an error that it
cannot recover from, and that software cannot handle sanely either, I think
most people would agree it's time for a reboot.
Xen-ppc-devel mailing list