Re: FreeBSD 6.x CVSUP today crashes with zero load ...

Dmitry Pryanishnikov Mon, 26 Jun 2006 16:45:04 -0700

On Tue, 27 Jun 2006, M.Hirsch wrote:

If you're using hardware w/o ECC, it just can't tell whether an error is
present or absent. So ECC _is_ the way to detect (not mask) broken hardware.

Ok, thanks. I think I understand the meaning of ECC now.

So, unlike my supplier claims, ECC is not supposed to help against hardware
failures.

But it is the way to detect them, right?


 ECC stands for Error Checking and Correction. It's a hardware feature,
and its primary task is Checking (that is, detection) of errors. It just
happens that the number of additional bits which carry the checking code is
sufficient to correct _any_ _single-bit_ data error (not mask it, but really
correct), and to detect any double-bit and most multi-bit errors (w/o
correction).
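
The correct-one/detect-two behaviour described above can be illustrated with
a toy code. The sketch below is an assumption-laden example, not what any
particular chipset implements: it uses an extended Hamming(8,4) code (4 data
bits, 3 check bits, 1 overall parity bit), and the function names encode()
and decode() are made up for illustration. Real ECC memory typically
protects wider words, but the principle is the same.

```python
def encode(nibble):
    """Encode 4 data bits as an 8-bit extended Hamming codeword (list of bits).

    Index 0 holds the overall parity bit; indices 1..7 are the classic
    Hamming(7,4) positions, with check bits at 1, 2, 4 and data at 3, 5, 6, 7.
    """
    code = [0] * 8
    code[3] = (nibble >> 0) & 1
    code[5] = (nibble >> 1) & 1
    code[6] = (nibble >> 2) & 1
    code[7] = (nibble >> 3) & 1
    code[1] = code[3] ^ code[5] ^ code[7]   # covers positions 1, 3, 5, 7
    code[2] = code[3] ^ code[6] ^ code[7]   # covers positions 2, 3, 6, 7
    code[4] = code[5] ^ code[6] ^ code[7]   # covers positions 4, 5, 6, 7
    code[0] = sum(code[1:]) % 2             # makes total parity even
    return code


def decode(code):
    """Return (status, nibble); status is 'ok', 'corrected' or 'uncorrectable'."""
    code = list(code)
    # The syndrome is the XOR of the positions of all set bits in 1..7.
    # It is 0 for a valid codeword and equals the position of a single flip.
    syndrome = 0
    for pos in range(1, 8):
        if code[pos]:
            syndrome ^= pos
    overall = sum(code) % 2  # 0 if total parity is still even

    if syndrome == 0 and overall == 0:
        status = "ok"
    elif overall == 1:
        # Odd parity: treat as a single-bit error and really correct it.
        # syndrome == 0 here means the overall parity bit itself flipped.
        code[syndrome] ^= 1
        status = "corrected"
    else:
        # Non-zero syndrome with even parity: two bits flipped.
        # The error is detected, but the data cannot be recovered.
        return "uncorrectable", None

    nibble = code[3] | (code[5] << 1) | (code[6] << 2) | (code[7] << 3)
    return status, nibble


if __name__ == "__main__":
    word = encode(0b1011)
    word[5] ^= 1                 # simulate a single-bit memory error
    print(decode(word))          # data comes back intact, flagged as corrected
    word = encode(0b1011)
    word[5] ^= 1
    word[6] ^= 1                 # simulate a double-bit error
    print(decode(word))          # flagged as uncorrectable, no data returned
```

Note that decode() reports a corrected error instead of hiding it, which is
the "detect, not mask" point: the caller gets valid data and also learns
that the hardware misbehaved. In this small code, three or more flipped bits
can be miscorrected, so only single- and double-bit errors are handled
reliably.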

Intel's ECC-capable chipset allows it. But if we're speaking about a
production environment, such behaviour (abnormal termination on _corrected_
error) is unacceptable.

"abnormal termination" is not only acceptable for me, it is what I am looking
for. Make the node crash completely, so one of the others can take over its
task(s).

Again, when single-bit correction has happened, it's not fake, the result is
actually correct. Why panic the machine immediately if all data is OK?


Sincerely, Dmitry
--
Atlantis ISP, System Administrator
e-mail:  [EMAIL PROTECTED]
nic-hdl: LYNX-RIPE
_______________________________________________
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"
