Re: btrfs and ECC RAM

Bob Marley Mon, 20 Jan 2014 07:46:11 -0800

On 20/01/2014 15:57, Ian Hinder wrote:

i.e. that there is parity information stored with every piece of data,and ZFS will "correct" errors automatically from the parity information.

So this is not just parity data to check correctness but there are manymore additional bits to actually correct these errors, based on analgorithm like reed-solomon ?


Where can I find information on how much "parity" is stored in ZFS ?

I start to suspect that there is confusion here between checksummingfor data integrity and parity information. If this is really how ZFSworks, then if memory corruption interferes with this process, then Ican see how a scrub could be devastating.

I don't . If you have additional bits to correct errors (other thandetect errors), this will never be worse than having less of them.All algorithms I know of, don't behave any worse if the erroneous bitsare in the checksum part, or if the algorithm is correct+detect insteadof just detect.If the algorithm stores X+2Y extra bits (supposed ZFS case) in order todetect&correct Y erroneous bits and detect additional X erroneous bits,this will not be worse than having just X checksum bits (btrfs case).

So does ZFS really uses detect&correct parity? I'd expect this to bequite a lot computationally expensive

I don't know if ZFS really works like this. It sounds very odd to dothis without an additional checksum check. This sounds very differentto what you say below that btrfs does, which is only to check againstredundantly-stored copies, which I agree sounds much safer. The secondlink above from the ZFS FAQ just says that if you place a very highvalue on data integrity, you should be using ECC memory anyway, whichI'm sure we can all agree with.hxxp://zfsonlinux.org/faq.html#DoIHaveToUseECCMemory:
1.16 Do I have to use ECC memory for ZFS?
Using ECC memory for ZFS is strongly recommended for enterprise environments 
where the strongest data integrity guarantees are required. Without ECC memory 
rare random bit flips caused by cosmic rays or by faulty memory can go 
undetected. If this were to occur ZFS (or any other filesystem) will write the 
damaged data to disk and be unable to automatically detect the corruption.

The above sentence imho means that the data can get corrupted just priorto its first write.This is obviously applicable to every filesystem on earth, without ECC,especially if it happens prior to the computation of the parity.


BM

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: btrfs and ECC RAM

Reply via email to