Re: A couple of questions

Hubert Kario Wed, 02 Jun 2010 09:19:57 -0700

On Monday 31 May 2010 19:59:46 Paul Millar wrote:
> Hi Hubert,
> 
> On Thursday 27 May 2010 16:56:00 Hubert Kario wrote:
> > > Would [obtaining file checksum] be possible (without an awful lot
> > > of work)?
> > 
> > [Calculating checksum in-memory]  won't detect in-memory corruption
> > though, but if you want to be resilant to this, you should be looking at
> > 
> >  ECC RAM as subsequent checks can be affected by it to.
> 
> Certainly ECC RAM will help, but unfortunately it doesn't remove the
> possibility of corruption; for example, CERN found [1] that double-bit
> memory corruptions (which ECC cannot recover from) can still happen.
> 
> [1]
> http://indico.cern.ch/getFile.py/access?contribId=3&sessionId=0&resId=1&mat
> erialId=paper&confId=13797
> 
> Also, IIRC there was a case where Fermilab tracked down a data corruption
> to a faulty PCI bus in the server.  So who knows where are all the places
> corruption could occur?
> 
> I guess the real problem is that, when processing large amounts of data,
> these rare occurrences start to stack up.
>


Yes, but AFAIK btrfs checksums don't have internal checksum (e.g. you can't 
check if the read checksum is a valid one or not, it does not have control 
bits), as such, if you consider PCI bus corruption as likely, you still don't 
get 100% certanity that the data reached the HDD unharmed.

If you need such level of certanity when recording data, I'd consider 
mainframe hardware and/or duplicating whole storage stack.

Cheers,
-- 
Hubert Kario
QBS - Quality Business Software
02-656 Warszawa, ul. Ksawerów 30/85
tel. +48 (22) 646-61-51, 646-74-24
www.qbs.com.pl

System Zarządzania Jakością
zgodny z normą ISO 9001:2000
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: A couple of questions

Reply via email to