Hi. We run with 2-way replication. The wonderful folks at Yahoo! worked through > most of the bugs during 0.19.x IIRC. There was never any bugs with 2-way > replication per-se, but running a cluster with 2 replicas exposed other bugs > at a 100x rate compared to running with 3 replicas (due to the fact that a > silent corruption + loss of a single data node = file loss). > > I'd estimate we lose files at a rate of about 1 per month for 200TB of > actual data. That number would probably go down an order of magnitude or > more if we were running with 3 replicas. > > Hope this helps. > > Thanks for sharing!
So, there is a good reason to believe, that version 0.19 and higher have the file storage / silent corruption issues sorted out? Regards.
