Edward, HBase replication is entirely different than HDFS replication - The former being cross-instance replication of HBase-data-only while the latter replicates within nodes available in its own instance, and covers all data generally.
Read more about it here: http://hbase.apache.org/replication.html and decide for yourself if you'd like to use it. (P.s. Please do not cross post to multiple lists. This question fit HBase's lists.) On Wed, Oct 12, 2011 at 3:04 PM, edward choi <[email protected]> wrote: > Hi, > > I am using CDH3U0 Hadoop and Hbase. > > I know that HDFS automatically replicates data to multiple data nodes so > that when a couple of data nodes fail, the user won't lose any data. > I am currently using replication factor of 3. > > Now in Hbase, I know there is some experimental replication service, but I > am not using it. I thought that since HDFS already does replication, Hbase > wouldn't need additional replication. > Am I right on this? > > So my question would be: > Given that I am using HDFS replication and not using Hbase replication, if a > couple of regionservers fail, am I still able to access all the data I have > stored before? > > I would appreciate any help. > Thanks. > > Ed > -- Harsh J
