Did you consider using HBase Backup/Restore?  We had in the past moved data 
from one cluster to another using this technique.

http://hbase.apache.org/book/ops.backup.html


regards,
Srikanth

-----Original Message-----
From: Manuel de Ferran [mailto:[email protected]]
Sent: Tuesday, April 17, 2012 2:23 PM
To: [email protected]
Subject: Duplicate an HBase cluster

Greetings,

we have a 4 nodes cluster running HBase-0.90.3 over Hadoop-0.20-append.
We'd like to create another HBase cluster from this one with minimal HBase 
downtime. We have plenty of disk space on each datanode.

Here is what we have in mind:
- Add a new datanode aka. DN5
- Raise HDFS replication factor to 5 to have a whole copy on each datanode
- Wait until replication done
 - Disable all tables
- Stop DN5
- Copy Namenode data (dfs/name/current ...) to DN5
- Enable all tables
- Start a new namenode on DN5 aka NN2
- Reconfigure DN5 to point to NN2
- Configure a new HBase cluster on top of the new HDFS

It works on a small cluster but is it enough to have a consistent copy ?

Any hints ? Is there a best-practice ?

Thanks

________________________________

http://www.mindtree.com/email/disclaimer.html

Reply via email to