If you stop the source cluster then you can distcp the /hbase to the other cluster. Done. A perfect copy.
That is probably the most efficient/highest performing way. On Tue, Feb 9, 2010 at 6:47 PM, James Baldassari <[email protected]> wrote: > Hi, > > I'm wondering if it's possible to export all data from one HBase cluster > and import it into another. We have a lot of data that we've imported > into our staging HBase environment, and rather than repeating the > lengthy import process in our production environment we would prefer to > just copy all the data directly from HBase/HDFS in staging into > production. Is there an easy way to do this? I know Hadoop has some > distributed copy functionality, but I don't know if this will work with > HBase. The number of region servers and the replication factor will be > the same in the source and destination environments, but the > hostnames/IPs will be different. The production environment is > completely empty right now, so we don't need to worry about overwriting > data. > > I came across these links while searching for information HBase > export/import: > > http://issues.apache.org/jira/browse/HBASE-897 > http://issues.apache.org/jira/browse/HBASE-1684 > http://hadoop.apache.org/hbase/docs/current/api/org/apache/hadoop/hbase/mapreduce/Export.html > > Has anyone used these tools? Is there a better way? > > Thanks, > James > > >
