If you stop the source cluster then you can distcp the /hbase to the
other cluster. Done. A perfect copy.

That is probably the most efficient/highest performing way.

On Tue, Feb 9, 2010 at 6:47 PM, James Baldassari <[email protected]> wrote:
> Hi,
>
> I'm wondering if it's possible to export all data from one HBase cluster
> and import it into another.  We have a lot of data that we've imported
> into our staging HBase environment, and rather than repeating the
> lengthy import process in our production environment we would prefer to
> just copy all the data directly from HBase/HDFS in staging into
> production.  Is there an easy way to do this?  I know Hadoop has some
> distributed copy functionality, but I don't know if this will work with
> HBase.  The number of region servers and the replication factor will be
> the same in the source and destination environments, but the
> hostnames/IPs will be different.  The production environment is
> completely empty right now, so we don't need to worry about overwriting
> data.
>
> I came across these links while searching for information HBase
> export/import:
>
> http://issues.apache.org/jira/browse/HBASE-897
> http://issues.apache.org/jira/browse/HBASE-1684
> http://hadoop.apache.org/hbase/docs/current/api/org/apache/hadoop/hbase/mapreduce/Export.html
>
> Has anyone used these tools?  Is there a better way?
>
> Thanks,
> James
>
>
>

Reply via email to