> But ExportSnapshot supports exporting to a remote target cluster. Give the
> full hdfs url.

Yes, I've tried it using hdfs://host:port/path, and it succeeded. But the problem is that, as I understand it, this is not secure. Or am I wrong?

> You can also specify the remote target with a httpfs or webhdfs url, which
> then you can leverage SSL on the transport.

Could you please provide more information regarding this point? Thank you.

> You also can copy to local cluster and use DistCp to copy to remote cluster.

The only solution I can imagine for securely transferring a snapshot from cluster1 in datacenter1 to cluster2 in datacenter2 is to create the snapshot in local storage and then copy it using DistCp. Am I right?

> On 12 Mar 2015, at 21:39, Jerry He <[email protected]> wrote:
>
> ExportSnapshot does not use DistCp but directly use FileSystem API to copy,
> as Vladimir mentioned.
> But ExportSnapshot supports exporting to a remote target cluster. Give the
> full hdfs url.
> You can also specify the remote target with a httpfs or webhdfs url, which
> then you can leverage SSL on the transport.
>
> You also can copy to local cluster and use DistCp to copy to remote cluster.
>
> Jerry
>
> On Thu, Mar 12, 2015 at 12:28 PM, Vladimir Rodionov <[email protected]>
> wrote:
>
>> No, ExportSnapshot does not use DistCp it runs its own M/R job to copy data
>> over to a new destination.
>>
>> In a map task it uses HDFS API to create/write data to a new destination.
>> Therefore, the easiest way to secure communication
>> during this operation is to use secure HDFS transport.
>>
>> http://www.cloudera.com/content/cloudera/en/documentation/cdh4/v4-3-1/CDH4-Security-Guide/cdh4sg_topic_14_2.html
>>
>> but there is caveat ...
>>
>> ExportSnapshot does not support external cluster configuration - you can't
>> provide path to external cluster config dir. This seems like a good feature
>> request.
>>
>> -Vlad
>>
>> On Thu, Mar 12, 2015 at 10:38 AM, Akmal Abbasov <[email protected]>
>> wrote:
>>
>>> Hi, I am new to Hadoop Hbase. I have a Hbase cluster in one datacenter,
>>> and I need to create a backup in the second one. Currently the second
>>> HBase cluster is ready, and I would like to import data from first
>>> cluster.
>>> I would like to use exportSnapshot tool for this, I've tried it on my
>>> test environment, and it worked well.
>>> But, since now I am going to export to a different cluster in different
>>> datacenter, I would like to be sure that my data is secure. So how I can
>>> make exportSnapshot secure?
>>> As far as I understood exportSnapshot uses distcp tool to copy snapshot
>>> to destination cluster, so in this case is it enough to configure distcp?
>>> Thank you!
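
For reference, the remote export discussed in this thread can be sketched as a small Python wrapper around the ExportSnapshot command line. The -snapshot, -copy-to and -mappers options belong to ExportSnapshot itself; the snapshot name, host, port and path below are hypothetical placeholders. Using the swebhdfs:// scheme (WebHDFS over SSL) is an assumption that only holds if the target cluster has HTTPS enabled for WebHDFS, so treat this as a sketch and not as a tested procedure.

```python
import shlex
import subprocess


def build_export_command(snapshot, target_url, mappers=4):
    """Build the argument list for HBase's ExportSnapshot tool.

    snapshot   -- name of an existing snapshot on the source cluster
    target_url -- full URL of the destination HBase root directory
    mappers    -- number of map tasks used for the copy
    """
    # ExportSnapshot needs a full URL (scheme://host:port/path) to reach
    # a remote cluster; a bare path would be resolved on the local one.
    if "://" not in target_url:
        raise ValueError("target_url must be a full URL such as hdfs://host:port/path")
    return [
        "hbase", "org.apache.hadoop.hbase.snapshot.ExportSnapshot",
        "-snapshot", snapshot,
        "-copy-to", target_url,
        "-mappers", str(mappers),
    ]


def run_export(snapshot, target_url, mappers=4):
    """Run the export on a machine where the `hbase` command is available."""
    subprocess.run(build_export_command(snapshot, target_url, mappers), check=True)


if __name__ == "__main__":
    # Hypothetical values: replace the snapshot name, host, port and path.
    # swebhdfs:// assumes the remote NameNode serves WebHDFS over HTTPS.
    cmd = build_export_command(
        "my_snapshot",
        "swebhdfs://namenode.dc2.example.com:50470/hbase",
        mappers=8,
    )
    # Only print the command here; call run_export() to actually execute it.
    print(shlex.join(cmd))
```

Swapping the URL for hdfs://host:port/path gives the plain export that already worked in the test environment; only the scheme and port change between the two variants.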
