Thanks for your prompt reply.

When using the command
"./bin/hadoop distcp hftp://cluster1:50070/path hdfs://cluster2/path"

- Should this command be given in cluster1?
- What does port "50070" specify? Is it the one in "fs.default.name", or
"dfs.http.address"?

/Taeho



On Mon, Feb 2, 2009 at 12:40 PM, Mark Chadwick <mchadw...@invitemedia.com>wrote:

> Taeho,
>
> The distcp command is perfect for this.  If you're copying between two
> clusters running the same version of Hadoop, you can do something like:
>
> ./bin/hadoop distcp hdfs://cluster1/path hdfs://cluster2/path
>
> If you're copying between 0.18 and 0.19, the command will look like:
>
> ./bin/hadoop distcp hftp://cluster1:50070/path hdfs://cluster2/path
>
> Hope that helps,
> -Mark
>
> On Sun, Feb 1, 2009 at 9:48 PM, Taeho Kang <tka...@gmail.com> wrote:
>
> > Dear all,
> >
> > There have been times where I needed to transfer some big data from one
> > version of Hadoop cluster to another.
> > (e.g. from hadoop 0.18 to hadoop 0.19 cluster)
> >
> > Other than copying files from one cluster to a local file system and
> upload
> > it to another,
> > is there a tool that does it?
> >
> > Thanks in advance,
> > Regards,
> >
> > /Taeho
> >
>

Reply via email to