Hi,

 

I wanted to do an "rsync --delete" between data in 2 HDFS system
directories. Do we have a utility that could do this?

 

I am aware that HDFS does not allow partial writes. An alternative would
be to write a program to generate the list of differences in paths and
then use distcp to copy the files and delete the appropriate files.

 

Any pointers to implementations (or partial implementations)?

 

Thanks,

Deepika

 

 

 

Reply via email to