Hi,
I wanted to do an "rsync --delete" between data in 2 HDFS system directories. Do we have a utility that could do this? I am aware that HDFS does not allow partial writes. An alternative would be to write a program to generate the list of differences in paths and then use distcp to copy the files and delete the appropriate files. Any pointers to implementations (or partial implementations)? Thanks, Deepika
