archive does not work with distcp -update -----------------------------------------
Key: MAPREDUCE-1431 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1431 Project: Hadoop Map/Reduce Issue Type: Bug Components: harchive Reporter: Tsz Wo (Nicholas), SZE Assignee: Mahadev konar The following distcp command works. {noformat} hadoop distcp -Dmapred.job.queue.name=q har://hdfs-nn_hostname:8020/user/tsz/t101.har/t101 t101_distcp {noformat} However, it does not work for -update. {noformat} -bash-3.1$ hadoop distcp -Dmapred.job.queue.name=q -update har://hdfs-nn_hostname:8020/user/tsz/t101.har/t101 t101_distcp 10/01/29 20:06:53 INFO tools.DistCp: srcPaths=[har://hdfs-nn_hostname:8020/user/tsz/t101.har/t101] 10/01/29 20:06:53 INFO tools.DistCp: destPath=t101 java.lang.IllegalArgumentException: Wrong FS: har://hdfs-nn_hostname:8020/user/tsz/t101.har/t101/text-00000000, expected: hdfs://nn_hostname at org.apache.hadoop.fs.FileSystem.checkPath(FileSystem.java:310) at org.apache.hadoop.hdfs.DistributedFileSystem.checkPath(DistributedFileSystem.java:99) at org.apache.hadoop.hdfs.DistributedFileSystem.getPathName(DistributedFileSystem.java:155) at org.apache.hadoop.hdfs.DistributedFileSystem.getFileChecksum(DistributedFileSystem.java:463) at org.apache.hadoop.hdfs.DistributedFileSystem.getFileChecksum(DistributedFileSystem.java:46) at org.apache.hadoop.fs.FilterFileSystem.getFileChecksum(FilterFileSystem.java:250) at org.apache.hadoop.tools.DistCp.sameFile(DistCp.java:1204) at org.apache.hadoop.tools.DistCp.setup(DistCp.java:1084) ... {noformat} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.