Ayush Saxena created HADOOP-18056:
-------------------------------------

             Summary: DistCp: Filter duplicates in the source paths
                 Key: HADOOP-18056
                 URL: https://issues.apache.org/jira/browse/HADOOP-18056
             Project: Hadoop Common
          Issue Type: Improvement
            Reporter: Ayush Saxena
            Assignee: Ayush Saxena


Add basic filtering to remove exact duplicate paths from the source paths passed for copying.

If the same srcPath, say /tmp/file1, is passed in the list twice, DistCp
fails with DuplicateFileException after building the listing.

It would be better to do a basic filtering of duplicate paths.
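
As a rough illustration of the kind of filtering meant here, the sketch below drops exact duplicates from a list of source paths while keeping the first-seen order. It is not the actual patch: the class SourcePathDedup and the method filterDuplicates are hypothetical names, and plain strings stand in for Hadoop Path objects so the example runs without Hadoop on the classpath.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;

// Hypothetical helper for illustration only; not part of DistCp.
class SourcePathDedup {

    // Removes exact duplicate entries. LinkedHashSet keeps the order in
    // which each path was first seen, so the copy order is unchanged.
    static List<String> filterDuplicates(List<String> srcPaths) {
        return new ArrayList<>(new LinkedHashSet<>(srcPaths));
    }

    public static void main(String[] args) {
        // /tmp/file1 is passed twice, as in the scenario described above.
        List<String> srcPaths =
            Arrays.asList("/tmp/file1", "/tmp/file2", "/tmp/file1");
        System.out.println(filterDuplicates(srcPaths));
        // prints [/tmp/file1, /tmp/file2]
    }
}
```

This only catches exact string matches. Paths that differ textually but resolve to the same file (for example, with and without a trailing slash) would still pass through and are outside the scope of this sketch.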



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org
