[
https://issues.apache.org/jira/browse/HADOOP-1540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rich Haase updated HADOOP-1540:
-------------------------------
Attachment: HADOOP-1540.006.patch
Refactored the patch to do exclusion filtering while building the CopyListing.
It turns out there is a method (SimpleCopyListing#shouldCopy) which always
returns true. I've added a couple of basic classes to perform the default
(always true) behavior and a SimpleCopyFilter class, which uses a string
compare to determine what should be excluded from the copy. I think this
design will be a bit more flexible in future, and it avoids having mappers
which get a chunk of files to copy that should all be excluded.
> distcp should support an exclude list
> -------------------------------------
>
> Key: HADOOP-1540
> URL: https://issues.apache.org/jira/browse/HADOOP-1540
> Project: Hadoop Common
> Issue Type: Improvement
> Components: util
> Affects Versions: 2.6.0
> Reporter: Senthil Subramanian
> Assignee: Rich Haase
> Priority: Minor
> Labels: patch
> Attachments: HADOOP-1540.003.patch, HADOOP-1540.004.patch,
> HADOOP-1540.005.patch, HADOOP-1540.006.patch
>
>
> There should be a way to ignore specific paths (eg: those that have already
> been copied over under the current srcPath).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)