[
https://issues.apache.org/jira/browse/HDFS-14788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mukund Thakur updated HDFS-14788:
---------------------------------
Description: There is a feature in Distcp where we can ignore specific
files to get copied to the destination. This is currently based on a filter
regex which is read from a specific file. The process of creating different
regex file for different distcp jobs seems like a tedious task. What we are
proposing is to expose a regex_filter parameter which can be set during Distcp
job creation and use this filter in a new implementation CopyFilter class.
> Use dynamic regex filter to ignore copy of source files in Distcp
> -----------------------------------------------------------------
>
> Key: HDFS-14788
> URL: https://issues.apache.org/jira/browse/HDFS-14788
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: distcp
> Reporter: Mukund Thakur
> Assignee: Mukund Thakur
> Priority: Major
>
> There is a feature in Distcp where we can ignore specific files to get copied
> to the destination. This is currently based on a filter regex which is read
> from a specific file. The process of creating different regex file for
> different distcp jobs seems like a tedious task. What we are proposing is to
> expose a regex_filter parameter which can be set during Distcp job creation
> and use this filter in a new implementation CopyFilter class.
--
This message was sent by Atlassian Jira
(v8.3.2#803003)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]