[ https://issues.apache.org/jira/browse/HDFS-14788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mukund Thakur updated HDFS-14788:
---------------------------------
    Description: DistCp can exclude specific files from being copied to the 
destination. The exclusion is currently based on a filter regex that is read 
from a specific file. Creating a separate regex file for each DistCp job is 
tedious. We propose exposing a regex_filter parameter that can be set when the 
DistCp job is created, and applying that filter in a new CopyFilter 
implementation class. 

> Use dynamic regex filter to ignore copy of source files in Distcp
> -----------------------------------------------------------------
>
>                 Key: HDFS-14788
>                 URL: https://issues.apache.org/jira/browse/HDFS-14788
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: distcp
>            Reporter: Mukund Thakur
>            Assignee: Mukund Thakur
>            Priority: Major
>
> DistCp can exclude specific files from being copied to the destination. The 
> exclusion is currently based on a filter regex that is read from a specific 
> file. Creating a separate regex file for each DistCp job is tedious. We 
> propose exposing a regex_filter parameter that can be set when the DistCp 
> job is created, and applying that filter in a new CopyFilter implementation 
> class. 
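
To make the proposal concrete, here is a minimal, standalone sketch of the idea: the exclusion regex arrives as a plain string parameter instead of being loaded from a file. It deliberately does not use the Hadoop classes. The class name RegexParameterFilter, the String-based shouldCopy method, and the full-path matching semantics are all assumptions made for illustration, not the API of the eventual patch.

```java
import java.util.regex.Pattern;

/**
 * Hypothetical sketch (not the Hadoop CopyFilter API): a copy filter whose
 * exclusion regex is passed in as a job parameter rather than read from a
 * filters file.
 */
final class RegexParameterFilter {
    // Compiled exclusion pattern; null means no filter was configured.
    private final Pattern excludePattern;

    /**
     * @param regexFilter value of the assumed regex_filter parameter;
     *                    null or empty disables filtering
     */
    RegexParameterFilter(String regexFilter) {
        this.excludePattern = (regexFilter == null || regexFilter.isEmpty())
                ? null
                : Pattern.compile(regexFilter);
    }

    /** Returns true when the path should be copied, false when it is excluded. */
    boolean shouldCopy(String path) {
        if (excludePattern == null) {
            return true;
        }
        // matches() requires the entire path to match the regex (an assumed choice).
        return !excludePattern.matcher(path).matches();
    }
}
```

With this sketch, new RegexParameterFilter(".*\\.tmp") would skip /data/part-0000.tmp and copy /data/part-0000, and each job can supply its own regex without a separate filter file.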



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

