[ 
https://issues.apache.org/jira/browse/HADOOP-14854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16548264#comment-16548264
 ] 

Tsz Wo Nicholas Sze commented on HADOOP-14854:
----------------------------------------------

The shouldCopy(..) should be invoked before sourceFS.getFileStatus(..) so that 
it can avoid the RPC.

> DistCp should not issue file status calls for files in the filter list
> ----------------------------------------------------------------------
>
>                 Key: HADOOP-14854
>                 URL: https://issues.apache.org/jira/browse/HADOOP-14854
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: tools/distcp
>    Affects Versions: 2.8.1
>            Reporter: Mukul Kumar Singh
>            Assignee: Mukul Kumar Singh
>            Priority: Minor
>         Attachments: HADOOP-14854.001.patch, HADOOP-14854.002.patch
>
>
> DistCp currently excludes the files in the filter list only when the files 
> are added to the copy list.
> However distcp can be optimized by not issuing file status/get attr calls for 
> the files in the filter.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to