[
https://issues.apache.org/jira/browse/HADOOP-14854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16548264#comment-16548264
]
Tsz Wo Nicholas Sze commented on HADOOP-14854:
----------------------------------------------
The shouldCopy(..) should be invoked before sourceFS.getFileStatus(..) so that
it can avoid the RPC.
> DistCp should not issue file status calls for files in the filter list
> ----------------------------------------------------------------------
>
> Key: HADOOP-14854
> URL: https://issues.apache.org/jira/browse/HADOOP-14854
> Project: Hadoop Common
> Issue Type: Improvement
> Components: tools/distcp
> Affects Versions: 2.8.1
> Reporter: Mukul Kumar Singh
> Assignee: Mukul Kumar Singh
> Priority: Minor
> Attachments: HADOOP-14854.001.patch, HADOOP-14854.002.patch
>
>
> DistCp currently excludes the files in the filter list only when the files
> are added to the copy list.
> However distcp can be optimized by not issuing file status/get attr calls for
> the files in the filter.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]