Distcp is very slow
-------------------
Key: MAPREDUCE-1231
URL: https://issues.apache.org/jira/browse/MAPREDUCE-1231
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: distcp
Affects Versions: 0.20.1
Reporter: Jothi Padmanabhan
Assignee: Jothi Padmanabhan
Fix For: 0.20.2
Currently distcp does a checksums check in addition to file length check to
decide if a remote file has to be copied. If the number of files is high
(thousands), this checksum check is proving to be fairly costly leading to a
long time before the copy is started.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.