Virajith Jalaparti created HADOOP-15292:
-------------------------------------------
Summary: Distcp's use of pread is slowing it down.
Key: HADOOP-15292
URL: https://issues.apache.org/jira/browse/HADOOP-15292
Project: Hadoop Common
Issue Type: Bug
Reporter: Virajith Jalaparti
Distcp currently uses positioned reads (in RetriableFileCopyCommand#copyBytes)
when the source offset is > 0. This results in unnecessary overhead: a new
BlockReader is created on the client side, and multiple readBlock() calls are
made to the Datanodes, each of which requires the creation of a BlockSender and
an input stream to the ReplicaInfo.
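
The difference between the two access patterns can be sketched as follows. This
is only an illustration on a local file using java.nio's FileChannel as a
stand-in; it is not the Distcp code, and the class and method names
(ReadPatterns, copyWithPositionedReads, copyWithSeekThenSequentialReads) are
hypothetical. On HDFS the analogous choice would presumably be between the
positioned read on FSDataInputStream and a single seek() followed by ordinary
sequential reads; the per-call BlockReader/BlockSender cost described above does
not show up for a local file.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical illustration class; not part of Hadoop or Distcp.
public class ReadPatterns {

    // Pattern 1: positioned reads. Every call passes its own absolute offset,
    // so the stream's current position is never used or advanced.
    static byte[] copyWithPositionedReads(Path src, long offset, int bufSize)
            throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (FileChannel ch = FileChannel.open(src, StandardOpenOption.READ)) {
            ByteBuffer buf = ByteBuffer.allocate(bufSize);
            long pos = offset;
            int n;
            while ((n = ch.read(buf, pos)) > 0) {
                out.write(buf.array(), 0, n);
                pos += n;      // caller tracks the offset itself
                buf.clear();
            }
        }
        return out.toByteArray();
    }

    // Pattern 2: seek once to the starting offset, then read sequentially.
    // The stream keeps track of the position between calls.
    static byte[] copyWithSeekThenSequentialReads(Path src, long offset, int bufSize)
            throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (FileChannel ch = FileChannel.open(src, StandardOpenOption.READ)) {
            ch.position(offset);   // single seek
            ByteBuffer buf = ByteBuffer.allocate(bufSize);
            int n;
            while ((n = ch.read(buf)) > 0) {
                out.write(buf.array(), 0, n);
                buf.clear();
            }
        }
        return out.toByteArray();
    }
}
```

Both methods return the same bytes for the same offset; the point of the issue
is that, on HDFS, the first pattern pays a setup cost per call while the second
pays it once.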