[ https://issues.apache.org/jira/browse/HDFS-11234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mingliang Liu updated HDFS-11234:
---------------------------------
    Assignee: Suresh Bahuguna

> distcp performance is suboptimal for high bandwidth/high latency setups
> -----------------------------------------------------------------------
>
>                 Key: HDFS-11234
>                 URL: https://issues.apache.org/jira/browse/HDFS-11234
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: hdfs
>    Affects Versions: 2.7.1
>            Reporter: Suresh Bahuguna
>            Assignee: Suresh Bahuguna
>
> distcp uses a TCP socket whose buffer size is explicitly set to 128K. On a 
> setup with very high bandwidth but also very high latency, throughput is 
> therefore quite poor: TCP can keep at most one buffer's worth of 
> unacknowledged data in flight, so it stops sending until the ACKs arrive. 
> By not setting the socket buffer size and letting the Linux kernel manage 
> the socket, we should be able to get optimal performance.
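
The effect described in the issue can be estimated with simple arithmetic: a single TCP connection cannot move more than one window of data per round trip. The sketch below is only an illustration of that bound, not code from distcp or HDFS. The 128K buffer comes from the issue; the 100 ms round-trip time and the 10 Gbit/s link are assumed values, and both function names are hypothetical. Under those assumptions the cap works out to about 1.3 MB/s, while filling the link would need roughly 125 MB in flight.

```python
def max_throughput_bytes_per_sec(window_bytes, rtt_ms):
    """Upper bound for one TCP connection: at most one window per round trip."""
    return window_bytes * 1000 / rtt_ms


def bandwidth_delay_product_bytes(link_bits_per_sec, rtt_ms):
    """Bytes that must be in flight to keep the link fully utilized."""
    return link_bits_per_sec * rtt_ms / 8 / 1000


WINDOW_BYTES = 128 * 1024             # 128K socket buffer, from the issue
RTT_MS = 100                          # assumption: 100 ms round-trip time
LINK_BITS_PER_SEC = 10_000_000_000    # assumption: 10 Gbit/s link

cap = max_throughput_bytes_per_sec(WINDOW_BYTES, RTT_MS)
bdp = bandwidth_delay_product_bytes(LINK_BITS_PER_SEC, RTT_MS)
fraction = cap / (LINK_BITS_PER_SEC / 8)   # share of link capacity one stream can use

print(f"throughput cap with fixed buffer: {cap / 1e6:.2f} MB/s")
print(f"bandwidth-delay product:          {bdp / 1e6:.0f} MB")
print(f"link utilization per connection:  {fraction:.4%}")
```

This is an upper bound; the usable window may be smaller than the configured buffer. It also suggests why leaving the buffer unset might help: on Linux, explicitly setting SO_RCVBUF disables receive-buffer auto-tuning for that socket, so a fixed 128K value keeps the window from growing toward the bandwidth-delay product.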


