[ https://issues.apache.org/jira/browse/HDFS-11234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mingliang Liu updated HDFS-11234: --------------------------------- Assignee: Suresh Bahuguna > distcp performance is suboptimal for high bandwidth/high latency setups > ----------------------------------------------------------------------- > > Key: HDFS-11234 > URL: https://issues.apache.org/jira/browse/HDFS-11234 > Project: Hadoop HDFS > Issue Type: Improvement > Components: hdfs > Affects Versions: 2.7.1 > Reporter: Suresh Bahuguna > Assignee: Suresh Bahuguna > > Because distcp uses tcp socket with buffer size set to 128K, for a setup > which has very high bandwidth but also a very high latency, the throughput is > quite poor. This is because tcp stops sending more data till the time it gets > the ACKs. By not setting the socket size and letting linux kernel manage the > socket, we should be able to get optimal performance. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org