Support efficient Hadoop distcp from external clusters
------------------------------------------------------
Key: WHIRR-81
URL: https://issues.apache.org/jira/browse/WHIRR-81
Project: Whirr
Issue Type: New Feature
Components: service/hadoop
Reporter: Tom White
On EC2 currently all external traffic to a Hadoop cluster is proxied through
the namenode, which make distcp impractical. This JIRA is to explore ways to
improve this operation, possible candidates include a SocketFactory
implementation that is aware of the cloud provider's networking (and can supply
the public addresses appropriately), or a VPN. Ideally this would support
different cloud providers, although it is possible that different providers
need different solutions.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.