Backport HADOOP-5839 to 0.20-security - fixes to ec2 scripts to allow remote
job submission
-------------------------------------------------------------------------------------------
Key: HADOOP-7809
URL: https://issues.apache.org/jira/browse/HADOOP-7809
Project: Hadoop Common
Issue Type: Improvement
Components: contrib/cloud
Reporter: Joydeep Sen Sarma
Assignee: Joydeep Sen Sarma
Fix For: 0.21.0
Attachments: 5839.1.patch, hadoop-5839.2.patch
i would very much like the option of submitting jobs from a workstation outside
ec2 to a hadoop cluster in ec2. This has been explored here:
http://www.nabble.com/public-IP-for-datanode-on-EC2-tt19336240.html
the net result of this is that we can make this work (along with using a socks
proxy) with a couple of changes in the ec2 scripts:
a) use public 'hostname' for fs.default.name setting (instead of the private
hostname being used currently)
b) mark hadoop.rpc.socket.factory.class.default as final variable in the
generated hadoop-site.xml (that applies to server side)
#a has no downside as far as i can tell since public hostnames resolve to
internal/private IP addresses within ec2 (so traffic is optimally routed).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira