[
https://issues.apache.org/jira/browse/MAPREDUCE-4168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14261232#comment-14261232
]
Allen Wittenauer commented on MAPREDUCE-4168:
---------------------------------------------
bq. I only need hadoop to talk on my secondary interface and not on all
interfaces.
If one only needs to listen on one interface, then only that interface should
be configured in the *-site.xml files.
bq. I noticed that a major part of the traffic goes over the main interface and
not over the secondary interface that I defined in the slaves file.
The slaves files has absolutely zero to do with what the Java interfaces
configure. The etc/hadoop/slaves file is only for using the shell code and
even then only for the ssh bits. (FWIW, I'm working on fixing the
documentation via HADOOP-10908 to help clarify this, since this is a big point
of confusion.) Almost all TCP/IP stacks by default will send traffic outbound
on any network interface where the other side of the pipe is reachable. This
is *usually* a good thing, but there are times where this is problematic. :)
The biggest problem that we have in Hadoop is that we use the server-side
network configuration as the same information we pass to or expect clients to
use for connection information. In multi-home situations, these may not be
(and usually aren't) the same. So if a service is configured to use 0.0.0.0
(to bind to all interfaces), the client can't use that to connect to it since
that doesn't resolve to anything usable.
Karthik's suggestion of using a custom set of *-site.xml files installed on
client nodes to specify that the RM is actually at 10.1.2.3 (or whatever) is a
viable workaround. But it effectively tosses the problem over the wall to "let
ops deal with it." It's friction in the system we really need to fix rather
than passing the buck to the poor schmuck down the line.
There's also the problem of being able to pick which interfaces: if I only want
2 out of 3, there is no way to do it since the network configuration only
supports one entry. So it's all or nothing. The parent of this JIRA is geared
towards fixing these issues. We should probably take a fresh look at it and fix
it in 3.x, since it will likely be an incompatible change to do this correctly.
> Support multiple network interfaces
> -----------------------------------
>
> Key: MAPREDUCE-4168
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4168
> Project: Hadoop Map/Reduce
> Issue Type: New Feature
> Reporter: Tom White
>
> Umbrella jira to track the MapReduce side of HADOOP-8198.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)