[ 
https://issues.apache.org/jira/browse/HADOOP-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12624378#action_12624378
 ] 

Allen Wittenauer commented on HADOOP-3988:
------------------------------------------

We recently tried a "new kind" of fail over in our environment.  Rather than 
having a static IP for the name node, we attempted to use DNS CNAMEs to move 
the name node from one node to another.  We discovered that the data nodes 
continually attempted to contact the old machine even though DNS pointed to the 
new machine.

Since we configure a host name in hadoop, I would expect that the data nodes 
would at some point drop their cache of the IP and re-resolve.  However, this 
never happened.

I'd like to see either an option or just the default to be when a name is given 
in a configuration file, Hadoop always does a host name resolution on that 
entry prior to connection.  The operating system should be able to handle the 
job of caching any addresses that need to be cached, either through a mechanism 
like nscd or through a fully-blooded, local DNS cache.

> The elephant should remember names, not numbers.
> ------------------------------------------------
>
>                 Key: HADOOP-3988
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3988
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs
>    Affects Versions: 0.17.2
>            Reporter: Allen Wittenauer
>
> The name node and the data node should not cache the resolution of host 
> names, as doing so prevents the use of DNS CNAMEs for any sort of fail over 
> capability.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to