Dear Wiki user, You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.
The "HadoopIPv6" page has been changed by SteveLoughran. The comment on this change is: More on IPv6. http://wiki.apache.org/hadoop/HadoopIPv6?action=diff&rev1=1&rev2=2 -------------------------------------------------- - Some general info on how to avoid Hadoop problems in IPv6-enabled servers + = Hadoop and IPv6 = + Apache Hadoop does not currently support IPv6 networks, it uses IPv4 addresses for communicating between nodes. This is because Hadoop is designed to work in private datacenters, which usually have private IP addresses in the 10.x.x.x address space. + + 1. Using IPv4 addresses everywhere provides a single form of TCP addressing for all our tests. Different network configurations (DNS, reverse DNS, DNS caching) still provide lots of problems and performance issues, but there is no need to worry about which IP protocol version is used. + 1. Shorter addresses make for shorter packets, which can have a benefit on busy networks. + + This does not mean that the Hadoop team thinks that IPv4 is the best ever network protocol and that there is no reason to upgrade ever, only that it works well in datacenters. If you are using Hadoop in other places you may encounter problems. A key limitation of this design decision is that it means Hadoop needs IPv4 to work, and only IPv4 clients can talk to the cluster. Equally critically, MapReduce jobs cannot talk to services, including web services, that only work on IPv6. If your organisation moves to IPv6 only, you will encounter problems. + + + In the mean time, the main concern is that the linux distribution tries to force Hadoop to use IPv6, which does not work. + 1. Many recent Linux distributions do not allow you to turn IPv6 off. There is a risk that Hadoop or Jetty-under-Hadoop has picked up an IPv6 address, which is why other machines may not be able to talk to it. + 1. Later Linux releases default to being IPv6 only. That means unless the systems are configured to re-enable IPv4, some machines break. As of Jan 2010, this was causing problems in Debian [[http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=560044|1]], [[http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=560056|2]], which is then leading to bug reports in other programs [[http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6342561|Sun bug database]], [[https://issues.apache.org/jira/browse/HADOOP-6056|Apache Jira]]. +
