[
https://issues.apache.org/jira/browse/HADOOP-7030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tom White updated HADOOP-7030:
------------------------------
Assignee: Patrick Angeles
Status: Open (was: Patch Available)
This looks like a useful addition. Here are my comments on the patch:
* Could you combine the two types of file, so that if there are three columns
the first two are interpreted as a range, otherwise use the first as a single
host. Or just support CIDR notation?
* Have you thought about InetAddress to avoid implementing IP address parsing
logic?
http://guava-libraries.googlecode.com/svn/tags/release08/javadoc/com/google/common/net/InetAddresses.html
might be useful (there was talk of introducing Guava recently).
* RefreshableDNSToSwitchMapping isn't hooked up yet, so perhaps it should go in
a follow on JIRA.
* The name "TableMapping" is a bit general. How about "FileBasedMapping", or
similar?
* The configuration keys should go in CommonConfigurationKeysPublic.
* Primes are not needed in hashCode implementations. For Ip4
Arrays.hashCode(value) is sufficient.
* The tests swallow exceptions - there should at least be a comment saying that
this is expected. Also, fail() with a message is preferable to
assertTrue(false).
* The tests should be JUnit 4 style.
> new topology mapping implementations
> ------------------------------------
>
> Key: HADOOP-7030
> URL: https://issues.apache.org/jira/browse/HADOOP-7030
> Project: Hadoop Common
> Issue Type: New Feature
> Affects Versions: 0.21.0, 0.20.2, 0.20.1
> Reporter: Patrick Angeles
> Assignee: Patrick Angeles
> Attachments: HADOOP-7030-2.patch, HADOOP-7030.patch, topology.patch
>
>
> The default ScriptBasedMapping implementation of DNSToSwitchMapping for
> determining cluster topology has some drawbacks. Principally, it forks to an
> OS-specific script.
> This issue proposes two new Java implementations of DNSToSwitchMapping.
> TableMapping reads a two column text file that maps an IP or hostname to a
> rack ID. Ip4RangeMapping reads a three column text file where each line
> represents a start and end IP range plus a rack ID.
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira