[ 
https://issues.apache.org/jira/browse/HBASE-12954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Clay B. updated HBASE-12954:
----------------------------
    Attachment: Hadoop Three Interfaces.png

Diagram showing two currently pathological networks:

h3. Multi-homed/Multi-IP Bare Metal

Here we see machines with multiple physical network interfaces (eth0, 
eth4.1000, eth5.1001) and mutiple IP addresses per physical interface (e.g. 
host-specific f-hostb.bach.example.com and a transient VIP 
services.bach.example.com).

Here, we would want to bind HBase services to a specific IP address (i.e. 
192.168.100.12)  and we would want to advertise HBase services in Zookeeper at 
a specific hostname (i.e. f-hostb.bach.example.com).

Using only a physical interface (i.e. eth5.1001) this is not possible.

h3. OpenStack

Here we see that each VM only knows of its internal (-int) address; i.e. if one 
runs an `ip addr list` they will only see a 100.127.0.0/24 address but in 
Zookeeper we would like to advertise the hostname which corresponds to the 
globally reachable NAT'd IP address (the 192.168.101.0/24) address (here for 
example `hosta.tenant.openstack.example.com`). Meanwhile, as the VM is unaware 
of the 192.168.101.0/24 network on its local IP stack we would have to bind to 
the hosts only IP interface 100.127.1.2.


> Ability impared using HBase on multihomed hosts
> -----------------------------------------------
>
>                 Key: HBASE-12954
>                 URL: https://issues.apache.org/jira/browse/HBASE-12954
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.98.4
>            Reporter: Clay B.
>            Priority: Minor
>         Attachments: Hadoop Three Interfaces.png
>
>
> For HBase clusters running on unusual networks (such as NAT'd cloud 
> environments or physical machines with multiple IP's per network interface) 
> it would be ideal to have a way to both specify:
> # which IP interface to which HBase master or region-server will bind
> # what hostname HBase will advertise in Zookeeper both for a master or 
> region-server process
> While efforts such as HBASE-8640 go a long way to normalize these two sources 
> of information, it is not possible in the current design of the properties 
> available to an administrator for these to be unambiguously specified.
> One has been able to request `hbase.master.ipc.address` or 
> `hbase.regionserver.ipc.address` but one can not specify the desired HBase 
> `hbase.master.hostname`. (It was removed in HBASE-1357, further I am unaware 
> of a region-server equivalent.)
> I use a configuration management system to generate all of my configuration 
> files on a per-machine basis. As such, an option to generate a file 
> specifying exactly which hostname to use would be helpful.
> Today, specifying the bind address for HBase works and one can use an 
> HBase-only DNS for faking what to put in Zookeeper but this is far from 
> ideal. Network interfaces have no intrinsic IP address, nor hostname. 
> Specifing a DNS server is awkward as the DNS server may differ from the 
> system's resolver and is a single IP address. Similarly, on hosts which use a 
> transient VIP (e.g. through keepalived) for other services, it means there's 
> a seemingly non-deterministic hostname choice made by HBase depending on the 
> state of the VIP at daemon start-up time.
> I will attach two networking examples I use which become very difficult to 
> manage under the current properties.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to