[ 
https://issues.apache.org/jira/browse/HBASE-12954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14302729#comment-14302729
 ] 

stack commented on HBASE-12954:
-------------------------------

[~clayb]
bq.  I think there's some past design I'm not catching as to a desire to split 
internal and external HBase identification. 

I wouldn't call it 'design'. In past, hbase relied on folks setting up their 
named properly, but we were then repeatedly burned by master having one name 
for a regionserver but then the regionserver identifying itself by another 
(usually ip and resolved hostname); so one regionserver showed up twice.

The 'fix' was to just let the master say what a regionserver should call itself.

So, given users can't be expected to set up resolve properly, expecting them to 
set a 'hostname' in each RS site.xml, is of the same calibre; it is too much to 
ask. Its fine if it optional (with the master doing good defense to protect 
against it getting two misconfigured RS using same site.xml...)

I take it you like something like the attached patch where optionally we take 
what the RS says we should use -- if it had extra stuff like the master-side 
defense against double-registration and actual tests to prove it works?

Thanks for your quality input.


> Ability impaired using HBase on multihomed hosts
> ------------------------------------------------
>
>                 Key: HBASE-12954
>                 URL: https://issues.apache.org/jira/browse/HBASE-12954
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.98.4
>            Reporter: Clay B.
>            Assignee: Ted Yu
>            Priority: Minor
>         Attachments: 12954-v1.txt, Hadoop Three Interfaces.png
>
>
> For HBase clusters running on unusual networks (such as NAT'd cloud 
> environments or physical machines with multiple IP's per network interface) 
> it would be ideal to have a way to both specify:
> # which IP interface to which HBase master or region-server will bind
> # what hostname HBase will advertise in Zookeeper both for a master or 
> region-server process
> While efforts such as HBASE-8640 go a long way to normalize these two sources 
> of information, it is not possible in the current design of the properties 
> available to an administrator for these to be unambiguously specified.
> One has been able to request {{hbase.master.ipc.address}} or 
> {{hbase.regionserver.ipc.address}} but one can not specify the desired HBase 
> {{hbase.master.hostname}}. (It was removed in HBASE-1357, further I am 
> unaware of a region-server equivalent.)
> I use a configuration management system to generate all of my configuration 
> files on a per-machine basis. As such, an option to generate a file 
> specifying exactly which hostname to use would be helpful.
> Today, specifying the bind address for HBase works and one can use an 
> HBase-only DNS for faking what to put in Zookeeper but this is far from 
> ideal. Network interfaces have no intrinsic IP address, nor hostname. 
> Specifing a DNS server is awkward as the DNS server may differ from the 
> system's resolver and is a single IP address. Similarly, on hosts which use a 
> transient VIP (e.g. through keepalived) for other services, it means there's 
> a seemingly non-deterministic hostname choice made by HBase depending on the 
> state of the VIP at daemon start-up time.
> I will attach two networking examples I use which become very difficult to 
> manage under the current properties.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to