[
https://issues.apache.org/jira/browse/HBASE-3431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12991099#comment-12991099
]
stack commented on HBASE-3431:
------------------------------
Tested with name resolution broken on both ends. If I broke lookup completely,
the server wouldn't start, complaining that it couldn't resolve its name
(that's not new to my patch). If the name failed to resolve once it got to the
server side, then same thing again, with a complaint that it couldn't resolve
the regionserver name. Again, not new to my patch; more a commentary on how
HBase already complains loudly if resolution is mangled. The messages are
pretty plain about what's wrong.
I broke resolution on the master so the incoming RS did not resolve to a proper
address (in the past we'd send back an IP and use that ever after, and then
you'd have double-vision after the next heartbeat). Then on the RS I broke it
so that an FQDN was passed back when the master was dealing in host names only.
That worked too.
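To make the "double-vision" concrete, here is a toy sketch (not HBase's actual ServerManager; the class and method names are made up for illustration). It assumes only what the logs in the issue show: servers are listed under a "host,port,startcode" string, so the same regionserver checking in under an IP and then under a host name ends up as two entries.

```java
import java.util.LinkedHashSet;
import java.util.Set;

/** Illustration only: a hypothetical stand-in for the master's server listing. */
final class DoubleEntrySketch {
    // Entries are keyed the way the log lines print them: host,port,startcode.
    private final Set<String> servers = new LinkedHashSet<>();

    public static String key(String host, int port, long startcode) {
        return host + "," + port + "," + startcode;
    }

    /** Registers a server under whatever name it reported. */
    public void register(String host, int port, long startcode) {
        servers.add(key(host, port, startcode));
    }

    public int size() {
        return servers.size();
    }

    public static void main(String[] args) {
        DoubleEntrySketch master = new DoubleEntrySketch();
        // Same process (same port and startcode), two different names.
        master.register("10.10.30.11", 60020, 1294443935414L);
        master.register("perfnode11", 60020, 1294443935414L);
        System.out.println("entries: " + master.size());
    }
}
```

Because the key is a plain string, nothing ties the two names to one process; the regionserver has to keep using the name the master handed it.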
Review please. Unit tests are hard to do: we'd have to somehow mock Java's DNS
lookup. Changing the DNS at runtime doesn't seem to be possible (I can see
providing an alternate DNS provider to JNDI, but only if you pass flags on JVM
startup).
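One way the lookup could be made mockable, as a rough sketch only: put it behind a small interface and hand tests a fake. Nothing below is in the patch or in HBase; HostResolver, JvmResolver and FakeResolver are hypothetical names. The only real API used is java.net.InetAddress.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.HashMap;
import java.util.Map;

/** Illustration only: a hypothetical seam around the JVM's name lookup. */
final class ResolverSketch {

    /** Hypothetical interface; code under test would call this instead of InetAddress. */
    public interface HostResolver {
        String canonicalName(String host) throws UnknownHostException;
    }

    /** Production flavor: defers to the JVM's normal lookup. */
    public static final class JvmResolver implements HostResolver {
        @Override
        public String canonicalName(String host) throws UnknownHostException {
            return InetAddress.getByName(host).getCanonicalHostName();
        }
    }

    /** Test flavor: answers from a fixed table, fails for anything not in it. */
    public static final class FakeResolver implements HostResolver {
        private final Map<String, String> table = new HashMap<>();

        public FakeResolver add(String host, String answer) {
            table.put(host, answer);
            return this;
        }

        @Override
        public String canonicalName(String host) throws UnknownHostException {
            String answer = table.get(host);
            if (answer == null) {
                // Simulates a broken lookup without touching real DNS.
                throw new UnknownHostException(host);
            }
            return answer;
        }
    }
}
```

A test could then simulate "master sees an FQDN, RS sees a bare host name" by giving each side a differently loaded FakeResolver, with no JVM startup flags involved. The cost is threading the resolver through every place that does a lookup today.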
> Regionserver is not using the name given it by the master; double entry in
> master listing of servers
> ----------------------------------------------------------------------------------------------------
>
> Key: HBASE-3431
> URL: https://issues.apache.org/jira/browse/HBASE-3431
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.0
> Reporter: stack
> Assignee: stack
> Priority: Blocker
> Fix For: 0.90.1
>
> Attachments: 3431-v2.txt, 3431-v3.txt, 3431-v3.txt, 3431-v4.txt,
> 3431.txt
>
>
> Our man Ted Dunning found the following: the RS checks in with one name, the
> master tells it to use another name, but we seem to go ahead and continue
> with our original name.
> In RS logs I see:
> {code}
> 2011-01-07 15:45:50,757 INFO
> org.apache.hadoop.hbase.regionserver.HRegionServer [regionserver60020]:
> Master passed us address to use. Was=perfnode11:60020, Now=10.10.30.11:60020
> {code}
> On master I see
> {code}
> 2011-01-07 15:45:38,613 INFO org.apache.hadoop.hbase.master.ServerManager
> [IPC Server handler 0 on 60000]: Registering
> server=10.10.30.11,60020,1294443935414, regionCount=0, userLoad=false
> {code}
> ....
> then later
> {code}
> 2011-01-07 15:45:44,247 INFO org.apache.hadoop.hbase.master.ServerManager
> [IPC Server handler 2 on 60000]: Registering
> server=perfnode11,60020,1294443935414, regionCount=0, userLoad=true
> {code}
> This might be because we started letting servers register by means other
> than reportStartup.