DNS Configs may completely break HBase cluster
----------------------------------------------

                 Key: HBASE-3413
                 URL: https://issues.apache.org/jira/browse/HBASE-3413
             Project: HBase
          Issue Type: Bug
    Affects Versions: 0.90.0
         Environment: all
            Reporter: Mathias Herberts


I recently experienced a cluster malfunction which was caused by a change in 
DNS config for services co-hosted on the machines running region servers.

The RS are specified using IP addresses in the 'regionservers' file. Those 
machines are 1.example.com to N.example.com (there are A RRs for those names to 
each of the N IP addresses in 'regionservers').

Until recently, the PTR RRs for the RS IPs were those x.example.com names.

Then a service was deployed on some of the x.example.com machines, and new A 
RRs were added for svc.example.com which point to each of the IPs used for the 
service.

Jointly new PTR records were added too for the given IPs. Those PTR records 
have 'svc.example.com' as their PTRDATA, and this is causing the HBase cluster 
to get completely confused.

Since it is perfectly legal to have multiple PTR records, it seems important to 
make the canonicalization of RS more robust to DNS tweaks.

Maybe generating a UUID when a RS is started would help, this UUID could be 
used to register the RS in ZK and we would not rely on DNS for obtaining a 
stable canonical name (which may not even exist...).


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to