[ https://issues.apache.org/jira/browse/AMBARI-18590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15573251#comment-15573251 ]
Hudson commented on AMBARI-18590: --------------------------------- SUCCESS: Integrated in Jenkins build Ambari-trunk-Commit #5797 (See [https://builds.apache.org/job/Ambari-trunk-Commit/5797/]) AMBARI-18590 - RegionServer Registration Checks Fail During Upgrade If (jhurley: [http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=4140cc78a0153799938c2dbe3f80c11ab3be2e30]) * (edit) ambari-server/src/main/resources/common-services/HBASE/0.96.0.2.0/package/scripts/upgrade.py > RegionServer Registration Checks Fail During Upgrade If rDNS is Not Enabled > --------------------------------------------------------------------------- > > Key: AMBARI-18590 > URL: https://issues.apache.org/jira/browse/AMBARI-18590 > Project: Ambari > Issue Type: Bug > Components: ambari-agent > Affects Versions: 2.2.0 > Reporter: Jonathan Hurley > Assignee: Jonathan Hurley > Priority: Blocker > Fix For: 2.5.0 > > Attachments: AMBARI-18590.patch > > > During a rolling upgrade, the upgrade orchestration must wait for each > RegionServer to register with the HBase master before moving onto the next RS > restart. This is a very asynchronous process which may occur several minutes > after the daemon has actually started. > We have a check now which uses {{hbase shell}} along with {{status 'simple'}} > to determine if the host has registered by looking for the hostname. > However, if reverse DNS is not enabled, then this could potentially be IP > addresses. As a result, the check would always fail during upgrades: > The HBase status command we use is {{status simple}}, which returns like so: > {noformat} > active master: 10.0.0.8:16000 1475801031124 > 2 backup masters > 10.0.0.10:16000 1475801061290 > 10.0.0.13:16000 1475801046018 > 2 live servers > 10.0.0.5:16020 1475798271407 > requestsPerSecond=0.0, numberOfOnlineRegions=2, usedHeapMB=159, > maxHeapMB=7840, numberOfStores=3, numberOfStorefiles=1, > storefileUncompressedSizeMB=0, storefileSizeMB=0, memstoreSizeMB=0, > storefileIndexSizeMB=0, readRequestsCount=14, writeRequestsCount=1, > rootIndexSizeKB=0, totalStaticIndexSizeKB=0, totalStaticBloomSizeKB=0, > totalCompactingKVs=14, currentCompactedKVs=14, compactionProgressPct=1.0, > coprocessors=[MultiRowMutationEndpoint, SecureBulkLoadEndpoint] > 10.0.0.7:16020 1475872741297 > requestsPerSecond=0.0, numberOfOnlineRegions=1, usedHeapMB=1002, > maxHeapMB=7840, numberOfStores=1, numberOfStorefiles=1, > storefileUncompressedSizeMB=0, storefileSizeMB=0, memstoreSizeMB=0, > storefileIndexSizeMB=0, readRequestsCount=0, writeRequestsCount=0, > rootIndexSizeKB=0, totalStaticIndexSizeKB=0, totalStaticBloomSizeKB=0, > totalCompactingKVs=0, currentCompactedKVs=0, compactionProgressPct=NaN, > coprocessors=[SecureBulkLoadEndpoint] > 0 dead servers > Aggregate load: 0, regions: 3 > {noformat} > If this lookup fails for the hostname, we should also try by IP address. -- This message was sent by Atlassian JIRA (v6.3.4#6332)