Dinesh Nithyanandam created HBASE-24243:
-------------------------------------------

             Summary: Unable to start HRegionserver and Master node considers 
as a dead region
                 Key: HBASE-24243
                 URL: https://issues.apache.org/jira/browse/HBASE-24243
             Project: HBase
          Issue Type: Brainstorming
          Components: regionserver
            Reporter: Dinesh Nithyanandam


Hi Team,

I am currently using Apache Hbase version - 1.3.6 and I am trying to run Master 
and region server separately and then join the cluster dynamically but it was 
region server was not starting and always reports that "*The RegionServer is 
initializing*!"

Commands used as below: (Master and region are on separate nodes )

Node A - Hbase Master - /opt/hbase/bin/hbase-daemon.sh --config 
/usr/local/bin/hbase/conf start master

Node B - Hbase Region - /opt/hbase/bin/hbase-daemon.sh --config 
/usr/local/bin/hbase/conf start regionserver

Environment - Google Compute Engine (GCE) Instance groups/VM's

OS Type - CentOS -7

Also not sure on how to enable reverse DNS across both the machines and whether 
that is the problem and please do advice on how do i achieve it

*Master logs:*

>From the below master logs it clearly says that master is trying to connect to 
>region and then eventually getting disconnected from the client region server 
 * "*DEBUG 
[RpcServer.reader=1,bindAddress=pinpoint-master-v000-rh5k.c.gcp-ushi-telemetry-npe.internal,port=16000]
 ipc.RpcServer: RpcServer.listener,port=16000: DISCONNECTING client 
10.148.6.13:45732 because read count=-1. Number of active connections: 1"*

*complete logs*

2020-04-22 19:38:24,812 DEBUG [RpcServer.listener,port=16000] ipc.RpcServer: 
RpcServer.listener,port=16000: connection from 10.148.6.13:45732; # active 
connections: 1
2020-04-22 19:38:24,961 DEBUG 
[RpcServer.FifoWFPBQ.default.handler=29,queue=2,port=16000] ipc.RpcServer: 
RpcServer.FifoWFPBQ.default.handler=29,queue=2,port=16000: callId: 0 service: 
RegionServerStatusService methodName: RegionServerStartup size: 47 connection: 
10.148.6.13:45732
2020-04-22 19:38:30,591 DEBUG 
[*pinpoint-master-v000-rh5k:16000*.activeMasterManager] ipc.RpcClientImpl: 
Connecting to 
*pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020*
2020-04-22 19:38:31,268 *DEBUG [hconnection-0x5f02b9cb-shared--pool3-t1] 
ipc.RpcClientImpl: Connecting to 
pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020*
2020-04-22 19:38:31,478 DEBUG [ProcedureExecutor-3] ipc.RpcClientImpl: 
Connecting to 
pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020
2020-04-22 19:39:32,714 *DEBUG 
[RpcServer.reader=1,bindAddress=pinpoint-master-v000-rh5k.c.gcp-ushi-telemetry-npe.internal,port=16000]
 ipc.RpcServer: RpcServer.listener,port=16000: DISCONNECTING client 
10.148.6.13:45732 because read count=-1. Number of active connections: 1*

 

*Region server logs:*

>From the below logs region server discovers the master on it's own but unable 
>to join the cluster with below logs

===============================================================

 

2020-04-22 19:38:24,675 INFO 
*[regionserver/pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020]
 regionserver.HRegionServer: reportForDuty to 
master=pinpoint-master-v000-rh5k.c.gcp-ushi-telemetry-npe.internal,16000*,1587584303253
 with port=16020, startcode=1587583634667
2020-04-22 19:38:24,801 DEBUG 
[regionserver/pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020]
 ipc.RpcClientImpl: Connecting to 
pinpoint-master-v000-rh5k.c.gcp-ushi-telemetry-npe.internal/10.148.6.154:16000
2020-04-22 19:38:28,005 INFO 
[regionserver/pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020]
 regionserver.HRegionServer: reportForDuty to 
master=pinpoint-master-v000-rh5k.c.gcp-ushi-telemetry-npe.internal,16000,1587584303253
 with port=16020, startcode=1587583634667
2020-04-22 19:38:28,033 INFO 
[regionserver/pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020]
 regionserver.HRegionServer: Config from master: 
hbase.rootdir=hdfs://10.148.6.68:9000/hbase
2020-04-22 19:38:28,033 INFO 
[regionserver/pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020]
 regionserver.HRegionServer: Config from master: 
fs.defaultFS=hdfs://10.148.6.68:9000
2020-04-22 19:38:28,033 INFO 
[regionserver/pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020]
 regionserver.HRegionServer: Config from master: hbase.master.info.port=16010

===============================================================

 

2020-04-22 19:38:24,801 DEBUG 
[regionserver/pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal/10.148.6.13:16020]
 ipc.RpcClientImpl: Connecting to 
pinpoint-master-v000-rh5k.c.gcp-ushi-telemetry-npe.internal/10.148.6.154:16000
2020-04-22 19:38:30,592 DEBUG [RpcServer.listener,port=16020] ipc.RpcServer: 
RpcServer.listener,port=16020: connection from 10.148.6.154:53050; # active 
connections: 1
2020-04-22 19:38:31,269 DEBUG [RpcServer.listener,port=16020] ipc.RpcServer: 
RpcServer.listener,port=16020: connection from 10.148.6.154:53052; # active 
connections: 2
2020-04-22 19:38:31,479 DEBUG [RpcServer.listener,port=16020] ipc.RpcServer: 
RpcServer.listener,port=16020: connection from 10.148.6.154:53056; # active 
connections: 3
2020-04-22 19:39:32,413 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 3 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,440 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 4 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,443 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 5 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,445 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 6 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,447 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 7 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,450 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 8 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,452 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 9 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,454 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 10 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,456 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 11 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050
2020-04-22 19:39:32,458 DEBUG 
[RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020] ipc.RpcServer: 
RpcServer.FifoWFPBQ.priority.handler=19,queue=1,port=16020: callId: 12 service: 
AdminService methodName: OpenRegion size: 81 connection: 10.148.6.154:53050

===============================================================

2020-04-23 04:40:07,751 DEBUG 
[RpcServer.reader=3,bindAddress=pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal,port=16020]
 ipc.RpcServer: RpcServer.listener,port=16020: DISCONNECTING client 
10.148.6.13:44272 because read count=-1. Number of active connections: 1
2020-04-23 04:40:17,751 DEBUG [RpcServer.listener,port=16020] ipc.RpcServer: 
RpcServer.listener,port=16020: connection from 10.148.6.13:44280; # active 
connections: 1
2020-04-23 04:40:17,752 DEBUG 
[RpcServer.reader=4,bindAddress=pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal,port=16020]
 ipc.RpcServer: RpcServer.listener,port=16020: DISCONNECTING client 
10.148.6.13:44280 because read count=-1. Number of active connections: 1
2020-04-23 04:40:27,752 DEBUG [RpcServer.listener,port=16020] ipc.RpcServer: 
RpcServer.listener,port=16020: connection from 10.148.6.13:44282; # active 
connections: 1
2020-04-23 04:40:27,752 DEBUG 
[RpcServer.reader=5,bindAddress=pinpoint-r-v000-976s.c.gcp-ushi-telemetry-npe.internal,port=16020]
 ipc.RpcServer: RpcServer.listener,port=16020: DISCONNECTING client 
10.148.6.13:44282 because read count=-1. Number of active connections: 1
2020-04-23 04:40:37,752 DEBUG [RpcServer.listener,port=16020] ipc.RpcServer: 
RpcServer.listener,port=16020: connection from 10.148.6.13:44284; # active 
connections: 1
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to