[jira] [Commented] (HBASE-20324) Hbase master fails to become active in kerberos environment

2018-04-05 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-20324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16426958#comment-16426958
 ] 

Abhishek Kulkarni commented on HBASE-20324:
---

Any update on this? This has been blocking us for a long time.

I would appreciate any temporary workaround to resolve it.

> Hbase master fails to become active in kerberos environment
> ---
>
> Key: HBASE-20324
> URL: https://issues.apache.org/jira/browse/HBASE-20324
> Project: HBase
>  Issue Type: Bug
> Environment: Hbase 2.0.0-beta2
> zookeeper-3.5.3-beta
> 3 nodes Env
> Kdc server on namenode
> *hadoop-2.7.3*
> *--Configured with keytabs(abhishekk1/2/3 are nodes)* 
>    *abhishekk1 is namenode/hmaster*
>    *abhishekk2/3 are datanodes/regionservers*
>Reporter: Abhishek Kulkarni
>Priority: Blocker
> Fix For: 2.0.0
>
> Attachments: hbase-root-master-.log, hbase-root-regionserver.log
>
>
>  
> [^hbase-root-master-.log]
> ^[^hbase-root-regionserver.log]^
>  
> ^^I have been trying to resolve this for the last month through different 
> forums but have not been able to resolve it at all.^^



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-04-01 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421608#comment-16421608
 ] 

Abhishek Kulkarni commented on HBASE-19287:
---

Can anyone update me about the mailing list? 

FYI: my HDFS setup is up and running in the same Kerberos environment.

> master hangs forever if RecoverMeta send assign meta region request to target 
> server fail
> -
>
> Key: HBASE-19287
> URL: https://issues.apache.org/jira/browse/HBASE-19287
> Project: HBase
>  Issue Type: Bug
>  Components: proc-v2
>Affects Versions: 2.0.0
>Reporter: Yi Liang
>Assignee: Yi Liang
>Priority: Major
> Fix For: 2.0.0-beta-1, 2.0.0
>
> Attachments: HBASE-19287-master-v3.patch, 
> HBASE-19287-master-v3.patch, HBASE-19287-master-v4.patch, 
> hbase-19287-master-v2.patch, master.patch
>
>
> 2017-11-10 19:26:56,019 INFO  [ProcExecWrkr-1] 
> procedure.RecoverMetaProcedure: pid=138, 
> state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure 
> failedMetaServer=null, splitWal=true; Retaining meta assignment to 
> server=hadoop-slave1.hadoop,16020,1510341981454
> 2017-11-10 19:26:56,029 INFO  [ProcExecWrkr-1] procedure2.ProcedureExecutor: 
> Initialized subprocedures=[{pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454}]
> 2017-11-10 19:26:56,067 INFO  [ProcExecWrkr-2] 
> procedure.MasterProcedureScheduler: pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454 hbase:meta 
> hbase:meta,,1.1588230740
> 2017-11-10 19:26:56,071 INFO  [ProcExecWrkr-2] assignment.AssignProcedure: 
> Start pid=139, ppid=138, state=RUNNABLE:REGION_TRANSITION_QUEUE; 
> AssignProcedure table=hbase:meta, region=1588230740, 
> target=hadoop-slave1.hadoop,16020,1510341981454; rit=OFFLINE, 
> location=hadoop-slave1.hadoop,16020,1510341981454; forceNewPlan=false, 
> retain=false
> 2017-11-10 19:26:56,224 INFO  [ProcExecWrkr-4] zookeeper.MetaTableLocator: 
> Setting hbase:meta (replicaId=0) location in ZooKeeper as 
> hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:56,230 INFO  [ProcExecWrkr-4] 
> assignment.RegionTransitionProcedure: Dispatch pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454; 
> rit=OPENING, location=hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:56,382 INFO  [ProcedureDispatcherTimeoutThread] 
> procedure.RSProcedureDispatcher: Using procedure batch rpc execution for 
> serverName=hadoop-slave2.hadoop,16020,1510341988652 version=2097152
> 2017-11-10 19:26:57,542 INFO  [main-EventThread] 
> zookeeper.RegionServerTracker: RegionServer ephemeral node deleted, 
> processing expiration [hadoop-slave2.hadoop,16020,1510341988652]
> 2017-11-10 19:26:57,543 INFO  [main-EventThread] master.ServerManager: Master 
> doesn't enable ServerShutdownHandler during initialization, delay expiring 
> server hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:58,875 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Registering 
> server=hadoop-slave1.hadoop,16020,1510342016106
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Registering 
> server=hadoop-slave2.hadoop,16020,1510342023184
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Triggering server recovery; existingServer 
> hadoop-slave2.hadoop,16020,1510341988652 looks stale, new 
> server:hadoop-slave2.hadoop,16020,1510342023184
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Master doesn't enable ServerShutdownHandler during 
> initialization, delay expiring server hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:27:49,815 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> client.RpcRetryingCallerImpl: started=38594 ms ago, cancelled=false, 
> msg=org.apache.hadoop.hbase.NotServingRegionException: hbase:meta,,1 is not 
> online on hadoop-slave2.hadoop,16020,1510342023184
> at org.apache.hadoop.hbase.regionserver.HRegionServer.getRegionByEncodedName(HRegionServer.java:3290)
> at org.apache.hadoop.hbase.regionserver.RSRpcServices.getRegion(RSRpcServices.java:1370)
> at org.apache.hadoop.hbase.regionserver.RSRpcServices.get(RSRpcServices.java:2401)
> at 
> 

[jira] [Commented] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-03-31 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421431#comment-16421431
 ] 

Abhishek Kulkarni commented on HBASE-19287:
---

2018-03-31 14:00:18,202 INFO  [LruBlockCacheStatsExecutor] hfile.LruBlockCache: 
totalSize=1.03 MB, freeSize=1.38 GB, max=1.38 GB, blockCount=0, accesses=0, 
hits=0, hitRatio=0, cachingAccesses=0, cachingHits=0, 
cachingHitsRatio=0,evictions=3239, evicted=0, evictedPerRun=0.0
2018-03-31 14:00:18,208 INFO  [MobFileCache #0] mob.MobFileCache: MobFileCache 
Statistics, access: 0, miss: 0, hit: 0, hit ratio: 0%, evicted files: 0
2018-03-31 14:00:20,763 INFO  [regionserver/abhishekk3:16020.logRoller] 
wal.AbstractFSWAL: Rolled WAL 
/hbase/WALs/abhishekk3.pne.ven.veritas.com,16020,1522486816915/abhishekk3.pne.ven.veritas.com%2C16020%2C1522486816915.1522515620673
 with entries=0, filesize=83 B; new WAL 
/hbase/WALs/abhishekk3.pne.ven.veritas.com,16020,1522486816915/abhishekk3.pne.ven.veritas.com%2C16020%2C1522486816915.1522519220738
2018-03-31 14:00:20,763 INFO  [regionserver/abhishekk3:16020.logRoller] 
wal.AbstractFSWAL: Archiving 
hdfs://abhishekk1.pne.ven.veritas.com:54310/hbase/WALs/abhishekk3.pne.ven.veritas.com,16020,1522486816915/abhishekk3.pne.ven.veritas.com%2C16020%2C1522486816915.1522515620673
 to 
hdfs://abhishekk1.pne.ven.veritas.com:54310/hbase/oldWALs/abhishekk3.pne.ven.veritas.com%2C16020%2C1522486816915.1522515620673
2018-03-31 14:05:18,202 INFO  [LruBlockCacheStatsExecutor] hfile.LruBlockCache: 
totalSize=1.03 MB, freeSize=1.38 GB, max=1.38 GB, blockCount=0, accesses=0, 
hits=0, hitRatio=0, cachingAccesses=0, cachingHits=0, 
cachingHitsRatio=0,evictions=3269, evicted=0, evictedPerRun=0.0


[jira] [Commented] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-03-31 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421429#comment-16421429
 ] 

Abhishek Kulkarni commented on HBASE-19287:
---

I don't see any errors on the regionserver other than this warning:

WARN  [Close-WAL-Writer-3] asyncfs.FanOutOneBlockAsyncDFSOutputHelper: complete 
file 
/hbase/WALs/abhishekk3.pne.ven.veritas.com,16020,1522486816915/abhishekk3.pne.ven.veritas.com%2C16020%2C1522486816915.1522497620342
 not finished, retry = 0



Sat Mar 31 05:00:13 EDT 2018 Starting regionserver on 
abhishekk3.pne.ven.veritas.com
core file size  (blocks, -c) 0
data seg size   (kbytes, -d) unlimited
scheduling priority (-e) 0
file size   (blocks, -f) unlimited
pending signals (-i) 63395
max locked memory   (kbytes, -l) 64
max memory size (kbytes, -m) unlimited
open files  (-n) 1024
pipe size    (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority  (-r) 0
stack size  (kbytes, -s) 8192
cpu time   (seconds, -t) unlimited
max user processes  (-u) 63395
virtual memory  (kbytes, -v) unlimited
file locks  (-x) unlimited
2018-03-31 05:00:16,504 INFO  [main] regionserver.HRegionServer: STARTING 
executorService HRegionServer
2018-03-31 05:00:16,505 INFO  [main] util.VersionInfo: HBase 2.0.0-beta-2
2018-03-31 05:00:16,505 INFO  [main] util.VersionInfo: Source code repository 
git://ve0524/home/stack/hbase.git 
revision=9e9b347d667e1fc6165c9f8ae5ae7052147e8895
2018-03-31 05:00:16,505 INFO  [main] util.VersionInfo: Compiled by stack on Fri 
Mar  2 13:29:06 PST 2018
2018-03-31 05:00:16,505 INFO  [main] util.VersionInfo: From source with 
checksum 07b3577d4d48c7a79e91a2d133e09db2
2018-03-31 05:00:16,888 INFO  [main] util.ServerCommandLine: hbase.tmp.dir: 
/tmp/hbase-root
2018-03-31 05:00:16,888 INFO  [main] util.ServerCommandLine: hbase.rootdir: 
hdfs://abhishekk1.pne.ven.veritas.com:54310/hbase
2018-03-31 05:00:16,888 INFO  [main] util.ServerCommandLine: 
hbase.cluster.distributed: true
2018-03-31 05:00:16,888 INFO  [main] util.ServerCommandLine: 
hbase.zookeeper.quorum: 
abhishekk1.pne.ven.veritas.com,abhishekk2.pne.ven.veritas.com,abhishekk3.pne.ven.veritas.com
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: 
env:PATH=/home/java/jdk1.8.0_131//bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/home/hadoop/hadoop-2.7.3//sbin:/home/hadoop/hadoop-2.7.3//bin:/home/hbase2/hbase-2.0.0-beta-2/bin
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: 
env:HBASE_PID_DIR=/var/hbase/pids
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: 
env:HBASE_REGIONSERVER_OPTS=-Djava.security.auth.login.config=/home/hbase2/hbase-2.0.0-beta-2/conf/hbaseregionserver-jaas.conf
 
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: 
env:MAIL=/var/mail/root
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: 
env:LD_LIBRARY_PATH=:/home/hadoop/hadoop-2.7.3//lib/native
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: env:LOGNAME=root
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: 
env:HBASE_REST_OPTS=
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: env:PWD=/root
2018-03-31 05:00:16,889 INFO  [main] util.ServerCommandLine: 
env:HADOOP_INSTALL=/home/hadoop/hadoop-2.7.3/
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HBASE_ROOT_LOGGER=INFO,RFA
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:LESSOPEN=||/usr/bin/lesspipe.sh %s
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: env:SHELL=/bin/bash
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HBASE_ENV_INIT=true
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HBASE_MASTER_OPTS=-Djava.security.auth.login.config=/home/hbase2/hbase-2.0.0-beta-2/conf/hbasemaster-jaas.conf
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HBASE_MANAGES_ZK=false
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HBASE_REGIONSERVERS=/home/hbase2/hbase-2.0.0-beta-2/conf/regionservers
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HADOOP_HOME=/home/hadoop/hadoop-2.7.3/
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HBASE_NICENESS=0
2018-03-31 05:00:16,890 INFO  [main] util.ServerCommandLine: 
env:HBASE_OPTS=-Djava.security.auth.login.config=/home/hbase2/hbase-2.0.0-beta-2/conf/hbaseclient-jass.conf
  
-Djava.security.auth.login.config=/home/hbase2/hbase-2.0.0-beta-2/conf/hbaseregionserver-jaas.conf
  -Dhbase.log.dir=/home/hbase2/hbase-2.0.0-beta-2/logs 
-Dhbase.log.file=hbase-root-regionserver-abhishekk3.pne.ven.veritas.com.log 
-Dhbase.home.dir=/home/hbase2/hbase-2.0.0-beta-2 -Dhbase.id.str=root 
-Dhbase.root.logger=INFO,RFA 
-Djava.library.path=/home/hadoop/hadoop-2.7.3//lib/native 
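
One detail in the startup log above is worth a second look: HBASE_OPTS sets -Djava.security.auth.login.config twice (once to hbaseclient-jass.conf, once to hbaseregionserver-jaas.conf), and HBASE_REGIONSERVER_OPTS sets it again. When a system property is repeated on a JVM command line, only one value can take effect; on the JVMs I am aware of it is the last one, but treat that as an assumption and verify it on yours. The following is a minimal Python sketch, not an HBase tool, for spotting repeated -D properties in an options string. The function name find_repeated_system_properties is hypothetical, and the HBASE_OPTS value is abbreviated from the log above.

```python
import shlex
from collections import defaultdict
from typing import Dict, List


def find_repeated_system_properties(jvm_opts: str) -> Dict[str, List[str]]:
    """Return {property: [values, ...]} for every -D property set more than once."""
    values = defaultdict(list)
    for token in shlex.split(jvm_opts):
        if not token.startswith("-D"):
            continue  # skip -X flags and anything that is not a system property
        key, _, value = token[2:].partition("=")
        values[key].append(value)
    # Keep only the properties that appear at least twice, in first-seen order.
    return {key: vals for key, vals in values.items() if len(vals) > 1}


# Abbreviated from the env:HBASE_OPTS line in the regionserver log above.
HBASE_OPTS = (
    "-Djava.security.auth.login.config=/home/hbase2/hbase-2.0.0-beta-2/conf/hbaseclient-jass.conf "
    "-Djava.security.auth.login.config=/home/hbase2/hbase-2.0.0-beta-2/conf/hbaseregionserver-jaas.conf "
    "-Dhbase.log.dir=/home/hbase2/hbase-2.0.0-beta-2/logs "
    "-Dhbase.root.logger=INFO,RFA"
)

if __name__ == "__main__":
    for prop, vals in find_repeated_system_properties(HBASE_OPTS).items():
        print(prop)
        for val in vals:
            print("    " + val)
```

Run against the value above, the sketch reports java.security.auth.login.config with both paths, which makes it easy to check that the file the process actually ends up using exists and names the intended principal and keytab.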

[jira] [Created] (HBASE-20324) Hbase master fails to become active in kerberos environment

2018-03-31 Thread Abhishek Kulkarni (JIRA)
Abhishek Kulkarni created HBASE-20324:
-

 Summary: Hbase master fails to become active in kerberos 
environment
 Key: HBASE-20324
 URL: https://issues.apache.org/jira/browse/HBASE-20324
 Project: HBase
  Issue Type: Bug
 Environment: Hbase 2.0.0-beta2

zookeeper-3.5.3-beta

3 nodes Env

Kdc server on namenode

*hadoop-2.7.3*

*--Configured with keytabs(abhishekk1/2/3 are nodes)* 

   *abhishekk1 is namenode/hmaster*

   *abhishekk2/3 are datanodes/regionservers*
Reporter: Abhishek Kulkarni
 Attachments: hbase-root-master-.log, hbase-root-regionserver.log

 

[^hbase-root-master-.log]

^[^hbase-root-regionserver.log]^

 

^^I have been trying to resolve this for the last month through different 
forums but have not been able to resolve it at all.^^





[jira] [Commented] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-03-31 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421416#comment-16421416
 ] 

Abhishek Kulkarni commented on HBASE-19287:
---

2018-03-31 05:00:27,274 WARN  [RSProcedureDispatcher-pool3-t10] 
assignment.RegionTransitionProcedure: Remote call failed pid=12, ppid=11, 
state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
region=1588230740; rit=OPENING, 
location=abhishekk3.pne.ven.veritas.com,16020,1522486816915; exception=Call to 
abhishekk3.pne.ven.veritas.com/10.210.62.30:16020 failed on local exception: 
java.io.IOException: Can not send request because relogin is in progress.
2018-03-31 05:00:27,274 INFO  [RSProcedureDispatcher-pool3-t10] 
assignment.AssignProcedure: Retry=10 of max=10; pid=12, ppid=11, 
state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
region=1588230740; rit=OPENING, 
location=abhishekk3.pne.ven.veritas.com,16020,1522486816915
2018-03-31 05:00:27,275 INFO  [PEWorker-7] assignment.AssignProcedure: Retry=11 
of max=10; pid=12, ppid=11, state=RUNNABLE:REGION_TRANSITION_QUEUE; 
AssignProcedure table=hbase:meta, region=1588230740; rit=OFFLINE, location=null
2018-03-31 05:00:27,408 ERROR [PEWorker-7] procedure2.ProcedureExecutor: 
CODE-BUG: Uncaught runtime exception for pid=11, 
state=FAILED:RECOVER_META_ASSIGN_REGIONS, 
exception=org.apache.hadoop.hbase.client.RetriesExhaustedException via 
AssignProcedure:org.apache.hadoop.hbase.client.RetriesExhaustedException: Max 
attempts exceeded; RecoverMetaProcedure failedMetaServer=null, splitWal=true
java.lang.UnsupportedOperationException: unhandled state=RECOVER_META_ASSIGN_REGIONS
    at org.apache.hadoop.hbase.master.procedure.RecoverMetaProcedure.rollbackState(RecoverMetaProcedure.java:188)
    at org.apache.hadoop.hbase.master.procedure.RecoverMetaProcedure.rollbackState(RecoverMetaProcedure.java:53)
    at org.apache.hadoop.hbase.procedure2.StateMachineProcedure.rollback(StateMachineProcedure.java:199)
    at org.apache.hadoop.hbase.procedure2.Procedure.doRollback(Procedure.java:859)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeRollback(ProcedureExecutor.java:1353)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeRollback(ProcedureExecutor.java:1309)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeProcedure(ProcedureExecutor.java:1178)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.access$800(ProcedureExecutor.java:75)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor$WorkerThread.run(ProcedureExecutor.java:1740)
2018-03-31 05:00:27,413 ERROR [PEWorker-7] procedure2.ProcedureExecutor: 
CODE-BUG: Uncaught runtime exception for pid=11, 
state=FAILED:RECOVER_META_ASSIGN_REGIONS, 
exception=org.apache.hadoop.hbase.client.RetriesExhaustedException via 
AssignProcedure:org.apache.hadoop.hbase.client.RetriesExhaustedException: Max 
attempts exceeded; RecoverMetaProcedure failedMetaServer=null, splitWal=true
java.lang.UnsupportedOperationException: unhandled state=RECOVER_META_ASSIGN_REGIONS
    at org.apache.hadoop.hbase.master.procedure.RecoverMetaProcedure.rollbackState(RecoverMetaProcedure.java:188)
    at org.apache.hadoop.hbase.master.procedure.RecoverMetaProcedure.rollbackState(RecoverMetaProcedure.java:53)
    at org.apache.hadoop.hbase.procedure2.StateMachineProcedure.rollback(StateMachineProcedure.java:199)
    at org.apache.hadoop.hbase.procedure2.Procedure.doRollback(Procedure.java:859)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeRollback(ProcedureExecutor.java:1353)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeRollback(ProcedureExecutor.java:1309)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeProcedure(ProcedureExecutor.java:1178)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.access$800(ProcedureExecutor.java:75)
    at org.apache.hadoop.hbase.procedure2.ProcedureExecutor$WorkerThread.run(ProcedureExecutor.java:1740)
2018-03-31 05:00:27,423 INFO  [PEWorker-7] procedure2.ProcedureExecutor: Rolled 
back pid=11, state=ROLLEDBACK, 
exception=org.apache.hadoop.hbase.client.RetriesExhaustedException via 
AssignProcedure:org.apache.hadoop.hbase.client.RetriesExhaustedException: Max 
attempts exceeded; RecoverMetaProcedure failedMetaServer=null, splitWal=true 
exec-time=5.1550sec
2018-03-31 05:00:27,423 ERROR [master/abhishekk1:16000] master.HMaster: Failed 
to become active master
org.apache.hadoop.hbase.client.RetriesExhaustedException: Max attempts exceeded
    at org.apache.hadoop.hbase.master.assignment.AssignProcedure.startTransition(AssignProcedure.java:181)
    at org.apache.hadoop.hbase.master.assignment.RegionTransitionProcedure.execute(RegionTransitionProcedure.java:295)
    at 
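
The excerpt above mixes several distinct failures: the remote call failing because a relogin is in progress, the assign retries being exhausted, and finally the master failing to become active. To see at a glance which of these dominates a long master log, a rough tally can help. The following is a minimal Python sketch under stated assumptions: the signature strings are copied from the log excerpts in this thread, the function name count_signatures is hypothetical, and plain substring counting is a simplification rather than a proper log parser.

```python
import re
from collections import Counter

# Message fragments copied from the master log excerpts in this thread.
SIGNATURES = [
    "GSS initiate failed",
    "relogin is in progress",
    "Max attempts exceeded",
    "Failed to become active master",
]


def count_signatures(log_text: str) -> Counter:
    """Count how often each known failure fragment occurs in the log text."""
    # Mail wrapping can split one message across lines, so collapse all
    # whitespace runs to a single space before searching.
    flat = re.sub(r"\s+", " ", log_text)
    return Counter({sig: flat.count(sig) for sig in SIGNATURES})


# A few lines taken from the excerpt above, wrapped the same way.
SAMPLE = """\
2018-03-31 05:00:27,274 WARN  [RSProcedureDispatcher-pool3-t10]
assignment.RegionTransitionProcedure: Remote call failed pid=12, ppid=11,
state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta,
region=1588230740; rit=OPENING,
location=abhishekk3.pne.ven.veritas.com,16020,1522486816915; exception=Call to
abhishekk3.pne.ven.veritas.com/10.210.62.30:16020 failed on local exception:
java.io.IOException: Can not send request because relogin is in progress.
2018-03-31 05:00:27,423 ERROR [master/abhishekk1:16000] master.HMaster: Failed
to become active master
org.apache.hadoop.hbase.client.RetriesExhaustedException: Max attempts exceeded
"""

if __name__ == "__main__":
    for signature, count in count_signatures(SAMPLE).items():
        print(f"{count:4d}  {signature}")
```

Collapsing whitespace first is the one design choice that matters here: without it, a fragment such as "Failed to become active master" would be missed whenever the wrap falls inside it.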

[jira] [Commented] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-03-31 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421413#comment-16421413
 ] 

Abhishek Kulkarni commented on HBASE-19287:
---

    at org.apache.hbase.thirdparty.io.netty.util.concurrent.DefaultThreadFactory$DefaultRunnableDecorator.run(DefaultThreadFactory.java:138)
    at java.lang.Thread.run(Thread.java:748)
Caused by: java.io.IOException: org.apache.hbase.thirdparty.io.netty.handler.codec.DecoderException: org.apache.hadoop.ipc.RemoteException(javax.security.sasl.SaslException): GSS initiate failed
    at org.apache.hadoop.hbase.ipc.IPCUtil.toIOE(IPCUtil.java:148)
    ... 25 more
Caused by: org.apache.hbase.thirdparty.io.netty.handler.codec.DecoderException: org.apache.hadoop.ipc.RemoteException(javax.security.sasl.SaslException): GSS initiate failed
    at org.apache.hbase.thirdparty.io.netty.handler.codec.ByteToMessageDecoder.callDecode(ByteToMessageDecoder.java:459)
    at org.apache.hbase.thirdparty.io.netty.handler.codec.ByteToMessageDecoder.channelRead(ByteToMessageDecoder.java:265)
    at org.apache.hbase.thirdparty.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:362)
    ... 12 more
Caused by: org.apache.hadoop.ipc.RemoteException(javax.security.sasl.SaslException): GSS initiate failed
    at org.apache.hadoop.hbase.security.SaslChallengeDecoder.tryDecodeError(SaslChallengeDecoder.java:92)
    at org.apache.hadoop.hbase.security.SaslChallengeDecoder.decode(SaslChallengeDecoder.java:109)
    at org.apache.hbase.thirdparty.io.netty.handler.codec.ByteToMessageDecoder.decodeRemovalReentryProtection(ByteToMessageDecoder.java:489)
    at org.apache.hbase.thirdparty.io.netty.handler.codec.ByteToMessageDecoder.callDecode(ByteToMessageDecoder.java:428)
    ... 14 more
2018-03-31 05:00:24,484 WARN  [RSProcedureDispatcher-pool3-t1] 
assignment.RegionTransitionProcedure: Remote call failed pid=12, ppid=11, 
state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
region=1588230740, target=abhishekk2.pne.ven.veritas.com,16020,1522480363659; 
rit=OPENING, location=abhishekk2.pne.ven.veritas.com,16020,1522486814482; 
exception=Call to abhishekk2.pne.ven.veritas.com/10.210.62.29:16020 failed on 
local exception: java.io.IOException: 
org.apache.hbase.thirdparty.io.netty.handler.codec.DecoderException: 
org.apache.hadoop.ipc.RemoteException(javax.security.sasl.SaslException): GSS 
initiate failed
2018-03-31 05:00:24,485 INFO  [RSProcedureDispatcher-pool3-t1] 
assignment.AssignProcedure: Retry=1 of max=10; pid=12, ppid=11, 
state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
region=1588230740, target=abhishekk2.pne.ven.veritas.com,16020,1522480363659; 
rit=OPENING, location=abhishekk2.pne.ven.veritas.com,16020,1522486814482
2018-03-31 05:00:24,485 INFO  [PEWorker-5] assignment.AssignProcedure: Starting 
pid=12, ppid=11, state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure 
table=hbase:meta, region=1588230740; rit=OFFLINE, location=null; 
forceNewPlan=true, retain=false
2018-03-31 05:00:24,638 INFO  [PEWorker-6] zookeeper.MetaTableLocator: Setting 
hbase:meta (replicaId=0) location in ZooKeeper as 
abhishekk2.pne.ven.veritas.com,16020,1522486814482
2018-03-31 05:00:24,643 INFO  [PEWorker-6] 
assignment.RegionTransitionProcedure: Dispatch pid=12, ppid=11, 
state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
region=1588230740; rit=OPENING, 
location=abhishekk2.pne.ven.veritas.com,16020,1522486814482
2018-03-31 05:00:24,794 WARN  [RSProcedureDispatcher-pool3-t2] 
procedure.RSProcedureDispatcher: Failed dispatch to 
server=abhishekk2.pne.ven.veritas.com,16020,1522486814482 try=0
java.io.IOException: Call to abhishekk2.pne.ven.veritas.com/10.210.62.29:16020 failed on local exception: java.io.IOException: Can not send request because relogin is in progress.
    at org.apache.hadoop.hbase.ipc.IPCUtil.wrapException(IPCUtil.java:180)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient.onCallFinished(AbstractRpcClient.java:390)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient.access$100(AbstractRpcClient.java:95)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient$3.run(AbstractRpcClient.java:410)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient$3.run(AbstractRpcClient.java:406)
    at org.apache.hadoop.hbase.ipc.Call.callComplete(Call.java:103)
    at org.apache.hadoop.hbase.ipc.Call.setException(Call.java:118)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient.callMethod(AbstractRpcClient.java:423)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient.callBlockingMethod(AbstractRpcClient.java:328)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient.access$200(AbstractRpcClient.java:95)
    at org.apache.hadoop.hbase.ipc.AbstractRpcClient$BlockingRpcChannelImplementation.callBlockingMethod(AbstractRpcClient.java:571)
    at 

[jira] [Commented] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-03-31 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421410#comment-16421410
 ] 

Abhishek Kulkarni commented on HBASE-19287:
---

I can't paste the whole log here, and I am not able to attach a file. Is there 
a way to attach a file?
I am pasting the few failures that fit.
===

at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeProcedure(ProcedureExecutor.java:1221)
at org.apache.hadoop.hbase.procedure2.ProcedureExecutor.access$800(ProcedureExecutor.java:75)
at org.apache.hadoop.hbase.procedure2.ProcedureExecutor$WorkerThread.run(ProcedureExecutor.java:1740)
2018-03-31 05:00:22,646 INFO  [PEWorker-1] master.SplitLogManager: finished 
splitting (more than or equal to) 0 bytes in 0 log files in 
[hdfs://abhishekk1.pne.ven.veritas.com:54310/hbase/WALs/abhishekk2.pne.ven.veritas.com,16020,1522480363659-splitting]
 in 14ms
2018-03-31 05:00:22,654 INFO  [PEWorker-1] procedure.RecoverMetaProcedure: 
pid=11, state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure 
failedMetaServer=null, splitWal=true; Retaining meta assignment to 
server=abhishekk2.pne.ven.veritas.com,16020,1522480363659
2018-03-31 05:00:22,660 INFO  [PEWorker-1] procedure2.ProcedureExecutor: 
Initialized subprocedures=[{pid=12, ppid=11, 
state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure table=hbase:meta, 
region=1588230740, target=abhishekk2.pne.ven.veritas.com,16020,1522480363659}]
2018-03-31 05:00:22,677 INFO  [PEWorker-2] procedure.MasterProcedureScheduler: 
pid=12, ppid=11, state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure 
table=hbase:meta, region=1588230740, 
target=abhishekk2.pne.ven.veritas.com,16020,1522480363659, 
hbase:meta,,1.1588230740
2018-03-31 05:00:22,684 INFO  [PEWorker-2] assignment.AssignProcedure: Starting 
pid=12, ppid=11, state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure 
table=hbase:meta, region=1588230740, 
target=abhishekk2.pne.ven.veritas.com,16020,1522480363659; rit=OFFLINE, 
location=abhishekk2.pne.ven.veritas.com,16020,1522480363659; 
forceNewPlan=false, retain=false
2018-03-31 05:00:22,861 INFO  [master/abhishekk1:16000] 
balancer.BaseLoadBalancer: Reassigned 1 regions. 1 retained the pre-restart 
assignment. 
2018-03-31 05:00:22,863 INFO  [PEWorker-3] assignment.AssignProcedure: Early 
suspend! pid=12, ppid=11, state=RUNNABLE:REGION_TRANSITION_DISPATCH; 
AssignProcedure table=hbase:meta, region=1588230740, 
target=abhishekk2.pne.ven.veritas.com,16020,1522480363659; rit=OFFLINE, 
location=abhishekk2.pne.ven.veritas.com,16020,1522486814482
2018-03-31 05:00:23,629 INFO  [HBase-Metrics2-1] 
impl.GlobalMetricRegistriesAdapter: Registering 
Master,sub=Coprocessor.Master.CP_org.apache.hadoop.hbase.security.access.AccessController
 Metrics about HBase MasterObservers
2018-03-31 05:00:24,237 INFO  [PEWorker-4] zookeeper.MetaTableLocator: Setting 
hbase:meta (replicaId=0) location in ZooKeeper as 
abhishekk2.pne.ven.veritas.com,16020,1522486814482
2018-03-31 05:00:24,243 INFO  [PEWorker-4] 
assignment.RegionTransitionProcedure: Dispatch pid=12, ppid=11, 
state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
region=1588230740, target=abhishekk2.pne.ven.veritas.com,16020,1522480363659; 
rit=OPENING, location=abhishekk2.pne.ven.veritas.com,16020,1522486814482
2018-03-31 05:00:24,482 WARN  [RSProcedureDispatcher-pool3-t1] 
procedure.RSProcedureDispatcher: Failed dispatch to 
server=abhishekk2.pne.ven.veritas.com,16020,1522486814482 try=0
java.io.IOException: Call to abhishekk2.pne.ven.veritas.com/10.210.62.29:16020 failed on local exception: java.io.IOException: org.apache.hbase.thirdparty.io.netty.handler.codec.DecoderException: org.apache.hadoop.ipc.RemoteException(javax.security.sasl.SaslException): GSS initiate failed
at org.apache.hadoop.hbase.ipc.IPCUtil.wrapException(IPCUtil.java:180)
at org.apache.hadoop.hbase.ipc.AbstractRpcClient.onCallFinished(AbstractRpcClient.java:390)
at org.apache.hadoop.hbase.ipc.AbstractRpcClient.access$100(AbstractRpcClient.java:95)
at org.apache.hadoop.hbase.ipc.AbstractRpcClient$3.run(AbstractRpcClient.java:410)
at org.apache.hadoop.hbase.ipc.AbstractRpcClient$3.run(AbstractRpcClient.java:406)
at org.apache.hadoop.hbase.ipc.Call.callComplete(Call.java:103)
at org.apache.hadoop.hbase.ipc.Call.setException(Call.java:118)
at org.apache.hadoop.hbase.ipc.BufferCallBeforeInitHandler.userEventTriggered(BufferCallBeforeInitHandler.java:92)
at org.apache.hbase.thirdparty.io.netty.channel.AbstractChannelHandlerContext.invokeUserEventTriggered(AbstractChannelHandlerContext.java:329)
at org.apache.hbase.thirdparty.io.netty.channel.AbstractChannelHandlerContext.invokeUserEventTriggered(AbstractChannelHandlerContext.java:315)
at 

[jira] [Comment Edited] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-03-31 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421195#comment-16421195
 ] 

Abhishek Kulkarni edited comment on HBASE-19287 at 3/31/18 7:06 AM:


I still see this issue with HBase 2.0.0-beta2. I am using Kerberos 
security and am not able to start the HBase Master; it fails with the above error.

ZooKeeper version: zookeeper-3.5.3-beta

Log line:

INFO  [PEWorker-2] procedure.RecoverMetaProcedure: pid=29, 
state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure 
failedMetaServer=null, splitWal=true; Retaining meta assignment to server=

 

I am blocked on this. Can anyone help with resolution steps?
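
The log line quoted in this comment ends at `server=` with no value, whereas the matching line in the issue description names a full server. The following is a small sketch for pulling that field out of a master log so the two cases can be told apart. The helper name `retained_meta_server` is an assumption made for illustration, and the empty value may simply have been trimmed when the line was pasted.

```python
import re

# Captures whatever follows "server=" in a RecoverMetaProcedure record.
RETAIN = re.compile(r"Retaining meta assignment to\s+server=(\S*)")


def retained_meta_server(line):
    """Return the server named in a 'Retaining meta assignment' record.

    Returns None when the line is not such a record and an empty string
    when the record carries no server value.
    """
    m = RETAIN.search(line)
    return m.group(1) if m else None


# The line as quoted in the comment (wrapping removed).
FROM_COMMENT = (
    "INFO  [PEWorker-2] procedure.RecoverMetaProcedure: pid=29, "
    "state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure "
    "failedMetaServer=null, splitWal=true; Retaining meta assignment to server="
)

if __name__ == "__main__":
    server = retained_meta_server(FROM_COMMENT)
    print("no server value in record" if server == "" else server)
```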


was (Author: ahk123in):
I still see this issue with HBase 2.0.0-beta2. I am using Kerberos 
security and am not able to start the HBase Master; it fails with the above error.

ZooKeeper version: zookeeper-3.5.3-beta

Log line:

INFO  [PEWorker-2] procedure.RecoverMetaProcedure: pid=29, 
state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure 
failedMetaServer=null, splitWal=true; Retaining meta assignment to server=

> master hangs forever if RecoverMeta send assign meta region request to target 
> server fail
> -
>
> Key: HBASE-19287
> URL: https://issues.apache.org/jira/browse/HBASE-19287
> Project: HBase
>  Issue Type: Bug
>  Components: proc-v2
>Affects Versions: 2.0.0
>Reporter: Yi Liang
>Assignee: Yi Liang
>Priority: Major
> Fix For: 2.0.0-beta-1, 2.0.0
>
> Attachments: HBASE-19287-master-v3.patch, 
> HBASE-19287-master-v3.patch, HBASE-19287-master-v4.patch, 
> hbase-19287-master-v2.patch, master.patch
>
>
> 2017-11-10 19:26:56,019 INFO  [ProcExecWrkr-1] 
> procedure.RecoverMetaProcedure: pid=138, 
> state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure 
> failedMetaServer=null, splitWal=true; Retaining meta assignment to 
> server=hadoop-slave1.hadoop,16020,1510341981454
> 2017-11-10 19:26:56,029 INFO  [ProcExecWrkr-1] procedure2.ProcedureExecutor: 
> Initialized subprocedures=[{pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454}]
> 2017-11-10 19:26:56,067 INFO  [ProcExecWrkr-2] 
> procedure.MasterProcedureScheduler: pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454 hbase:meta 
> hbase:meta,,1.1588230740
> 2017-11-10 19:26:56,071 INFO  [ProcExecWrkr-2] assignment.AssignProcedure: 
> Start pid=139, ppid=138, state=RUNNABLE:REGION_TRANSITION_QUEUE; 
> AssignProcedure table=hbase:meta, region=1588230740, 
> target=hadoop-slave1.hadoop,16020,1510341981454; rit=OFFLINE, 
> location=hadoop-slave1.hadoop,16020,1510341981454; forceNewPlan=false, 
> retain=false
> 2017-11-10 19:26:56,224 INFO  [ProcExecWrkr-4] zookeeper.MetaTableLocator: 
> Setting hbase:meta (replicaId=0) location in ZooKeeper as 
> hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:56,230 INFO  [ProcExecWrkr-4] 
> assignment.RegionTransitionProcedure: Dispatch pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454; 
> rit=OPENING, location=hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:56,382 INFO  [ProcedureDispatcherTimeoutThread] 
> procedure.RSProcedureDispatcher: Using procedure batch rpc execution for 
> serverName=hadoop-slave2.hadoop,16020,1510341988652 version=2097152
> 2017-11-10 19:26:57,542 INFO  [main-EventThread] 
> zookeeper.RegionServerTracker: RegionServer ephemeral node deleted, 
> processing expiration [hadoop-slave2.hadoop,16020,1510341988652]
> 2017-11-10 19:26:57,543 INFO  [main-EventThread] master.ServerManager: Master 
> doesn't enable ServerShutdownHandler during initialization, delay expiring 
> server hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:58,875 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Registering 
> server=hadoop-slave1.hadoop,16020,1510342016106
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Registering 
> server=hadoop-slave2.hadoop,16020,1510342023184
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Triggering server recovery; existingServer 
> hadoop-slave2.hadoop,16020,1510341988652 looks stale, new 
> server:hadoop-slave2.hadoop,16020,1510342023184
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Master doesn't 

[jira] [Commented] (HBASE-19287) master hangs forever if RecoverMeta send assign meta region request to target server fail

2018-03-31 Thread Abhishek Kulkarni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-19287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421195#comment-16421195
 ] 

Abhishek Kulkarni commented on HBASE-19287:
---

I still see this issue with HBase 2.0.0-beta2. I am using Kerberos 
security and am not able to start the HBase Master; it fails with the above error.

ZooKeeper version: zookeeper-3.5.3-beta

Log line:

INFO  [PEWorker-2] procedure.RecoverMetaProcedure: pid=29, 
state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure 
failedMetaServer=null, splitWal=true; Retaining meta assignment to server=

> master hangs forever if RecoverMeta send assign meta region request to target 
> server fail
> -
>
> Key: HBASE-19287
> URL: https://issues.apache.org/jira/browse/HBASE-19287
> Project: HBase
>  Issue Type: Bug
>  Components: proc-v2
>Affects Versions: 2.0.0
>Reporter: Yi Liang
>Assignee: Yi Liang
>Priority: Major
> Fix For: 2.0.0-beta-1, 2.0.0
>
> Attachments: HBASE-19287-master-v3.patch, 
> HBASE-19287-master-v3.patch, HBASE-19287-master-v4.patch, 
> hbase-19287-master-v2.patch, master.patch
>
>
> 2017-11-10 19:26:56,019 INFO  [ProcExecWrkr-1] 
> procedure.RecoverMetaProcedure: pid=138, 
> state=RUNNABLE:RECOVER_META_ASSIGN_REGIONS; RecoverMetaProcedure 
> failedMetaServer=null, splitWal=true; Retaining meta assignment to 
> server=hadoop-slave1.hadoop,16020,1510341981454
> 2017-11-10 19:26:56,029 INFO  [ProcExecWrkr-1] procedure2.ProcedureExecutor: 
> Initialized subprocedures=[{pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454}]
> 2017-11-10 19:26:56,067 INFO  [ProcExecWrkr-2] 
> procedure.MasterProcedureScheduler: pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_QUEUE; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454 hbase:meta 
> hbase:meta,,1.1588230740
> 2017-11-10 19:26:56,071 INFO  [ProcExecWrkr-2] assignment.AssignProcedure: 
> Start pid=139, ppid=138, state=RUNNABLE:REGION_TRANSITION_QUEUE; 
> AssignProcedure table=hbase:meta, region=1588230740, 
> target=hadoop-slave1.hadoop,16020,1510341981454; rit=OFFLINE, 
> location=hadoop-slave1.hadoop,16020,1510341981454; forceNewPlan=false, 
> retain=false
> 2017-11-10 19:26:56,224 INFO  [ProcExecWrkr-4] zookeeper.MetaTableLocator: 
> Setting hbase:meta (replicaId=0) location in ZooKeeper as 
> hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:56,230 INFO  [ProcExecWrkr-4] 
> assignment.RegionTransitionProcedure: Dispatch pid=139, ppid=138, 
> state=RUNNABLE:REGION_TRANSITION_DISPATCH; AssignProcedure table=hbase:meta, 
> region=1588230740, target=hadoop-slave1.hadoop,16020,1510341981454; 
> rit=OPENING, location=hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:56,382 INFO  [ProcedureDispatcherTimeoutThread] 
> procedure.RSProcedureDispatcher: Using procedure batch rpc execution for 
> serverName=hadoop-slave2.hadoop,16020,1510341988652 version=2097152
> 2017-11-10 19:26:57,542 INFO  [main-EventThread] 
> zookeeper.RegionServerTracker: RegionServer ephemeral node deleted, 
> processing expiration [hadoop-slave2.hadoop,16020,1510341988652]
> 2017-11-10 19:26:57,543 INFO  [main-EventThread] master.ServerManager: Master 
> doesn't enable ServerShutdownHandler during initialization, delay expiring 
> server hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:26:58,875 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Registering 
> server=hadoop-slave1.hadoop,16020,1510342016106
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Registering 
> server=hadoop-slave2.hadoop,16020,1510342023184
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Triggering server recovery; existingServer 
> hadoop-slave2.hadoop,16020,1510341988652 looks stale, new 
> server:hadoop-slave2.hadoop,16020,1510342023184
> 2017-11-10 19:27:05,832 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> master.ServerManager: Master doesn't enable ServerShutdownHandler during 
> initialization, delay expiring server hadoop-slave2.hadoop,16020,1510341988652
> 2017-11-10 19:27:49,815 INFO  
> [RpcServer.default.FPBQ.Fifo.handler=29,queue=2,port=16000] 
> client.RpcRetryingCallerImpl: started=38594 ms ago, cancelled=false, 
> msg=org.apache.hadoop.hbase.NotServingRegionException: hbase:meta,,1 is not 
> online on hadoop-slave2.hadoop,16020,1510342023184
> at 
> org.apache.hadoop.hbase.regionserver.HRegionServer.getRegionByEncodedName(HRegionServer.java:3290)