[
https://issues.apache.org/jira/browse/HBASE-9634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13775963#comment-13775963
]
shankarlingayya commented on HBASE-9634:
----------------------------------------
how many zookeeper instances to you have? Do you kill it/them?
===> 1 zookeeper instance, not killed at all, it is running normal
what kill is it? unplug, kill 9, kill 15?
===> used the 'hbase-daemon.sh stop regionserver'
What's the replication factor, and do you kill the datanode(s)?
===> Replication factor is 3, no datanode is killed, it is running normally
after step 5, do you flush the table?
===> No flush is done, data are added successfuly in the hbase
what are the logs of the region server which it failing to open the region?
===> We added huge number of records, but only the below region is transition,
but all the other regions are fine.
2013-09-23 18:28:06,610 INFO
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Opening of
region {NAME => 't1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.',
STARTKEY => 'row507465', ENDKEY => 'row508987', ENCODED =>
2d9fad2aee78103f928d8c7fe16ba6cd,} failed, marking as FAILED_OPEN in ZK
2013-09-23 18:46:12,160 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
Instantiated t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.
2013-09-23 18:46:12,160 ERROR
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Failed open of
region=t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd., starting
to roll back the global memstore size.
> HBase Table few regions are not getting recovered from the
> 'Transition'/'OFFLINE state'
> ----------------------------------------------------------------------------------------
>
> Key: HBASE-9634
> URL: https://issues.apache.org/jira/browse/HBASE-9634
> Project: HBase
> Issue Type: Bug
> Components: master, regionserver
> Affects Versions: 0.94.11
> Environment: SuSE11
> Reporter: shankarlingayya
>
> {noformat}
> HBase Table few regions are not getting recovered from the
> 'Transition'/'OFFLINE state'
> Test Procedure:
> 1. Setup Non HA Hadoop Cluster with two nodes (Node1-XX.XX.XX.XX,
> Node2-YY.YY.YY.YY)
> 2. Install Zookeeper & HRegionServer in Node-1
> 3. Install HMaster & HRegionServer in Node-2
> 4. From Node2 create HBase Table ( table name 't1' with one column family
> 'cf1' )
> 5. Perform addrecord 99649 rows
> 6. Perform kill and restart of Node1 Region Server & Node2 Region Server in a
> loop for 10-20 times
> 2013-09-23 18:28:06,610 INFO
> org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Opening of
> region {NAME =>
> 't1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.', STARTKEY =>
> 'row507465', ENDKEY => 'row508987', ENCODED =>
> 2d9fad2aee78103f928d8c7fe16ba6cd,} failed, marking as FAILED_OPEN in ZK
> 2013-09-23 18:46:12,160 DEBUG org.apache.hadoop.hbase.regionserver.HRegion:
> Instantiated t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.
> 2013-09-23 18:46:12,160 ERROR
> org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Failed open
> of region=t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.,
> starting to roll back the global memstore size.
> {noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira