[jira] [Commented] (HBASE-9634) HBase Table few regions are not getting recovered from the 'Transition'/'OFFLINE state'

shankarlingayya (JIRA) Mon, 23 Sep 2013 20:40:53 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-9634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13775963#comment-13775963
 ]


shankarlingayya commented on HBASE-9634:
----------------------------------------

how many zookeeper instances to you have? Do you kill it/them?
===> 1 zookeeper instance, not killed at all, it is running normal

what kill is it? unplug, kill 9, kill 15?
===> used the 'hbase-daemon.sh stop regionserver'

What's the replication factor, and do you kill the datanode(s)?
===> Replication factor is 3, no datanode is killed, it is running normally

after step 5, do you flush the table?
===> No flush is done, data are added successfuly in the hbase

what are the logs of the region server which it failing to open the region?
===> We added huge number of records, but only the below region is transition, 
but all the other regions are fine.

2013-09-23 18:28:06,610 INFO 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Opening of 
region {NAME => 't1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.', 
STARTKEY => 'row507465', ENDKEY => 'row508987', ENCODED => 
2d9fad2aee78103f928d8c7fe16ba6cd,} failed, marking as FAILED_OPEN in ZK

2013-09-23 18:46:12,160 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
Instantiated t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.
2013-09-23 18:46:12,160 ERROR 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Failed open of 
region=t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd., starting 
to roll back the global memstore size.



                
> HBase Table few regions are not getting recovered from the 
> 'Transition'/'OFFLINE state' 
> ----------------------------------------------------------------------------------------
>
>                 Key: HBASE-9634
>                 URL: https://issues.apache.org/jira/browse/HBASE-9634
>             Project: HBase
>          Issue Type: Bug
>          Components: master, regionserver
>    Affects Versions: 0.94.11
>         Environment: SuSE11
>            Reporter: shankarlingayya
>
> {noformat}
> HBase Table few regions are not getting recovered from the 
> 'Transition'/'OFFLINE state'
> Test Procedure:
> 1. Setup Non HA Hadoop Cluster with two nodes (Node1-XX.XX.XX.XX,  
> Node2-YY.YY.YY.YY)
> 2. Install Zookeeper & HRegionServer in Node-1
> 3. Install HMaster & HRegionServer in Node-2
> 4. From Node2 create HBase Table ( table name 't1' with one column family 
> 'cf1' )
> 5. Perform addrecord 99649 rows 
> 6. Perform kill and restart of Node1 Region Server & Node2 Region Server in a 
> loop for 10-20 times
> 2013-09-23 18:28:06,610 INFO 
> org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Opening of 
> region {NAME => 
> 't1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.', STARTKEY => 
> 'row507465', ENDKEY => 'row508987', ENCODED => 
> 2d9fad2aee78103f928d8c7fe16ba6cd,} failed, marking as FAILED_OPEN in ZK
> 2013-09-23 18:46:12,160 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
> Instantiated t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd.
> 2013-09-23 18:46:12,160 ERROR 
> org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Failed open 
> of region=t1,row507465,1379937224590.2d9fad2aee78103f928d8c7fe16ba6cd., 
> starting to roll back the global memstore size.
> {noformat}  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-9634) HBase Table few regions are not getting recovered from the 'Transition'/'OFFLINE state'

Reply via email to