[ 
https://issues.apache.org/jira/browse/HBASE-4442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ming Ma updated HBASE-4442:
---------------------------

    Description: 
We use a script that restart RS on a rolling fashion and found couple new 
issues in HBase in the area of server shutdown and region assignments. Running 
such test on a cluster will help to verify changes made in the area of server 
shutdown and regions assignment. The script could do the followings:

1. Based on the cluster config file, get the list of RSs and master server.
2. Restart RSs and Master periodically. The parameters like the sleep time 
between stop and start, the percentage of machines down at at given time, etc. 
should be configurable. 
3. Use hbck or other ways to verify the cluster is moving. There will be 
inconsistency at a given time when you have lots of regions in transition. So 
that is ok. Script should mark test failure only when things aren't moving, for 
example, .META. isn't online for > timeoutmonitor period.


  was:
We use a script that restart RS on a rolling fashion and found couple new 
issues in HBase in the area of server shutdown and region assignments. Running 
such test on a cluster will help to verify changes made in the area of server 
shutdown and regions assignment. The script could do the followings:

1. Based on the cluster config file, get the list of RSs and master server.
2. Restart RSs and Master periodically. The parameters like the sleep time 
between stop and start, the percentage of machines down at at given time, etc. 
should be configurable. 
3. Use hbck or other ways to verify the cluster is moving. There will be 
inconsistency at a given time when you have lots of regions in transition. So 
that is ok. Script should mark test failure only when things are moving, for 
example, .META. isn't online for > timeoutmonitor period.



good point, Andy.

> Provide test script to do rolling restart of RSs, master and do verification
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-4442
>                 URL: https://issues.apache.org/jira/browse/HBASE-4442
>             Project: HBase
>          Issue Type: Improvement
>          Components: test
>            Reporter: Ming Ma
>            Assignee: Michael Weng
>
> We use a script that restart RS on a rolling fashion and found couple new 
> issues in HBase in the area of server shutdown and region assignments. 
> Running such test on a cluster will help to verify changes made in the area 
> of server shutdown and regions assignment. The script could do the followings:
> 1. Based on the cluster config file, get the list of RSs and master server.
> 2. Restart RSs and Master periodically. The parameters like the sleep time 
> between stop and start, the percentage of machines down at at given time, 
> etc. should be configurable. 
> 3. Use hbck or other ways to verify the cluster is moving. There will be 
> inconsistency at a given time when you have lots of regions in transition. So 
> that is ok. Script should mark test failure only when things aren't moving, 
> for example, .META. isn't online for > timeoutmonitor period.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to