[
https://issues.apache.org/jira/browse/HBASE-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13692974#comment-13692974
]
Jean-Marc Spaggiari commented on HBASE-8716:
--------------------------------------------
Is there a JIRA for the region_mover speedup? Rolling restart takes a while...
I have 2500 regions, and it takes about 1 second to move a region to another
server, and another 1 second to get it back when the server is restarted...
which means 5000 seconds for a test, which is about 1h30! I looked into the
attached patch and it's not addressing the speed issue.
13/06/25 08:23:23 INFO region_mover: Moving region 9bccee9621bb69b85c26f56d56907164 (425 of 747) to server=node2,60020,1370440537805
13/06/25 08:23:24 INFO region_mover: Moving region 042cff36b09b84ea269316807920214b (426 of 747) to server=node7,60020,1370440536028
13/06/25 08:23:25 INFO region_mover: Moving region 8019e2101a52f70f0a452ce21759fefb (427 of 747) to server=node3,60020,1372016527324
13/06/25 08:23:26 INFO region_mover: Moving region 2073f3c12eab4e6aa361c8bb99d7c73f (428 of 747) to server=node4,60020,1370440535633
13/06/25 08:23:27 INFO region_mover: Moving region 8c337b81abfaac7799eeaa363e2e36a6 (429 of 747) to server=node7,60020,1370440536028
13/06/25 08:23:28 INFO region_mover: Moving region 216f53a77be536613d8e9d048e3a2445 (430 of 747) to server=node4,60020,1370440535633
13/06/25 08:23:29 INFO region_mover: Moving region b0f0c2a9fadf7d6914ed886653ad9b96 (431 of 747) to server=node3,60020,1372016527324
13/06/25 08:23:30 INFO region_mover: Moving region 9ff6d8efe351ee6ffb2f94b27f59848b (432 of 747) to server=node1,60020,1370440537389
13/06/25 08:23:31 INFO region_mover: Moving region c42acf8e36a8872d5c85e4304a8c9aef (433 of 747) to server=node4,60020,1370440535633
13/06/25 08:23:32 INFO region_mover: Moving region d32c316c66b81e602315c30f5aebf35f (434 of 747) to server=node7,60020,1370440536028
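
The estimate above can be checked against the log excerpt. This is only a rough sketch, assuming moves are strictly sequential and every region is moved twice (off the server and back). The function names are hypothetical and are not part of region_mover.rb; the three sample lines are copied from the log above.

```python
import re
from datetime import datetime

# Three entries copied from the region_mover output above (425, 426 and 434).
LOG = """\
13/06/25 08:23:23 INFO region_mover: Moving region 9bccee9621bb69b85c26f56d56907164 (425 of 747) to server=node2,60020,1370440537805
13/06/25 08:23:24 INFO region_mover: Moving region 042cff36b09b84ea269316807920214b (426 of 747) to server=node7,60020,1370440536028
13/06/25 08:23:32 INFO region_mover: Moving region d32c316c66b81e602315c30f5aebf35f (434 of 747) to server=node7,60020,1370440536028
"""

# Captures the timestamp and the "(N of M)" position of each move.
LINE_RE = re.compile(
    r"^(\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) INFO region_mover: "
    r"Moving region \w+ \((\d+) of \d+\)",
    re.MULTILINE,
)


def seconds_per_move(log: str) -> float:
    """Average seconds per move between the first and last logged entry."""
    entries = [
        (datetime.strptime(ts, "%y/%m/%d %H:%M:%S"), int(index))
        for ts, index in LINE_RE.findall(log)
    ]
    (first_time, first_index), (last_time, last_index) = entries[0], entries[-1]
    elapsed = (last_time - first_time).total_seconds()
    return elapsed / (last_index - first_index)


def estimate_total_seconds(regions: int, per_move: float, moves_per_region: int = 2) -> float:
    """Sequential estimate: each region is moved away and then moved back."""
    return regions * per_move * moves_per_region


if __name__ == "__main__":
    per_move = seconds_per_move(LOG)
    total = estimate_total_seconds(2500, per_move)
    hours, minutes = int(total // 3600), int(total % 3600 // 60)
    print(f"{per_move:.1f} s per move, {total:.0f} s total (~{hours}h{minutes:02d})")
```

With these numbers the sequential cost grows linearly with the region count, so the time per move and the number of moves done at once are the only things left to change.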
> Fixups/Improvements for graceful_stop.sh/region_mover.rb
> --------------------------------------------------------
>
> Key: HBASE-8716
> URL: https://issues.apache.org/jira/browse/HBASE-8716
> Project: HBase
> Issue Type: Improvement
> Reporter: stack
> Assignee: stack
> Fix For: 0.95.2
>
> Attachments: 8716.txt
>
>
> It has been a while since these scripts were touched. Giving them a spring
> cleaning and seeing if we can make them return error codes on failure (it
> seems the previous style was that the operator would watch the output and
> react to it, but I see cases where tools want to call these scripts and want
> a return code to indicate whether the rolling upgrade worked or not). Also,
> see if we can make the rolling restart faster, since one-by-one, while
> minimally disruptive and 'safe', is slow on clusters of hundreds of nodes.