[
https://issues.apache.org/jira/browse/HBASE-15924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15566852#comment-15566852
]
Andrew Purtell commented on HBASE-15924:
----------------------------------------
I took this for a spin.
{{./bin/hbase-daemon.sh autostart}} and {{autorestart}} work differently from
{{start}} in that we don't see a message printed like
{noformat}
echo starting $command, logging to $HBASE_LOGOUT
{noformat}
which led me to believe the command had silently failed to launch the requested
process. Please fix this; it's going to cause confusion.
There is an off-by-one error in the handling of the autostart window retry limit. I
tried this with {{./bin/hbase-daemon.sh --autostart-window-retry-limit 3
autostart regionserver}} and then, in order to stop autostart, had to kill the
regionserver 4 times:
{noformat}
jps -l
kill 11956
jps -l
kill 12760
jps -l
kill 13205
jps -l
kill 13645
{noformat}
until finally:
{noformat}
Tue Oct 11 15:41:20 PDT 2016 Autostart window retry limit: 3 exceeded for given
window size: 0 hours
{noformat}
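To illustrate how a limit of 3 can turn into 4 kills, here is a minimal sketch of one possible cause: whether the initial launch counts against the retry limit. This is not the code from the patch; the function {{kills_needed}} and its arguments are hypothetical and only model the counting.

```shell
#!/usr/bin/env bash
# Hypothetical model, NOT taken from hbase-daemon.sh or the HBASE-15924 patch.
# kills_needed LIMIT COUNT_INITIAL
#   LIMIT          the retry limit (e.g. 3)
#   COUNT_INITIAL  "yes" if the first launch counts against the limit, else "no"
# Prints how many times the process must be killed before the supervisor gives up.
kills_needed() {
  local limit=$1 count_initial=$2
  local launches=0 kills=0
  if [ "$count_initial" = "yes" ]; then
    launches=1                      # the initial start already used one slot
  fi
  while :; do
    kills=$((kills + 1))            # operator kills the running process
    if [ "$launches" -ge "$limit" ]; then
      break                         # limit reached: supervisor gives up
    fi
    launches=$((launches + 1))      # supervisor relaunches the process
  done
  echo "$kills"
}

kills_needed 3 no    # prints 4: only restarts are counted
kills_needed 3 yes   # prints 3: the initial launch is counted too
```

If the intent is that a limit of 3 stops autostart after the third kill, the counting in the script would need to behave like the second call; the first call matches what I observed above.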
> Enhance hbase services autorestart capability to hbase-daemon.sh
> -----------------------------------------------------------------
>
> Key: HBASE-15924
> URL: https://issues.apache.org/jira/browse/HBASE-15924
> Project: HBase
> Issue Type: Improvement
> Affects Versions: 0.98.19
> Reporter: Loknath Priyatham Teja Singamsetty
> Assignee: Loknath Priyatham Teja Singamsetty
> Fix For: 0.98.24
>
> Attachments: HBASE-15924.master.0001.patch,
> HBASE-15924.master.0002.patch, HBASE-15924.master.0003.patch
>
>
> As part of HBASE-5939, autorestart for hbase services was added to
> deal with scenarios where hbase services (master/regionserver/master-backup)
> get killed or go down, leading to unplanned outages. The changes were made
> to hbase-daemon.sh to support the autorestart option.
> However, the autorestart implementation doesn't work in standalone mode and
> also has a few gaps relative to the release notes of
> HBASE-5939. Here is an attempt to redesign and fix the functionality,
> considering all possible use cases with hbase service operations.
> Release Notes of HBASE-5939:
> ------------------------------------------
> When launched with autorestart, HBase processes will automatically restart if
> they are not properly terminated, either by a "stop" command or by a cluster
> stop. To ensure that it does not overload the system when the server itself
> is corrupted and the process cannot be restarted, the server sleeps for 5
> minutes before restarting if it was already started within the previous 5 minutes.
> To use it, launch the process with "bin/start-hbase autorestart". This option
> is not fully compatible with the existing "restart" command: if you ask for a
> restart on a server launched with autorestart, the server will restart but
> the next server instance won't be automatically restarted.