[
https://issues.apache.org/jira/browse/ACCUMULO-4663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17271594#comment-17271594
]
Jeffrey Manno commented on ACCUMULO-4663:
-----------------------------------------
This appears to be a duplicate of -ACCUMULO-4410- which was fixed after this
issue was created. That solution removed ShutdownTServer.requestedShutdown and
retains only the current set of servers on each update call for
serversToShutodwn.
> ShutdownTServer attempts shutdown over and over again, can end up blocking
> migrations
> -------------------------------------------------------------------------------------
>
> Key: ACCUMULO-4663
> URL: https://issues.apache.org/jira/browse/ACCUMULO-4663
> Project: Accumulo
> Issue Type: Bug
> Affects Versions: 1.7.2
> Reporter: John Vines
> Priority: Major
>
> Also affects 1.7.1
> ACCUMULO-1259 identified a problem with it repeatedly invoking
> master.shutdownTServer. One side effect of this is a race where a server goes
> down, gets removed from the online tablet sets, etc. and then gets re-added
> to the serversToShutdown set. This will cause the balancer to not balance due
> to shutdown in progress and never gets rectified. Only workaround is to
> restart the master (or bring that server back up, I'm guessing).
> ACCUMULO-3897 attempted to fix that problem by attempting shutdown once and
> only once. It does this by setting a local boolean. But because we do not
> reserialize our fate repos between isReady calls, this boolean effectively is
> reset between each check, making it pointless.
> I believe there are 2 problems here- 1 is that
> ShutdownTServer.requestedShutdown is not implemented correctly
> 2 is we should have a mechanism to remove from serversToShutdown any server
> that is not present.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)