[
https://issues.apache.org/jira/browse/IMPALA-10594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17313683#comment-17313683
]
ASF subversion and git services commented on IMPALA-10594:
----------------------------------------------------------
Commit 9adb093ae056f1bb7fcd14ba138fd23517648226 in impala's branch
refs/heads/master from Thomas Tauber-Marshall
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=9adb093 ]
IMPALA-10594: Handle failed coordinators in admissiond
This patch adds a statestore callback for the admissiond that monitors
for coordinators that have been removed from the cluster membership
and releases all of the resources for queries running on those
coordinators.
Testing:
- Added a custom cluster test that kills a coordinator and verifies
that resources for queries running on it are eventually released.
Change-Id: I883f323bb765680ef24b3c3f51fb209dea15f0b0
Reviewed-on: http://gerrit.cloudera.org:8080/17209
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>
> Handle failed coordinators in admissiond
> ----------------------------------------
>
> Key: IMPALA-10594
> URL: https://issues.apache.org/jira/browse/IMPALA-10594
> Project: IMPALA
> Issue Type: Sub-task
> Components: Backend
> Affects Versions: Impala 4.0
> Reporter: Thomas Tauber-Marshall
> Assignee: Thomas Tauber-Marshall
> Priority: Major
>
> Currently, when a coordinator fails the admissiond will not detect this and
> the resources being used by queries on that coordinator will never be
> released. We should monitor the cluster membership through the statestore and
> have the admissiond detect failed coordinators and cancel all of their
> queries.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]