[
https://issues.apache.org/jira/browse/HDDS-7098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17700975#comment-17700975
]
Neil Joshi commented on HDDS-7098:
----------------------------------
Looks like we would like to capture the replica health state for unhealthy
containers. This is something that is not currently captured here and trying
to use the replica hostname (fqdn or nic) or bcsid (last seen) is neither
consistent nor reliable to differentiate healthy from unhealthy containers.
[~deveshsingh] , it looks like this ticket can be resolved if for each
unhealthy container listed, within the replica listing to also include the
replica health state. For each replica in container set, provide the state of
the container (can be CLOSED, UNHEALTHY). This would be under the recon api,
{code:java}
api/v1/containers/unhealthy {code}
> Provide a way for admin to identify all unhealthy container replicas
> --------------------------------------------------------------------
>
> Key: HDDS-7098
> URL: https://issues.apache.org/jira/browse/HDDS-7098
> Project: Apache Ozone
> Issue Type: Sub-task
> Reporter: Ethan Rose
> Assignee: Devesh Kumar Singh
> Priority: Major
> Attachments: MissingContainers.png, image-2023-03-02-16-01-07-814.png
>
>
> Currently UNHEALTHY is a state that a container replica can be in
> (ContainerReplicaProto#State), but not a state that the container can be in
> overall (LifeCycleState). This means {{ozone admin container list}} has no
> info about unhealthy containers, because it currently does not print replica
> information. [Recon's
> API|https://ozone.apache.org/docs/current/interface/reconapi.html] and UI
> does not expose replica information either. The only way to determine
> unhealthy containers is to run {{ozone admin container info <ID>}} for a
> container that is already suspected to have unhealthy replicas. This jira
> aims to provide a way to identify and filter container replica states,
> through either Recon's UI, Recon's REST API, or client CLI.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]