[ 
https://issues.apache.org/jira/browse/HDDS-6697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17532602#comment-17532602
 ] 

Jie Yao edited comment on HDDS-6697 at 5/6/22 3:28 AM:
-------------------------------------------------------

thanks [~sodonnell] for opening this jira.  i think before this,  we need to 
refactor RM further。

in HDDS-6572, i extract movescheduler as a standalone class. meanwhile, i add a 
inflightActionsManager to manager all the inflight actions.

after this , we can create a new Jira to extract all the command sending 
function(Eg. sendDeleteCommand, SendReplicateCommand) to a standalone class or 
function. it will receive a list of commands(may be returned by detect 
container health), and then fire the event to send them.

also , i have a doc for detecting EC container health, and have uploaded it , 
please take a look


was (Author: jacksonyao):
thanks [~sodonnell] for opening this jira.  i think before this,  we need to 
refactor RM further。

in [HDDS-6572|https://issues.apache.org/jira/browse/HDDS-6572], i extract 
movescheduler as a standalone class. meanwhile, i add a inflightActionsManager 
to manager all the inflight actions.

after this , we can create a new Jira to extract all the command sending 
function(Eg. sendDeleteCommand, SendReplicateCommand) to a standalone class or 
function. it will receive a list of commands(may be returned by detect 
container health), and then fire the event to send them.

also , i have a doc for detecting EC container health, and have uploaded it 

> EC: ReplicationManager - create class to detect container health issues
> -----------------------------------------------------------------------
>
>                 Key: HDDS-6697
>                 URL: https://issues.apache.org/jira/browse/HDDS-6697
>             Project: Apache Ozone
>          Issue Type: Sub-task
>            Reporter: Stephen O'Donnell
>            Assignee: Stephen O'Donnell
>            Priority: Major
>         Attachments: EC Container Group Health Check.pdf
>
>
> Define an interface to allow a single containers health to be checked. The 
> check health method should receive as parameters everything it needs to check 
> the container health (eg the ContainerInfo, ContainerReplica list, ... ?) and 
> return a status indicating the health of the container, eg HEALTHY, 
> UNDER_REPLICATED, OVER_REPLICATED ...
> The status object could also container some commands needing send to the 
> command queue, eg to close the container, delete a replica, force close etc.
> The idea here is to create a standalone health check class for EC with few 
> dependencies so it can be tested in isolation easily.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to