[jira] [Commented] (HDDS-896) Handle over replicated containers in SCM

Anu Engineer (JIRA) Wed, 02 Jan 2019 14:16:01 -0800


    [ 
https://issues.apache.org/jira/browse/HDDS-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16732469#comment-16732469
 ]


Anu Engineer commented on HDDS-896:
-----------------------------------

[~nandakumar131]

+1, Thanks for a very good patch, Feel free to commit at will. I have some 
minor nits. leaving those here since I wrote them down already.
 # nit: ReplicaManager.java: Add some info traces when we start delete/replica 
operations? It is kind of useful to have these statements in the log.
 #  nit: ReplicaManager.java: There are some classes that has a Java doc 
comment that says -- Add JavaDoc. :)
 # A future JIRA suggestion-- The time that we collect in 
CloseContainerCommandHandler.java, it might be good to report back to SCM. That 
way we can build a average time to close and std.dev etc for this operation. 
Same comment for other commands in the DN.
 # It is new year and I am just back from a long vacation. Hence a trick 
question for you. Probably not something that we need to worry at all, I am 
just thinking aloud. Feel free to ignore this comment completely. Is it 
possible for the SCM to be confused and send conflicting commands to a DN. For 
example, what happens if I send -- CloseContainer, ReplicateContainer and 
DeleteContainer together? Is the data node guaranteed to process only one 
command at a time? should we introduce the same kind of locking that we have in 
SCM inside a datanode ? A locking is needed, if and only if we take the 
position that we will allow completely parallel operations to proceed in 
parallel, Then we need to worry about making a mistake and having these threads 
pick up a command that conflicts. Let us talk some time. Please note that this 
comment has no bearing at all on this patch.

> Handle over replicated containers in SCM
> ----------------------------------------
>
>                 Key: HDDS-896
>                 URL: https://issues.apache.org/jira/browse/HDDS-896
>             Project: Hadoop Distributed Data Store
>          Issue Type: Improvement
>          Components: SCM
>            Reporter: Nanda kumar
>            Assignee: Nanda kumar
>            Priority: Major
>         Attachments: HDDS-896.000.patch, HDDS-896.001.patch
>
>
> When SCM detects that a container is over-replicated, it has to delete some 
> replicas to bring the number of replicas to match the required value. If the 
> container is in QUASI_CLOSED state, we should check the {{originNodeId}} 
> field while choosing the replica to delete.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HDDS-896) Handle over replicated containers in SCM

Reply via email to