[ 
https://issues.apache.org/jira/browse/HDFS-12742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303653#comment-16303653
 ] 

Weiwei Yang commented on HDFS-12742:
------------------------------------

Hi [~shashikant]

Thanks for revisiting this.

bq.  i think the better approach here would be to iterate through all the keys 
and build the container list in KSM and send container deletion request to SCM 
to delete the containers specific to the particular KSM instance directly at 
SCM instead of sending key deletion request.

I can't see the difference of this approach comparing to directly sending key 
deletion requests, they both need to iterate all keys. If a KSM has millions of 
keys, it will be a very slow process. And it might need more than 1 day or 2 
until all data actually gets deleted on datanodes. That was why I suggested to 
operate on metadata level, that will be much faster.

Thanks


> Add support for KSM --expunge command
> -------------------------------------
>
>                 Key: HDFS-12742
>                 URL: https://issues.apache.org/jira/browse/HDFS-12742
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>    Affects Versions: HDFS-7240
>            Reporter: Shashikant Banerjee
>            Assignee: Shashikant Banerjee
>             Fix For: HDFS-7240
>
>         Attachments: HDFS-12742-HDFS-7240.001.patch, 
> HDFS-12742-HDFS-7240.002.patch
>
>
> KSM --expunge will delete all the data from the data nodes for all the keys 
> in the KSM db. 
> User will have no control over the deletion.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to