[ 
https://issues.apache.org/jira/browse/HADOOP-16140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16777820#comment-16777820
 ] 

Adam Antal commented on HADOOP-16140:
-------------------------------------

Thanks for the summary, [~sodonnell].

As I go through the options you suggested I agree that option 2 looks the most 
promising.

My main point is that it's not accidental that most OS-es have the same kind of 
deleting mechanism: first move the files the to trash, and then delete them 
permanently. We can skipTrash-delete files from anywhere, but have to wait the 
trash for checkpointing to be able to remove something? It is kinda confusing, 
yes! As this jira aims to work on that, I believe the {{dfs -expunge 
-immediate}} is the closest what it's missing from the trash system, and would 
benefit the most from customer's point of view - so I'd vote on that. What is 
your opinion?

I think the other improvements should be delt with in some follow-up jiras, but 
they're still important and has to be discussed. 

> Add emptyTrash option to purge trash immediately
> ------------------------------------------------
>
>                 Key: HADOOP-16140
>                 URL: https://issues.apache.org/jira/browse/HADOOP-16140
>             Project: Hadoop Common
>          Issue Type: Improvement
>          Components: fs
>    Affects Versions: 3.3.0
>            Reporter: Stephen O'Donnell
>            Assignee: Stephen O'Donnell
>            Priority: Major
>         Attachments: HDFS-14200.001.patch
>
>
> I have always felt the HDFS trash is missing a simple way to empty the 
> current users trash immediately. We have "expunge" but in my experience 
> supporting clusters, end users find this confusing. When most end users run 
> expunge, they really want to empty their trash immediately and get confused 
> when expunge does not do this.
> This can result in users performing somewhat dangerous "skipTrash" operations 
> on the trash to free up space. The alternative, which most users will not 
> figure out on their own is:
> # Run the expunge command once - this will move the current folder to a 
> checkpoint and remove any old checkpoints older than the retention interval
> # Wait over 1 minute and then run expunge again, overriding fs.trash.interval 
> to 1 minute using the following command hadoop fs -Dfs.trash.interval=1 
> -expunge.
> With this Jira I am proposing to add a extra command, "hdfs dfs -emptyTrash" 
> that purges everything in the logged in users Trash directories immediately.
> How would the community feel about adding this new option? I will upload a 
> patch for comments.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to