[ 
https://issues.apache.org/jira/browse/HADOOP-15193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16641997#comment-16641997
 ] 

Steve Loughran commented on HADOOP-15193:
-----------------------------------------

DDB batch delete just takes the list of operations and runs through them in 
sequence, retrying if needed. There is no speedup compared to making individual 
requests.
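
As a rough illustration of the client side of such a call, the sketch below 
splits a list of keys into chunks and resubmits whatever comes back 
unprocessed. It assumes the DynamoDB BatchWriteItem limit of 25 requests per 
call; the class name, the sendBatch callback and the maxAttempts parameter are 
hypothetical stand-ins, not the S3Guard or AWS SDK API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

/** Hypothetical sketch; not the S3Guard or AWS SDK API. */
class BatchDeleteSketch {
    /** BatchWriteItem accepts at most 25 put/delete requests per call. */
    static final int MAX_BATCH = 25;

    /**
     * Deletes all keys in chunks of at most MAX_BATCH, resubmitting any keys
     * the store reports as unprocessed.
     *
     * @param sendBatch stand-in for the real batch call; returns the keys it
     *                  did not process (empty list when everything succeeded)
     * @param maxAttempts assumed retry budget per chunk
     * @return the number of batch calls made
     */
    static int deleteAll(List<String> keys,
                         Function<List<String>, List<String>> sendBatch,
                         int maxAttempts) {
        int calls = 0;
        for (int start = 0; start < keys.size(); start += MAX_BATCH) {
            int end = Math.min(start + MAX_BATCH, keys.size());
            List<String> pending = new ArrayList<>(keys.subList(start, end));
            int attempts = 0;
            while (!pending.isEmpty()) {
                if (attempts++ >= maxAttempts) {
                    throw new IllegalStateException(
                        "gave up with " + pending.size() + " keys unprocessed");
                }
                // A real client would back off before resubmitting.
                pending = sendBatch.apply(pending);
                calls++;
            }
        }
        return calls;
    }
}
```

With 60 keys and a store that never rejects anything, this makes three calls 
(25 + 25 + 10). The batching reduces round trips from the client, which is a 
separate matter from how the store executes the operations inside each batch.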

We do need a call in the metastore API though, as it can be a bit cleverer 
about the operation.

In particular: if I delete a directory, do I need to explicitly add deleted 
markers to all the children, or would a deleted marker on the dir be enough? If 
the latter, you could be very efficient & not create deleted file markers, just 
those for the directories.
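
To make the question concrete, here is a minimal in-memory sketch of the 
"marker on the dir only" option. Everything in it (the class, its methods, the 
path-to-flag map) is hypothetical and is not how the metastore API or the DDB 
implementation is defined; it only shows what the lookup side would have to do 
if children are left without their own deleted markers.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical in-memory metastore; names are illustrative, not the S3Guard API. */
class DirTombstoneSketch {
    // path -> true if the entry is a deleted marker, false if it is live
    private final Map<String, Boolean> entries = new HashMap<>();

    /** Record a live file or directory entry. */
    void put(String path) {
        entries.put(path, false);
    }

    /** Write a single deleted marker on the directory; children are not touched. */
    void deleteDir(String dir) {
        entries.put(dir, true);
    }

    /** A path is visible only if it has a live entry and no ancestor carries a deleted marker. */
    boolean exists(String path) {
        if (!Boolean.FALSE.equals(entries.get(path))) {
            return false; // unknown path, or the path itself is marked deleted
        }
        for (String p = parent(path); p != null; p = parent(p)) {
            if (Boolean.TRUE.equals(entries.get(p))) {
                return false; // hidden by a deleted ancestor
            }
        }
        return true;
    }

    /** Parent of an absolute path, or null once the root is reached. */
    static String parent(String path) {
        int i = path.lastIndexOf('/');
        return i <= 0 ? null : path.substring(0, i);
    }

    /** Number of deleted markers currently stored. */
    int tombstoneCount() {
        return (int) entries.values().stream().filter(Boolean::booleanValue).count();
    }

    public static void main(String[] args) {
        DirTombstoneSketch ms = new DirTombstoneSketch();
        ms.put("/a");
        ms.put("/a/b");
        ms.put("/a/b/c.txt");
        ms.deleteDir("/a");
        System.out.println(ms.exists("/a/b/c.txt") + " " + ms.tombstoneCount());
    }
}
```

The saving is on the write side: one marker regardless of how many children 
the directory has. The cost moves to reads, which now walk the ancestors of 
every path. In this sketch, re-creating the directory later would also make 
the stale children visible again, so a real design would need something extra 
(timestamps, say) to rule that out.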


> add bulk delete call to metastore API & DDB impl
> ------------------------------------------------
>
>                 Key: HADOOP-15193
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15193
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 3.0.0
>            Reporter: Steve Loughran
>            Priority: Major
>
> recursive dir delete (and any future bulk delete API like HADOOP-15191) 
> benefits from using the DDB bulk table delete call, which takes a list of 
> deletes and executes. Hopefully this will offer better perf. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
