[
https://issues.apache.org/jira/browse/HADOOP-18544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17684537#comment-17684537
]
ASF GitHub Bot commented on HADOOP-18544:
-----------------------------------------
HarshitGupta11 opened a new pull request, #5354:
URL: https://github.com/apache/hadoop/pull/5354
### Description of PR
In applications that perform many single-file deletions in the same directory,
a lot of time is wasted in `maybeCreateFakeParentDirectory()`.
Proposed: add an option to disable the probe, for use by applications that
can tolerate parent directories sometimes disappearing after a cleanup.
File-by-file deletion is still woefully inefficient because of the HEAD
request on every file, but there is no need to amplify the damage.
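
To make the trade-off concrete, here is a minimal sketch using a toy in-memory object store. It is not the S3A implementation: the class `FakeParentProbeSketch`, the flag `probeParentOnDelete`, and the one-request-per-step accounting are all hypothetical simplifications chosen for illustration, and no real configuration key is implied.

```java
import java.util.TreeSet;

/**
 * Toy in-memory model of an object store. This is NOT the S3A code:
 * the class, the flag, and the request accounting are hypothetical.
 */
public class FakeParentProbeSketch {
    private final TreeSet<String> keys = new TreeSet<>();
    private final boolean probeParentOnDelete; // hypothetical option
    private int requests = 0;

    public FakeParentProbeSketch(boolean probeParentOnDelete) {
        this.probeParentOnDelete = probeParentOnDelete;
    }

    /** Setup helper: adds an object without counting a request. */
    public void put(String key) {
        keys.add(key);
    }

    /** A directory "exists" if a marker object or any child key sits under its prefix. */
    public boolean dirExists(String dir) {
        String prefix = dir.endsWith("/") ? dir : dir + "/";
        String next = keys.ceiling(prefix);
        return next != null && next.startsWith(prefix);
    }

    /** Deletes one file, optionally probing for and recreating the parent marker. */
    public void deleteFile(String key) {
        requests++;                          // HEAD on the file itself
        if (!keys.remove(key)) {
            return;                          // nothing to delete
        }
        requests++;                          // DELETE of the object
        String parent = key.substring(0, key.lastIndexOf('/') + 1);
        if (probeParentOnDelete && !parent.isEmpty()) {
            requests++;                      // probe: does the parent still exist?
            if (!dirExists(parent)) {
                requests++;                  // PUT of an empty directory marker
                keys.add(parent);
            }
        }
    }

    public int requests() {
        return requests;
    }

    public static void main(String[] args) {
        for (boolean probe : new boolean[] {true, false}) {
            FakeParentProbeSketch store = new FakeParentProbeSketch(probe);
            for (int i = 0; i < 3; i++) {
                store.put("data/part-" + i);
            }
            for (int i = 0; i < 3; i++) {
                store.deleteFile("data/part-" + i);
            }
            System.out.println("probe=" + probe
                + " requests=" + store.requests()
                + " parentExists=" + store.dirExists("data"));
        }
    }
}
```

In this toy model, deleting three files costs 10 requests with the probe enabled and 6 with it disabled. The price of skipping the probe is that `data/` no longer exists once its last file has been deleted.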
### How was this patch tested?
The patch was tested against an S3 bucket in the us-west-2 region.
### For code changes:
## Caveats:
Parent directories might disappear after a delete or a rename.
## What breaks:
The FileContext rename tests fail because S3AFileSystem and FileContext
use different probes and apply different rules.
- [ ] Does the title of this PR start with the corresponding JIRA issue id
(e.g. 'HADOOP-17799. Your PR title ...')?
- [ ] Object storage: have the integration tests been executed and the
endpoint declared according to the connector-specific documentation?
- [ ] If adding new dependencies to the code, are these dependencies
licensed in a way that is compatible for inclusion under [ASF
2.0](http://www.apache.org/legal/resolved.html#category-a)?
- [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`,
`NOTICE-binary` files?
> S3A: add option to disable probe for dir marker recreation on delete/rename.
> ----------------------------------------------------------------------------
>
> Key: HADOOP-18544
> URL: https://issues.apache.org/jira/browse/HADOOP-18544
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/s3
> Affects Versions: 3.3.4
> Reporter: Steve Loughran
> Assignee: Harshit Gupta
> Priority: Major
>
> In applications that perform many single-file deletions in the same directory,
> a lot of time is wasted in {{maybeCreateFakeParentDirectory()}}.
> Proposed: add an option to disable the probe, for use by applications that
> can tolerate parent directories sometimes disappearing after a cleanup.
> File-by-file deletion is still woefully inefficient because of the HEAD request
> on every file, but there is no need to amplify the damage.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)