This is an automated email from the ASF dual-hosted git repository.
sivabalan pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hudi.git
The following commit(s) were added to refs/heads/asf-site by this push:
new e356a32 [HUDI-3496] Add note for S3 Versioned Bucket (#4900)
e356a32 is described below
commit e356a32f5308a7812ad278f2e4dac33fe9117b8b
Author: Vinay Patil <[email protected]>
AuthorDate: Fri Feb 25 09:50:58 2022 +0530
[HUDI-3496] Add note for S3 Versioned Bucket (#4900)
---
website/docs/s3_hoodie.md | 9 ++++++++-
1 file changed, 8 insertions(+), 1 deletion(-)
diff --git a/website/docs/s3_hoodie.md b/website/docs/s3_hoodie.md
index c39c73c..a67de25 100644
--- a/website/docs/s3_hoodie.md
+++ b/website/docs/s3_hoodie.md
@@ -77,4 +77,11 @@ AWS hadoop libraries to add to our classpath
AWS glue data libraries are needed if AWS glue data is used
- com.amazonaws.glue:aws-glue-datacatalog-hive2-client:1.11.0
- - com.amazonaws:aws-java-sdk-glue:1.11.475
\ No newline at end of file
+ - com.amazonaws:aws-java-sdk-glue:1.11.475
+
+## AWS S3 Versioned Bucket
+
+With versioned buckets any object deleted creates a [Delete
Marker](https://docs.aws.amazon.com/AmazonS3/latest/userguide/DeleteMarker.html),
as Hudi cleans up files using [Cleaner
utility](https://hudi.apache.org/docs/hoodie_cleaner) the number of Delete
Markers increases over time.
+It is important to configure the [Lifecycle
Rule](https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-lifecycle-mgmt.html)
correctly
+to clean up these delete markers as the List operation can choke if the number
of delete markers reaches 1000.
+We recommend cleaning up Delete Markers after 1 day in Lifecycle Rule.
\ No newline at end of file