voonhous commented on code in PR #18867:
URL: https://github.com/apache/hudi/pull/18867#discussion_r3315875042


##########
website/docs/blob_unstructured_data.md:
##########
@@ -158,7 +159,8 @@ INSERT INTO media_assets VALUES (
         'reference', named_struct(
             'external_path', 's3://my-bucket/media/container_001.bin',
             'offset',        8388608,       -- byte offset in the container
-            'length',        1073741824     -- number of bytes
+            'length',        1073741824,    -- number of bytes
+            'managed',       false          -- Hudi does not manage this 
external file

Review Comment:
   I think i might have a different mental model from what y'all have. 
   
   - `managed=true`: Cleaner **WILL** delete orphaned blobs/files if they are 
no longer referenced
   - `managed=false`: Cleaner **WILL NOT** delete the orphaned blobs/files.
   
   And managed is only relevant to `OUT_OF_LINE` blobs right?
   
   Nonetheless, this flag is not really used now, we just need to let users 
know what they are. Future blob cleaner implementation requires this flag.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to