[ 
https://issues.apache.org/jira/browse/HDDS-15455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HDDS-15455:
----------------------------------
    Labels: pull-request-available  (was: )

> Implement Custom DataNode Container Directory Discovery and Duplicate 
> Detection
> -------------------------------------------------------------------------------
>
>                 Key: HDDS-15455
>                 URL: https://issues.apache.org/jira/browse/HDDS-15455
>             Project: Apache Ozone
>          Issue Type: Sub-task
>            Reporter: Sreeja
>            Assignee: Sreeja
>            Priority: Major
>              Labels: pull-request-available
>
> Implement logic to traverse all storage volumes configured in 
> *{{hdds.datanode.dir}}* and discover container directories present under the 
> DataNode container storage hierarchy.
> For each discovered container directory:
>  * Extract the container ID from the directory name.
>  * Collect the container directory path, storage volume, and directory size.
>  * Determine the metadata status:
>  ** {{*MISSING_METADATA*}} if {{metadata/\{containerId}.container}} does not 
> exist.
>  ** {{*INVALID_METADATA*}} if the metadata file exists but cannot be parsed, 
> or if the container ID stored in the metadata does not match the 
> directory-name container ID.
>  ** *{{VALID}}* otherwise.
> Store the results as a mapping:
> {{containerId -> List<ContainerOccurrence>}}
> where each occurrence contains the container directory path, volume, size, 
> and metadata status.
> Use this mapping to identify duplicate container directories by detecting 
> container IDs associated with more than one on-disk occurrence across storage 
> volumes on the same DataNode.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to