[ 
https://issues.apache.org/jira/browse/HDDS-9588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HDDS-9588:
---------------------------------
    Labels: pull-request-available  (was: )

> DN import of container is not safe while replication
> ----------------------------------------------------
>
>                 Key: HDDS-9588
>                 URL: https://issues.apache.org/jira/browse/HDDS-9588
>             Project: Apache Ozone
>          Issue Type: Bug
>            Reporter: Sumit Agrawal
>            Assignee: Sumit Agrawal
>            Priority: Major
>              Labels: pull-request-available
>
> Scenario:
> 1. SCM triggers replication from Src1 to the Target1 DN.
> 2. The replication times out (10 min).
> 3. SCM triggers replication again, from Src2 to the Target1 DN (same target).
> This makes it possible for two imports of the same container to run in 
> parallel or sequentially on the target DN.
>  
> The environment has multiple volumes. Import does not check whether the 
> container is already present or whether an import of the same container is 
> already in progress. Per code analysis, this can cause problems at the 
> target in the scenarios below.
>  # The volume chosen for the first import is V1, and the import succeeds.
>  # The volume chosen for the second import is V2; the import succeeds, but 
> adding the container to the container set is ignored or fails.
> The problem here is that V2 holds a copy of the container and consumes disk 
> space, but that copy is never reported. This has less impact; at most an 
> unknown container is reported.
>  
> When the same volume is chosen for both imports and the imports run in 
> parallel on the target DN:
>  * While the archive is being un-tarred at the volume location, blocks can be 
> deleted or corrupted, because the import performs these steps:
>  ** delete the container folder from the temp location
>  ** un-tar the archive
>  ** move the files to the destination
> So in this case:
> Import 1: has un-tarred the archive and started moving the files
> Import 2: in parallel, deletes the container folder and starts un-tarring
> This can result in missing files and/or corrupted file checksums.
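
One possible way to close both gaps described above is to refuse an import when the container already exists or when another import of the same container ID is still running. The sketch below is illustrative only: ContainerImportGuard, tryImport, and the two in-memory sets are hypothetical names, not Ozone's actual classes or the fix adopted for this issue, and the real container set and import steps are replaced by a plain Runnable.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

/**
 * Hypothetical guard (not Ozone's API): allows at most one import per
 * container ID at a time and rejects imports of containers that exist.
 */
final class ContainerImportGuard {
  // Stand-in for the datanode's container set: IDs already imported.
  private final Set<Long> existingContainers = ConcurrentHashMap.newKeySet();
  // IDs of containers whose import is currently running.
  private final Set<Long> importsInProgress = ConcurrentHashMap.newKeySet();

  /**
   * Runs importTask unless the container already exists or is already being
   * imported. Returns true only if the import actually ran to completion.
   */
  boolean tryImport(long containerId, Runnable importTask) {
    // Set.add is atomic here: only one caller per container ID gets true.
    // Return before the try block so a rejected caller never releases
    // the claim held by the import that is still running.
    if (!importsInProgress.add(containerId)) {
      return false;
    }
    try {
      // A previous import (possibly to another volume) already succeeded.
      if (existingContainers.contains(containerId)) {
        return false;
      }
      // Placeholder for: delete temp folder, un-tar, move to destination.
      importTask.run();
      existingContainers.add(containerId);
      return true;
    } finally {
      // Release the claim even if the import threw an exception.
      importsInProgress.remove(containerId);
    }
  }
}
```

With a guard of this shape, the second replication command for the same container would be rejected up front, whether it arrives while the first import is still running or after it has completed, so neither the duplicate copy on V2 nor the concurrent delete/un-tar on a shared temp folder could occur.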


