xiaojunxiang created HDFS-17498:
-----------------------------------

             Summary: Distcp fails on concat files because the sourceFS's 
checksum does not equal the targetFS's checksum.
                 Key: HDFS-17498
                 URL: https://issues.apache.org/jira/browse/HDFS-17498
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: distcp
    Affects Versions: 3.3.4
            Reporter: xiaojunxiang
         Attachments: image-2024-04-24-15-54-16-253.png, 
image-2024-04-24-15-54-58-047.png, image-2024-04-24-15-55-25-519.png, 
image-2024-04-24-15-55-48-752.png

When we use distcp, the sourceFS's checksum and targetFS's checksum are checked 
for consistency after the file transfer is complete. 

However, for some files produced on the source side by ClientProtocol's concat 
(an RPC method), the block size is less than 128MB (for example, the sourceFS 
file is 10MB+10MB while the targetFS file is 20MB), so the checksums of the 
source and destination sides are inconsistent, which causes distcp to fail.
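
To illustrate why the same bytes can yield different file checksums, here is a minimal sketch of a checksum that depends on block layout. It is a simplified model, not HDFS's actual implementation: the function name block_layout_checksum, the 512-byte CRC chunk size, and the KB-scale sizes standing in for 10MB+10MB and 20MB are all assumptions made for illustration.

```python
import hashlib
import zlib
from typing import Sequence


def block_layout_checksum(data: bytes, block_sizes: Sequence[int],
                          bytes_per_crc: int = 512) -> str:
    """Simplified model (an assumption, not HDFS code) of a file checksum
    built as MD5 over per-block MD5s over per-chunk CRC32s."""
    if sum(block_sizes) != len(data):
        raise ValueError("block sizes must cover the data exactly")
    block_digests = []
    offset = 0
    for size in block_sizes:
        block = data[offset:offset + size]
        offset += size
        # CRC32 of every chunk in this block, concatenated.
        crcs = b"".join(
            zlib.crc32(block[i:i + bytes_per_crc]).to_bytes(4, "big")
            for i in range(0, len(block), bytes_per_crc)
        )
        # One MD5 per block, so block boundaries affect the result.
        block_digests.append(hashlib.md5(crcs).digest())
    # File-level digest over the per-block digests.
    return hashlib.md5(b"".join(block_digests)).hexdigest()


# Same content, different block layout (10KB+10KB vs. one 20KB block).
data = bytes(range(256)) * 80  # 20480 bytes
source = block_layout_checksum(data, [10240, 10240])
target = block_layout_checksum(data, [20480])
print("checksums equal:", source == target)
```

In this model the two layouts hash different inputs at the file level (two block digests versus one), so the results differ even though the content is identical. The DistCp documentation describes -Ddfs.checksum.combine.mode=COMPOSITE_CRC and -skipcrccheck for mismatches of this kind; whether either is suitable for the case reported here has not been verified.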

!image-2024-04-24-15-54-16-253.png!

!image-2024-04-24-15-54-58-047.png!

!image-2024-04-24-15-55-25-519.png!

!image-2024-04-24-15-55-48-752.png!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org
