Ke Han created HDFS-17174:
-----------------------------

             Summary: Side effects of checksum command: wrongly changes the 
modification time of a file
                 Key: HDFS-17174
                 URL: https://issues.apache.org/jira/browse/HDFS-17174
             Project: Hadoop HDFS
          Issue Type: Bug
    Affects Versions: 3.2.4
            Reporter: Ke Han


After starting up a 4-node HDFS cluster running 3.2.4, I executed some 
commands to put a folder from the local fs into HDFS. I used -put -p, which 
preserves access and modification times. 

However, when I use the -checksum command to retrieve a checksum, I noticed 
that it changes the modification time of the file I just uploaded (a side 
effect).
{code:java}
dfs -mkdir /lSFVKIFi
dfs -put -f   /tmp/upfuzz/hdfs/xbutTMQg/GbkVxPvqoc /lSFVKIFi/
dfs -mkdir /lSFVKIFi/PWXVE
dfs -setacl  -k -m acl /PWXVE --set acl2 /
ec -enablePolicy -policy XOR-2-1-1024k
dfsadmin -setBalancerBandwidth 54480 /lSFVKIFi/
dfsadmin -refreshNodes
ec -verifyClusterSetup -policy RS-LEGACY-6-3-1024k
dfs -mv /lSFVKIFi/GbkVxPvqoc /lSFVKIFi/PWXVE
dfsadmin -metasave /lSFVKIFi/PWXVE
dfsadmin -clrQuota /lSFVKIFi/
dfsadmin -setBalancerBandwidth 29653 /lSFVKIFi/
dfs -expunge -immediate
ec -setPolicy -path /lSFVKIFi/ -policy XOR-2-1-1024k
dfs -put -f  -d /tmp/upfuzz/hdfs/LNSEzfJm/z /lSFVKIFi/PWXVE
dfs -expunge -immediate
dfsadmin -setQuota 6117966147645490803 /lSFVKIFi/
ec -unsetPolicy -path /lSFVKIFi/PWXVE
dfs -put -f -p -d /tmp/upfuzz/hdfs/OFcqsksu/vFrDCyPAiJY /lSFVKIFi/
dfsadmin -refreshUserToGroupsMappings
dfsadmin -metasave /lSFVKIFi/PWXVE
dfsadmin -refreshUserToGroupsMappings
dfs -touchz /lSFVKIFi/vFrDCyPAiJY/-setSpaceQuota.txt
dfsadmin -setSpaceQuota 10 -storageType ARCHIVE /lSFVKIFi/
dfs -put   -d /tmp/upfuzz/hdfs/JWahtWfR/Ai /lSFVKIFi/PWXVE/z
dfs -put   -d /tmp/upfuzz/hdfs/vwyIeBXc/CJV /lSFVKIFi/PWXVE
dfsadmin -setBalancerBandwidth 29653 /lSFVKIFi/
dfsadmin -refreshServiceAcl
dfsadmin -clrQuota /lSFVKIFi/
dfs -touchz /lSFVKIFi/WXVE.yaml
dfsadmin -refreshCallQueue
dfsadmin -setBalancerBandwidth 10 /lSFVKIFi/PWXVE

// READ COMMAND
// First read
> dfs -ls   -h  -t -S  -u /lSFVKIFi/vFrDCyPAiJY
Found 2 items
-rw-r--r--   1 root  6345          0 2023-08-30 22:03 /lSFVKIFi/vFrDCyPAiJY/-setSpaceQuota.txt
-rw-r--r--   1 20001 6345       24 K 2023-08-30 18:38 /lSFVKIFi/vFrDCyPAiJY/mfHU

// Execute the checksum
dfs -checksum /lSFVKIFi/PWXVE/z/eJMiAwNEO

// Second read
> dfs -ls   -h  -t -S  -u /lSFVKIFi/vFrDCyPAiJY 
Found 2 items
-rw-r--r--   1 20001 6345       24 K 2023-08-30 22:03 /lSFVKIFi/vFrDCyPAiJY/mfHU
-rw-r--r--   1 root  6345          0 2023-08-30 22:03 /lSFVKIFi/vFrDCyPAiJY/-setSpaceQuota.txt
{code}
The modification time of mfHU has changed: the first read shows 18:38 and the 
second read shows 22:03. However, checksum shouldn't have any side effects 
according to its specification.
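
To make the difference between the two reads easy to spot, here is a minimal sketch that compares two listings and reports the paths whose displayed timestamp changed. The helper names (parse_listing, changed_timestamps) are hypothetical and not part of Hadoop; the two listings are copied from the output above with the mail-wrapped lines rejoined. The listings were produced with -u, which the FileSystem shell documentation describes as showing access time rather than modification time, so the sketch simply compares whichever timestamp column the listing prints.

```python
import re

# One entry of `dfs -ls -h` output: permissions, replication, owner, group,
# size (e.g. "0" or "24 K"), date, time, path. This pattern is an assumption
# based on the listings in this report, not an official format definition.
LS_LINE = re.compile(
    r"^(?P<perm>[-dl][-rwxsStT]{9}[+.]?)\s+"
    r"(?P<repl>\S+)\s+(?P<owner>\S+)\s+(?P<group>\S+)\s+"
    r"(?P<size>\d+(?:\.\d+)?(?:\s[KMGTPE])?)\s+"
    r"(?P<date>\d{4}-\d{2}-\d{2})\s+(?P<time>\d{2}:\d{2})\s+"
    r"(?P<path>/.*)$"
)


def parse_listing(text):
    """Map each listed path to its displayed 'YYYY-MM-DD HH:MM' timestamp."""
    times = {}
    for line in text.splitlines():
        match = LS_LINE.match(line.strip())
        if match:  # skips "Found N items" and any other non-entry line
            times[match.group("path")] = (
                f"{match.group('date')} {match.group('time')}"
            )
    return times


def changed_timestamps(before, after):
    """Return {path: (old, new)} for paths present in both listings
    whose displayed timestamp differs."""
    old, new = parse_listing(before), parse_listing(after)
    return {
        path: (old[path], new[path])
        for path in old
        if path in new and old[path] != new[path]
    }


# Listings copied from the report (wrapped lines rejoined).
FIRST_READ = """Found 2 items
-rw-r--r--   1 root  6345          0 2023-08-30 22:03 /lSFVKIFi/vFrDCyPAiJY/-setSpaceQuota.txt
-rw-r--r--   1 20001 6345       24 K 2023-08-30 18:38 /lSFVKIFi/vFrDCyPAiJY/mfHU
"""

SECOND_READ = """Found 2 items
-rw-r--r--   1 20001 6345       24 K 2023-08-30 22:03 /lSFVKIFi/vFrDCyPAiJY/mfHU
-rw-r--r--   1 root  6345          0 2023-08-30 22:03 /lSFVKIFi/vFrDCyPAiJY/-setSpaceQuota.txt
"""

for path, (old, new) in changed_timestamps(FIRST_READ, SECOND_READ).items():
    print(f"{path}: {old} -> {new}")
```

Run against the two reads above, only mfHU is flagged; the entry for -setSpaceQuota.txt is identical in both listings.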



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org
