ruiliang created HDFS-17589:
-------------------------------

             Summary: hdfs EC data old blk reconstruct old blk not delete
                 Key: HDFS-17589
                 URL: https://issues.apache.org/jira/browse/HDFS-17589
             Project: Hadoop HDFS
          Issue Type: Bug
    Affects Versions: 3.1.1
            Reporter: ruiliang

The cluster was previously unstable: DataNodes kept losing their connections and recovering, which triggered a large amount of EC block reconstruction. Many of the old blocks were not cleaned up correctly afterwards. Has this already been fixed? If so, which patch do I need to apply? Thank you.

The detailed check log follows:

{code:java}
====datanode delete data ec blk ?
grep blk_-9223372036371044656 hadoop-hdfs-root-datanode-fs-hiido-dn-12-66-111.hiido.host.xxx.com.log
2024-07-18 17:25:07,879 INFO datanode.DataNode (DataXceiver.java:writeBlock(738)) - Receiving BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793 src: /10.12.66.111:25066 dest: /10.12.66.111:1019
2024-07-18 17:25:17,396 INFO datanode.DataNode (StripedBlockReconstructor.java:run(86)) - ok EC reconstruct striped block: BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793 blockId: -9223372036371044656
2024-07-18 17:25:17,396 INFO datanode.DataNode (DataXceiver.java:writeBlock(914)) - Received BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793 src: /10.12.66.111:25066 dest: /10.12.66.111:1019 of size 193986560
2024-07-18 17:25:25,465 INFO impl.FsDatasetAsyncDiskService (FsDatasetAsyncDiskService.java:deleteAsync(225)) - Scheduling blk_-9223372036371044656_1688858793 replica FinalizedReplica, blk_-9223372036371044656_1688858793, FINALIZED getBlockURI() = file:/data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656 for deletion
2024-07-18 17:25:25,746 INFO impl.FsDatasetAsyncDiskService (FsDatasetAsyncDiskService.java:run(333)) - Deleted BP-1822992414-10.12.65.48-1660893388633 blk_-9223372036371044656_1688858793 URI file:/data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656

=============my config
dfs.blockreport.intervalMsec =21600000

============namenode3 log
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18 04:34:39,523 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18 04:34:40,131 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18 10:34:38,950 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18 10:34:39,559 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log:2024-07-18 16:34:38,564 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log:2024-07-18 16:34:39,190 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 04:34:39,462 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 04:34:40,083 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 10:34:39,686 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 10:34:40,295 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 16:34:39,667 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 16:34:40,301 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 22:34:38,187 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17 22:34:38,794 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019

=====namenode2 log active
grep blk_-9223372036371044656 hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn2.hiido.host.xxx.com.log.10
2024-07-18 17:25:04,786 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
2024-07-18 17:25:05,703 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019

========namenode2 log
root@fs-hiido-yycluster06-yynn1:/data/logs/hadoop/hdfs# grep blk_-9223372036371044656 hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn1.hiido.host.xxx.com.log
2024-07-18 07:20:41,525 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
2024-07-18 07:20:42,049 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
2024-07-18 13:20:40,726 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019
2024-07-18 13:20:41,251 WARN BlockStateChange (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to storageType DISK on node 10.12.66.154:1019

================
hdfs fsck -fs hdfs://yycluster06 -blockId blk_-9223372036371044656
Connecting to namenode via http://fs-hiido-yycluster06-yynn2.hiido.host.xxx.com:50070/fsck?ugi=hdfs&blockId=blk_-9223372036371044656+&path=%2F
FSCK started by hdfs (auth:KERBEROS_SSL) from /10.12.19.4 at Thu Jul 18 17:57:06 CST 2024

Block Id: blk_-9223372036371044656
Block belongs to: /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz
No. of Expected Replica: 5
No. of live Replica: 5
No. of excess Replica: 0
No. of stale Replica: 0
No. of decommissioned Replica: 0
No. of decommissioning Replica: 0
No. of corrupted Replica: 0
Block replica on datanode/rack: fs-hiido-dn-12-66-225.hiido.host.xxxyy.com/4F08-02-03 is HEALTHY
Block replica on datanode/rack: fs-hiido-dn-12-67-38.hiido.host.xxxyy.com/4F08-02-15 is HEALTHY
Block replica on datanode/rack: fs-hiido-dn-12-66-191.hiido.host.xxx.com/4F08-12-06 is HEALTHY
Block replica on datanode/rack: fs-hiido-dn-12-67-5.hiido.host.xxxyy.com/4F08-12-09 is HEALTHY
Block replica on datanode/rack: fs-hiido-dn-12-66-154.hiido.host.xxxyy.com/4F08-02-13 is HEALTHY

hdfs fsck -fs hdfs://yycluster06 /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz -files -blocks -locations
/hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz 581647843 bytes, erasure-coded: policy=RS-3-2-1024k, 1 block(s): OK
0. BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793 len=581647843 Live_repl=5 [ blk_-9223372036371044656:DatanodeInfoWithStorage[10.12.66.154:1019,DS-4b66fe61-93ca-4f8d-8fe0-e00f2ed09e82,DISK], blk_-9223372036371044655:DatanodeInfoWithStorage[10.12.67.5:1019,DS-af799695-8e9b-4884-a741-7a0742db6f79,DISK], blk_-9223372036371044654:DatanodeInfoWithStorage[10.12.66.191:1019,DS-cef171fa-8e8e-43bb-9fd1-aab84ea046cd,DISK], blk_-9223372036371044653:DatanodeInfoWithStorage[10.12.67.38:1019,DS-d2346a26-14c4-41e9-b349-e198ab5b684e,DISK], blk_-9223372036371044652:DatanodeInfoWithStorage[10.12.66.225:1019,DS-13c19c1d-9221-45cc-919f-b54ada9cab15,DISK]]

========================================================================================================================================
root@fs-hiido-dn-12-66-154:/data/logs/hadoop/hdfs# ll /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
-rw-r--r-- 1 hdfs hdfs 193986560 Feb 26 19:54 /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656
-rw-r--r-- 1 hdfs hdfs 1515527 Feb 26 19:54 /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656_1688858793.meta
-rw-r--r-- 1 hdfs hdfs 193986560 Feb 27 21:49 /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
-rw-r--r-- 1 hdfs hdfs 1515527 Feb 27 21:49 /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652_1688858793.meta

root@fs-hiido-dn-12-67-5:/home/liangrui# ll /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
-rw-r--r-- 1 hdfs hdfs 193986560 Nov 29 2023 /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044655
-rw-r--r-- 1 hdfs hdfs 1515527 Nov 29 2023 /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044655_1688858793.meta

root@fs-hiido-dn-12-66-191:/home/liangrui# ll /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
-rw-r--r-- 1 hdfs hdfs 193674723 Aug 30 2023 /data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044654
-rw-r--r-- 1 hdfs hdfs 1513091 Aug 30 2023 /data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044654_1688858793.meta

root@fs-hiido-dn-12-67-38:/home/liangrui# ll /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
-rw-r--r-- 1 hdfs hdfs 193986560 Apr 19 18:20 /data1/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044653
-rw-r--r-- 1 hdfs hdfs 1515527 Apr 19 18:20 /data1/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044653_1688858793.meta

root@fs-hiido-dn-12-66-225:/home/liangrui# ll /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
-rw-r--r-- 1 hdfs hdfs 193986560 Dec 1 2023 /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
-rw-r--r-- 1 hdfs hdfs 1515527 Dec 1 2023 /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652_1688858793.meta

========
root@fs-hiido-dn-12-66-154:/data/logs/hadoop/hdfs# md5sum /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
b661b6d711d753a82c3bf42bb2ceec51  /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652

10.12.66.225
md5sum /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
b661b6d711d753a82c3bf42bb2ceec51  /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org
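
The directory listings in this report can be cross-checked mechanically. The following is a minimal sketch, not part of the original report, that assumes the usual HDFS striped block ID layout: the low 4 bits of an internal block ID hold its index within the block group, and the remaining bits identify the group. The helper names (split_striped_id, find_duplicates) are hypothetical, and the (datanode, block ID) pairs are transcribed by hand from the ll output above.

```python
from collections import defaultdict

# Assumption: the low 4 bits of a striped block ID are the internal block index.
INDEX_MASK = 0xF


def split_striped_id(block_id):
    """Return (block_group_id, internal_index) for a striped internal block ID."""
    return block_id & ~INDEX_MASK, block_id & INDEX_MASK


def find_duplicates(pairs):
    """Map (group, index) -> datanodes, keeping only entries held by more than one node."""
    holders = defaultdict(list)
    for node, block_id in pairs:
        holders[split_striped_id(block_id)].append(node)
    return {key: nodes for key, nodes in holders.items() if len(nodes) > 1}


# (datanode, block ID) pairs transcribed from the ll listings in this report.
listing = [
    ("fs-hiido-dn-12-66-154", -9223372036371044656),
    ("fs-hiido-dn-12-66-154", -9223372036371044652),
    ("fs-hiido-dn-12-67-5", -9223372036371044655),
    ("fs-hiido-dn-12-66-191", -9223372036371044654),
    ("fs-hiido-dn-12-67-38", -9223372036371044653),
    ("fs-hiido-dn-12-66-225", -9223372036371044652),
]

for (group, index), nodes in find_duplicates(listing).items():
    print(f"blk_{group} internal index {index} is stored on {nodes}")
```

Under that assumption, the only internal block that appears on more than one DataNode is index 4 (blk_-9223372036371044652), held by both fs-hiido-dn-12-66-154 and fs-hiido-dn-12-66-225. That agrees with the identical md5sum values above, while fsck lists only 10.12.66.225 as the location for that internal block.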