[ 
https://issues.apache.org/jira/browse/HDFS-17589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ruiliang updated HDFS-17589:
----------------------------
    Attachment: FindDirtyBlk-1.py
                DiskBlkAPI-1.py

> HDFS EC: old blk not deleted after new blk is reconstructed
> -----------------------------------------------------------
>
>                 Key: HDFS-17589
>                 URL: https://issues.apache.org/jira/browse/HDFS-17589
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 3.1.1
>            Reporter: ruiliang
>            Priority: Major
>         Attachments: DiskBlkAPI-1.py, DiskBlkAPI.py, FindDirtyBlk-1.py, 
> FindDirtyBlk.py
>
>
> The cluster was previously faulty: DataNodes kept losing their connections 
> and recovering, which triggered a lot of EC block reconstruction, but many of 
> the old blocks were not cleaned up correctly. Has this been fixed? Which 
> patch do I need to apply? Thank you.
> The detailed check log follows.
>  
> ok:     blk_-9223372036371044652  in 10.12.66.225  
> {color:#de350b}error:  blk_-9223372036371044652 in  
> 10.12.66.154(/data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652)
>  {color}
> {color:#de350b}Why was it not deleted?{color}
>  
> {code:java}
> ====datanode delete data ec blk ?
>  grep blk_-9223372036371044656  
> hadoop-hdfs-root-datanode-fs-hiido-dn-12-66-111.hiido.host.xxx.com.log
> 2024-07-18 17:25:07,879 INFO  datanode.DataNode 
> (DataXceiver.java:writeBlock(738)) - Receiving 
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793 
> src: /10.12.66.111:25066 dest: /10.12.66.111:1019
> 2024-07-18 17:25:17,396 INFO  datanode.DataNode 
> (StripedBlockReconstructor.java:run(86)) - ok EC reconstruct striped block: 
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793  
> blockId: -9223372036371044656
> 2024-07-18 17:25:17,396 INFO  datanode.DataNode 
> (DataXceiver.java:writeBlock(914)) - Received 
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793 
> src: /10.12.66.111:25066 dest: /10.12.66.111:1019 of size 193986560
> 2024-07-18 17:25:25,465 INFO  impl.FsDatasetAsyncDiskService 
> (FsDatasetAsyncDiskService.java:deleteAsync(225)) - Scheduling 
> blk_-9223372036371044656_1688858793 replica FinalizedReplica, 
> blk_-9223372036371044656_1688858793, FINALIZED
>   getBlockURI()     = 
> file:/data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656
>  for deletion
> 2024-07-18 17:25:25,746 INFO  impl.FsDatasetAsyncDiskService 
> (FsDatasetAsyncDiskService.java:run(333)) - Deleted 
> BP-1822992414-10.12.65.48-1660893388633 blk_-9223372036371044656_1688858793 
> URI 
> file:/data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656
> =============my config
> dfs.blockreport.intervalMsec    =21600000
> ============namenode3 log
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
>  04:34:39,523 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
>  04:34:40,131 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
>  10:34:38,950 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-18
>  10:34:39,559 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log:2024-07-18
>  16:34:38,564 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log:2024-07-18
>  16:34:39,190 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  04:34:39,462 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  04:34:40,083 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  10:34:39,686 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  10:34:40,295 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  16:34:39,667 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  16:34:40,301 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  22:34:38,187 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn3.hiido.host.xxxyy.com.log.1:2024-07-17
>  22:34:38,794 WARN  BlockStateChange (BlockManager.java:addStoredBlock(3238)) 
> - BLOCK* addStoredBlock: block blk_-9223372036371044656_1688858793 moved to 
> storageType DISK on node 10.12.66.154:1019
> =====namenode2 log active
>  grep blk_-9223372036371044656 
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn2.hiido.host.xxx.com.log.10
> 2024-07-18 17:25:04,786 WARN  BlockStateChange 
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block 
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node 
> 10.12.66.154:1019
> 2024-07-18 17:25:05,703 WARN  BlockStateChange 
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block 
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node 
> 10.12.66.154:1019
> ========namenode1 log
> root@fs-hiido-yycluster06-yynn1:/data/logs/hadoop/hdfs# grep 
> blk_-9223372036371044656 
> hadoop-hdfs-namenode-fs-hiido-yycluster06-yynn1.hiido.host.xxx.com.log
> 2024-07-18 07:20:41,525 WARN  BlockStateChange 
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block 
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node 
> 10.12.66.154:1019
> 2024-07-18 07:20:42,049 WARN  BlockStateChange 
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block 
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node 
> 10.12.66.154:1019
> 2024-07-18 13:20:40,726 WARN  BlockStateChange 
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block 
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node 
> 10.12.66.154:1019
> 2024-07-18 13:20:41,251 WARN  BlockStateChange 
> (BlockManager.java:addStoredBlock(3238)) - BLOCK* addStoredBlock: block 
> blk_-9223372036371044656_1688858793 moved to storageType DISK on node 
> 10.12.66.154:1019
> ================
> hdfs fsck -fs hdfs://yycluster06 -blockId blk_-9223372036371044656
> Connecting to namenode via 
> http://fs-hiido-yycluster06-yynn2.hiido.host.xxx.com:50070/fsck?ugi=hdfs&blockId=blk_-9223372036371044656+&path=%2F
> FSCK started by hdfs (auth:KERBEROS_SSL) from /10.12.19.4 at Thu Jul 18 
> 17:57:06 CST 2024
> Block Id: blk_-9223372036371044656
> Block belongs to: 
> /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz
> No. of Expected Replica: 5
> No. of live Replica: 5
> No. of excess Replica: 0
> No. of stale Replica: 0
> No. of decommissioned Replica: 0
> No. of decommissioning Replica: 0
> No. of corrupted Replica: 0
> Block replica on datanode/rack: 
> fs-hiido-dn-12-66-225.hiido.host.xxxyy.com/4F08-02-03 is HEALTHY
> Block replica on datanode/rack: 
> fs-hiido-dn-12-67-38.hiido.host.xxxyy.com/4F08-02-15 is HEALTHY
> Block replica on datanode/rack: 
> fs-hiido-dn-12-66-191.hiido.host.xxx.com/4F08-12-06 is HEALTHY
> Block replica on datanode/rack: 
> fs-hiido-dn-12-67-5.hiido.host.xxxyy.com/4F08-12-09 is HEALTHY
> Block replica on datanode/rack: 
> fs-hiido-dn-12-66-154.hiido.host.xxxyy.com/4F08-02-13 is HEALTHY
> hdfs fsck -fs hdfs://yycluster06  
> /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz
>  -files -blocks -locations
> /hive_warehouse/yydw.db/dwv_event_detail_mob_quality_day/dt=2021-08-10/product_id=171/part-00210-f6fac929-f172-45cf-9fb7-0aa2f68c545e.c000.gz
>  581647843 bytes, erasure-coded: policy=RS-3-2-1024k, 1 block(s):  OK
> 0. 
> BP-1822992414-10.12.65.48-1660893388633:blk_-9223372036371044656_1688858793 
> len=581647843 Live_repl=5  [
> blk_-9223372036371044656:DatanodeInfoWithStorage[10.12.66.154:1019,DS-4b66fe61-93ca-4f8d-8fe0-e00f2ed09e82,DISK],
>  
> blk_-9223372036371044655:DatanodeInfoWithStorage[10.12.67.5:1019,DS-af799695-8e9b-4884-a741-7a0742db6f79,DISK],
>  
> blk_-9223372036371044654:DatanodeInfoWithStorage[10.12.66.191:1019,DS-cef171fa-8e8e-43bb-9fd1-aab84ea046cd,DISK],
>  
> blk_-9223372036371044653:DatanodeInfoWithStorage[10.12.67.38:1019,DS-d2346a26-14c4-41e9-b349-e198ab5b684e,DISK],
>  
> blk_-9223372036371044652:DatanodeInfoWithStorage[10.12.66.225:1019,DS-13c19c1d-9221-45cc-919f-b54ada9cab15,DISK]]
> ========================================================================================================================================
> root@fs-hiido-dn-12-66-154:/data/logs/hadoop/hdfs# ll  
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Feb 26 19:54 
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656
> -rw-r--r-- 1 hdfs hdfs   1515527 Feb 26 19:54 
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044656_1688858793.meta
> -rw-r--r-- 1 hdfs hdfs 193986560 Feb 27 21:49 
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> -rw-r--r-- 1 hdfs hdfs   1515527 Feb 27 21:49 
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652_1688858793.meta
> root@fs-hiido-dn-12-67-5:/home/liangrui# ll  
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Nov 29  2023 
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044655
> -rw-r--r-- 1 hdfs hdfs   1515527 Nov 29  2023 
> /data12/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044655_1688858793.meta
> root@fs-hiido-dn-12-66-191:/home/liangrui# ll  
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193674723 Aug 30  2023 
> /data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044654
> -rw-r--r-- 1 hdfs hdfs   1513091 Aug 30  2023 
> /data4/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044654_1688858793.meta
> root@fs-hiido-dn-12-67-38:/home/liangrui# ll  
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Apr 19 18:20 
> /data1/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044653
> -rw-r--r-- 1 hdfs hdfs   1515527 Apr 19 18:20 
> /data1/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044653_1688858793.meta
> root@fs-hiido-dn-12-66-225:/home/liangrui# ll  
> /data*/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir*/subdir*/blk_-922337203637104465*
> -rw-r--r-- 1 hdfs hdfs 193986560 Dec  1  2023 
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> -rw-r--r-- 1 hdfs hdfs   1515527 Dec  1  2023 
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652_1688858793.meta
> ========
> root@fs-hiido-dn-12-66-154:/data/logs/hadoop/hdfs# md5sum 
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> b661b6d711d753a82c3bf42bb2ceec51  
> /data3/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> 10.12.66.225
> md5sum 
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
> b661b6d711d753a82c3bf42bb2ceec51  
> /data10/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633/current/finalized/subdir21/subdir6/blk_-9223372036371044652
>  {code}
>  
>  

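The file names in the listings above differ only in their last digit. As far as I understand HDFS striped block IDs, an internal block's ID is the block group ID plus the block's index in the group, with the index held in the low 4 bits. A minimal Python sketch of that arithmetic follows. The constant and function names are hypothetical and chosen for illustration; they are not taken from the attached scripts.

```python
import re
from typing import Tuple

# Assumption: the low 4 bits of a striped block ID hold the index of the
# internal block within its block group (up to 16 blocks per group).
BLOCK_GROUP_INDEX_MASK = 0xF

# Matches "blk_<id>" in a bare file name, a "<id>_<genstamp>" name, or a path.
BLOCK_NAME_RE = re.compile(r"blk_(-?\d+)")


def parse_block_id(name: str) -> int:
    """Extract the numeric block ID from a name such as 'blk_-123_456.meta'."""
    match = BLOCK_NAME_RE.search(name)
    if match is None:
        raise ValueError(f"no block ID found in {name!r}")
    return int(match.group(1))


def split_striped_block_id(block_id: int) -> Tuple[int, int]:
    """Return (block_group_id, index_in_group) for an EC internal block ID."""
    group_id = block_id & ~BLOCK_GROUP_INDEX_MASK
    index = block_id & BLOCK_GROUP_INDEX_MASK
    return group_id, index


if __name__ == "__main__":
    # The replica reported as left over on 10.12.66.154.
    block_id = parse_block_id("blk_-9223372036371044652")
    print(split_striped_block_id(block_id))
    # prints (-9223372036371044656, 4)
```

With the RS-3-2-1024k policy shown by fsck, a group has five internal blocks (indexes 0 to 4), so index 4 would be the last one in the group.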

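The manual check in the description (compare where fsck says each internal block lives with what is actually on each DataNode's disks) can be sketched in Python as below. This is only an illustration under stated assumptions, not the attached FindDirtyBlk.py: the function names are hypothetical, the fsck text is assumed to have the `blk_...:DatanodeInfoWithStorage[ip:port,...]` shape shown in the log, and the on-disk listing is assumed to be limited to the block group that fsck was run against.

```python
import re
from collections import defaultdict
from pathlib import PurePosixPath
from typing import Dict, Iterable, List, Set, Tuple

# Assumed shape of one location entry in `fsck -files -blocks -locations`.
LOCATION_RE = re.compile(
    r"(blk_-?\d+):DatanodeInfoWithStorage\[(\d+(?:\.\d+){3}):\d+"
)
# A block data file name; "<name>_<genstamp>.meta" files do not match.
BLOCK_FILE_RE = re.compile(r"^blk_-?\d+$")


def expected_locations(fsck_output: str) -> Dict[str, Set[str]]:
    """Map DataNode IP -> set of block names the NameNode reports there."""
    expected: Dict[str, Set[str]] = defaultdict(set)
    for block, ip in LOCATION_RE.findall(fsck_output):
        expected[ip].add(block)
    return expected


def find_stale_replicas(
    expected: Dict[str, Set[str]],
    on_disk: Dict[str, Iterable[str]],
) -> List[Tuple[str, str]]:
    """Return (ip, path) for block files that fsck does not place on that node."""
    stale = []
    for ip, paths in on_disk.items():
        for path in paths:
            name = PurePosixPath(path).name
            if not BLOCK_FILE_RE.match(name):
                continue  # skip .meta files and anything else
            if name not in expected.get(ip, set()):
                stale.append((ip, path))
    return stale


if __name__ == "__main__":
    # Two of the five location entries from the fsck output in the report.
    fsck_text = (
        "blk_-9223372036371044656:DatanodeInfoWithStorage"
        "[10.12.66.154:1019,DS-4b66fe61-93ca-4f8d-8fe0-e00f2ed09e82,DISK],\n"
        "blk_-9223372036371044652:DatanodeInfoWithStorage"
        "[10.12.66.225:1019,DS-13c19c1d-9221-45cc-919f-b54ada9cab15,DISK]]"
    )
    base = ("/hadoop/dfs/data/current/BP-1822992414-10.12.65.48-1660893388633"
            "/current/finalized/subdir21/subdir6/")
    listing = {
        "10.12.66.154": [
            "/data12" + base + "blk_-9223372036371044656",
            "/data3" + base + "blk_-9223372036371044652",
        ],
        "10.12.66.225": ["/data10" + base + "blk_-9223372036371044652"],
    }
    for ip, path in find_stale_replicas(expected_locations(fsck_text), listing):
        print("error:", ip, path)
```

Run against the sample data from the report, this flags only the `/data3/...` copy of `blk_-9223372036371044652` on 10.12.66.154. A flagged file is a candidate for investigation, not proof that it is safe to remove.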

