[ 
https://issues.apache.org/jira/browse/HDFS-16016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17783080#comment-17783080
 ] 

Yuanbo Liu commented on HDFS-16016:
-----------------------------------

[~hexiaoqiao] 

>  I just wonder if meet some performance issue?

Yes indeed we met. The heartbeat thread not only handle FBR but also dispatch 
deleting blocks commands. When the disks of DN are quite busy, the whole 
execution time of heartbeat thread will be longer than one minute, and it makes 
files cannot be closed in time. We've met this performance issue in our 
checkpoint-backup clusters of flink. It's necessary to seperate IBR from 
heartbeat thread, and NN will benefit as the client close files faster without 
useless retrying. Increasing the frequency of ibr doesn't have much side-effect 
as blockReceivedAndDeleted has been refactored with async model long time 
ago(HDFS-9198)

> BPServiceActor add a new thread to handle IBR
> ---------------------------------------------
>
>                 Key: HDFS-16016
>                 URL: https://issues.apache.org/jira/browse/HDFS-16016
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: JiangHua Zhu
>            Assignee: Viraj Jasani
>            Priority: Minor
>              Labels: pull-request-available
>             Fix For: 3.4.0, 3.3.6
>
>         Attachments: image-2023-11-03-18-11-54-502.png, 
> image-2023-11-06-10-53-13-584.png, image-2023-11-06-10-55-50-939.png
>
>          Time Spent: 5h 20m
>  Remaining Estimate: 0h
>
> Now BPServiceActor#offerService() is doing many things, FBR, IBR, heartbeat. 
> We can handle IBR independently to improve the performance of heartbeat and 
> FBR.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to