[jira] [Commented] (HDFS-5922) DN heartbeat thread can get stuck in tight loop

Aaron T. Myers (JIRA) Mon, 10 Feb 2014 14:52:09 -0800

    [ https://issues.apache.org/jira/browse/HDFS-5922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13897155#comment-13897155 ]


Aaron T. Myers commented on HDFS-5922:
--------------------------------------

Hi Arpit, yes please do take a look at fixing it. I was hoping you'd notice it 
since I'm less familiar with this code. :)

I didn't file it as a blocker against 2.3 because the window for hitting this 
is really quite narrow, it's not the end of the world if a DN ends up hitting 
this, and I don't want to further hold up the 2.3.0 release. I personally think 
we should target this for 2.3.1 / 2.4.0.

That said, if you think this is more serious than I do, then we can certainly 
raise the priority and target it for 2.3.0 if you want.

> DN heartbeat thread can get stuck in tight loop
> -----------------------------------------------
>
>                 Key: HDFS-5922
>                 URL: https://issues.apache.org/jira/browse/HDFS-5922
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: datanode
>    Affects Versions: 2.3.0
>            Reporter: Aaron T. Myers
>
> We saw an issue recently on a test cluster where one of the DN threads was 
> consuming 100% of a single CPU. Running jstack indicated that it was the DN 
> heartbeat thread. I believe I've tracked down the cause to a bug in the 
> accounting around the value of {{pendingReceivedRequests}}.
> More details in the first comment.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
