[
https://issues.apache.org/jira/browse/HDFS-5922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13897155#comment-13897155
]
Aaron T. Myers commented on HDFS-5922:
--------------------------------------
Hi Arpit, yes please do take a look at fixing it. I was hoping you'd notice it
since I'm less familiar with this code. :)
I didn't file it as a blocker against 2.3 because the window for hitting this
is really quite narrow, it's not the end of the world if a DN ends up hitting
this, and I don't want to further hold up the 2.3.0 release. I personally think
we should target this for 2.3.1 / 2.4.0.
That said, if you think this is more serious than I do, then we can certainly
raise the priority and target it for 2.3.0 if you want.
> DN heartbeat thread can get stuck in tight loop
> -----------------------------------------------
>
> Key: HDFS-5922
> URL: https://issues.apache.org/jira/browse/HDFS-5922
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: datanode
> Affects Versions: 2.3.0
> Reporter: Aaron T. Myers
>
> We saw an issue recently on a test cluster where one of the DN threads was
> consuming 100% of a single CPU. Running jstack indicated that it was the DN
> heartbeat thread. I believe I've tracked down the cause to a bug in the
> accounting around the value of {{pendingReceivedRequests}}.
> More details in the first comment.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)