[
https://issues.apache.org/jira/browse/HDFS-14680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16896739#comment-16896739
]
Chen Zhang edited comment on HDFS-14680 at 7/31/19 1:46 PM:
------------------------------------------------------------
No, I haven't encountered this issue.
I'm working on HDFS-14657, which relates to HDFS-9260. While reading the code of
HDFS-9260, I found this design too aggressive: StorageInfoDefragmenter
should not shut down the NameNode on any exception, because it's not a critical
thread. I mean it should at least retry a few times before shutting down the
NameNode, or maybe it could keep running no matter what exception happens, like
HeartbeatManager.
We're upgrading our production cluster from 2.6 to 3.1 and I don't want this
to happen to our NameNode, so this is just a proposal for discussion.
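The "retry a few times before shutting down" idea could look roughly like the sketch below. This is only an illustration of the proposal, not the actual StorageInfoDefragmenter code: the class name TolerantBackgroundTask, the Step interface, and the maxConsecutiveFailures and retryDelayMs parameters are all hypothetical names made up for this example.

```java
/**
 * Hypothetical sketch of a background task that tolerates a bounded number
 * of consecutive failures. Not taken from the Hadoop code base.
 */
final class TolerantBackgroundTask {
  /** One unit of background work, e.g. one compaction pass (assumed shape). */
  interface Step {
    void run() throws Exception;
  }

  private final Step step;
  private final int maxConsecutiveFailures;
  private final long retryDelayMs;
  private int consecutiveFailures = 0;

  TolerantBackgroundTask(Step step, int maxConsecutiveFailures, long retryDelayMs) {
    this.step = step;
    this.maxConsecutiveFailures = maxConsecutiveFailures;
    this.retryDelayMs = retryDelayMs;
  }

  int consecutiveFailures() {
    return consecutiveFailures;
  }

  /**
   * Runs one pass. Returns true if the caller should keep looping, false if
   * the failure budget is exhausted (or the thread was interrupted) and the
   * caller should decide whether to escalate, e.g. by terminating the process.
   */
  boolean runOnce() {
    try {
      step.run();
      consecutiveFailures = 0; // a successful pass resets the budget
      return true;
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt(); // preserve the interrupt status
      return false;
    } catch (Exception e) {
      consecutiveFailures++;
      System.err.println("Background pass failed (" + consecutiveFailures
          + "/" + maxConsecutiveFailures + "): " + e);
      if (consecutiveFailures >= maxConsecutiveFailures) {
        return false;
      }
      try {
        Thread.sleep(retryDelayMs); // back off before the next attempt
      } catch (InterruptedException ie) {
        Thread.currentThread().interrupt();
        return false;
      }
      return true;
    }
  }

  public static void main(String[] args) {
    int[] calls = {0};
    // Simulated work that fails on every odd-numbered pass.
    TolerantBackgroundTask task = new TolerantBackgroundTask(() -> {
      calls[0]++;
      if (calls[0] % 2 == 1) {
        throw new IllegalStateException("simulated failure " + calls[0]);
      }
    }, 3, 10L);
    for (int i = 0; i < 6 && task.runOnce(); i++) {
      // keep looping while the task reports it can continue
    }
    System.out.println("passes attempted: " + calls[0]);
  }
}
```

With a very large maxConsecutiveFailures this behaves like the "keep running no matter what" option. With a small value, the caller still gets a chance to terminate, but only after repeated failures rather than on the first exception.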
was (Author: zhangchen):
No, I haven't encountered this issue.
I'm working on HDFS-14657, which relates to HDFS-9620. While reading the code of
HDFS-9620, I found this design too aggressive: StorageInfoDefragmenter
should not shut down the NameNode on any exception, because it's not a critical
thread. I mean it should at least retry a few times before shutting down the
NameNode, or maybe it could keep running no matter what exception happens, like
HeartbeatManager.
We're upgrading our production cluster from 2.6 to 3.1 and I don't want this
to happen to our NameNode, so this is just a proposal for discussion.
> StorageInfoDefragmenter should handle exceptions gently
> -------------------------------------------------------
>
> Key: HDFS-14680
> URL: https://issues.apache.org/jira/browse/HDFS-14680
> Project: Hadoop HDFS
> Issue Type: Bug
> Reporter: Chen Zhang
> Priority: Major
>
> StorageInfoDefragmenter is responsible for FoldedTreeSet compaction, but it
> terminates the NameNode on any exception. Is that too radical?
> I mean, even critical threads like HeartbeatManager don't terminate the
> NameNode when they encounter exceptions, so StorageInfoDefragmenter should not
> do that either.