[ https://issues.apache.org/jira/browse/YARN-8995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16681105#comment-16681105 ]
zhuqi edited comment on YARN-8995 at 11/9/18 9:05 AM: ------------------------------------------------------ Hi [~cheersyang] Thanks for your reply, i think not only the queue size, we can also add a eventMetrics class to monitor the health of cluster's all event dispachers. was (Author: zhuqi): Hi [~cheersyang] Thanks for your reply, i think not only the queue size, we can also add a eventMetrics class to monitor the health of cluster's all event dispacher. > Log the event type of the too big AsyncDispatcher event queue size, and add > the information to the metrics. > ------------------------------------------------------------------------------------------------------------ > > Key: YARN-8995 > URL: https://issues.apache.org/jira/browse/YARN-8995 > Project: Hadoop YARN > Issue Type: Improvement > Components: metrics, nodemanager, resourcemanager > Affects Versions: 3.1.0 > Reporter: zhuqi > Assignee: zhuqi > Priority: Major > > In our growing cluster,there are unexpected situations that cause some event > queues to block the performance of the cluster, such as the bug of > https://issues.apache.org/jira/browse/YARN-5262 . I think it's necessary to > log the event type of the too big event queue size, and add the information > to the metrics, and the threshold of queue size is a parametor which can be > changed. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org