[ https://issues.apache.org/jira/browse/YARN-10483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
jufeng li updated YARN-10483:
-----------------------------
    Attachment: RM_unnormal_state.stack

> YARN hangs and freezes, jobs cannot be submitted; only failing over the active RM or restarting restores service
> ----------------------------------
>
>                 Key: YARN-10483
>                 URL: https://issues.apache.org/jira/browse/YARN-10483
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: capacity scheduler, capacityscheduler, resourcemanager, RM
>    Affects Versions: 3.1.1
>            Reporter: jufeng li
>            Priority: Blocker
>         Attachments: RM_normal_state.stack, RM_unnormal_state.stack
>
>
> YARN freezes intermittently and new jobs cannot be submitted. Examining the jstack
> output shows capacity scheduler threads waiting indefinitely for a lock, while the
> RM's CPU, memory, network, and disk are all normal. The problem is almost certainly
> in the capacity scheduler's internal locking. The jstack logs have been uploaded.
> I hope someone can resolve this: the bug is serious and directly makes production
> unusable. If no one answers, I will come back and ask again later.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org
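
The report's diagnosis (threads waiting indefinitely for a lock, found by reading jstack output) can be partly automated. Below is a minimal sketch, not the reporter's method, that groups the threads in a jstack dump by the lock they are waiting for, so a lock with many waiters stands out. The function name `waiters_by_lock` and the `SAMPLE` excerpt are hypothetical and made up for illustration; they are not taken from the attached stack files. The sketch assumes the usual HotSpot jstack line shapes (`- parking to wait for <addr> (a Class)` and `- waiting to lock <addr> (a Class)`).

```python
import re
from collections import defaultdict

# Each thread entry in a jstack dump starts with the thread name in double quotes.
THREAD_HEADER = re.compile(r'^"(?P<name>[^"]+)"')
# Lines jstack prints for a thread parked on a j.u.c. lock or blocked on a monitor.
WAIT_LINE = re.compile(
    r'-\s+(?:parking to wait for|waiting to lock)\s+<(?P<addr>0x[0-9a-f]+)>'
    r'\s+\(a (?P<cls>[^)]+)\)'
)


def waiters_by_lock(dump_text):
    """Map (lock address, lock class) to the names of threads waiting on it."""
    waiters = defaultdict(list)
    current = None
    for line in dump_text.splitlines():
        header = THREAD_HEADER.match(line)
        if header:
            current = header.group("name")
            continue
        wait = WAIT_LINE.search(line)
        if wait and current is not None:
            waiters[(wait.group("addr"), wait.group("cls"))].append(current)
    return dict(waiters)


# Hypothetical excerpt for illustration only; thread names and addresses are invented.
SAMPLE = r'''
"handler-1" #101 daemon prio=5 os_prio=0 tid=0x00007f0000000001 nid=0x1a01 waiting on condition [0x00007f0000100000]
   java.lang.Thread.State: WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)
    - parking to wait for  <0x00000000c0000001> (a java.util.concurrent.locks.ReentrantReadWriteLock$NonfairSync)

"handler-2" #102 daemon prio=5 os_prio=0 tid=0x00007f0000000002 nid=0x1a02 waiting on condition [0x00007f0000200000]
   java.lang.Thread.State: WAITING (parking)
    at sun.misc.Unsafe.park(Native Method)
    - parking to wait for  <0x00000000c0000001> (a java.util.concurrent.locks.ReentrantReadWriteLock$NonfairSync)

"worker-3" #103 prio=5 os_prio=0 tid=0x00007f0000000003 nid=0x1a03 waiting for monitor entry [0x00007f0000300000]
   java.lang.Thread.State: BLOCKED (on object monitor)
    - waiting to lock <0x00000000c0000002> (a java.lang.Object)
'''

if __name__ == "__main__":
    # Print the most contended locks first.
    ranked = sorted(waiters_by_lock(SAMPLE).items(), key=lambda kv: -len(kv[1]))
    for (addr, cls), names in ranked:
        print(f"{len(names)} waiter(s) on {addr} ({cls}): {', '.join(names)}")
```

Running the same function on both a healthy and a hung dump and comparing the waiter counts per lock is one way to narrow down which lock is never released. The sketch only lists waiters; identifying the thread that holds the lock still requires reading the dump itself.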