Wang, Xinglong created YARN-9854:
------------------------------------

             Summary: RM jetty hang due to WebAppProxyServlet lacks of timeout while doing proxyLink
                 Key: YARN-9854
                 URL: https://issues.apache.org/jira/browse/YARN-9854
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: amrmproxy, resourcemanager, webapp
            Reporter: Wang, Xinglong
            Assignee: Wang, Xinglong

The RM proxies URL requests for [http://rm:port/proxy/application_xxxxx] to the AM or the related history server. We recently hit https://issues.apache.org/jira/browse/SPARK-26961, which can cause a Spark AM to hang forever. We also run a monitoring tool that accesses [http://rm:port/proxy/application_xxxxx] periodically. As a result, every proxied connection to the hung Spark AM also hangs forever, because WebAppProxyServlet does not set a socket connection timeout when it initializes the httpclient for that Spark AM.

The jetty server that hosts the RM servlets has a limited number of threads. Each proxied request to the hung AM leaves one of those threads blocked, waiting for the Spark AM's response. Eventually all jetty threads serving HTTP traffic are blocked, and all RM web links become unresponsive.

Configuring a timeout on the httpclient would avoid this issue.
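
To illustrate the failure mode and the proposed remedy, here is a minimal, self-contained sketch. It is not the WebAppProxyServlet code and does not use the servlet's own httpclient; it substitutes the JDK's HttpURLConnection so that it runs without extra dependencies. The class name, method names, and timeout values are hypothetical and chosen only for illustration. A local socket that never replies stands in for the hung Spark AM.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.InetAddress;
import java.net.Proxy;
import java.net.ServerSocket;
import java.net.SocketTimeoutException;
import java.net.URI;
import java.net.URL;

public class ProxyTimeoutSketch {

    // Hypothetical helper: issues a GET with explicit connect and read
    // timeouts, so a silent backend cannot block the caller indefinitely.
    static int fetchStatus(URL url, int connectTimeoutMs, int readTimeoutMs)
            throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) url.openConnection(Proxy.NO_PROXY);
        conn.setConnectTimeout(connectTimeoutMs); // limit on TCP connect
        conn.setReadTimeout(readTimeoutMs);       // limit on waiting for data
        try {
            return conn.getResponseCode();
        } finally {
            conn.disconnect();
        }
    }

    // Simulates a hung backend: the socket listens, so the TCP connection is
    // established by the OS, but nothing ever accepts it or sends a response.
    // Returns true if the request was abandoned with a timeout.
    static boolean timesOutAgainstHungServer(int readTimeoutMs)
            throws IOException {
        InetAddress loopback = InetAddress.getByName("127.0.0.1");
        try (ServerSocket hung = new ServerSocket(0, 1, loopback)) {
            URL url = URI.create("http://127.0.0.1:" + hung.getLocalPort()
                    + "/proxy/application_xxxxx").toURL();
            try {
                fetchStatus(url, 1000, readTimeoutMs);
                return false; // unexpected: the hung server answered
            } catch (SocketTimeoutException e) {
                return true;  // the calling thread is released
            }
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println("timed out: " + timesOutAgainstHungServer(500));
    }
}
```

Without the two timeout calls, the read in fetchStatus would wait indefinitely in this setup, which is the behaviour that ties up one jetty thread per request in the scenario described above. In the real servlet the equivalent settings would have to be applied to whatever httpclient it constructs, ideally with configurable values.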