Wang, Xinglong created YARN-9854:
------------------------------------

             Summary: RM jetty hang due to WebAppProxyServlet lacks of timeout 
while doing proxyLink
                 Key: YARN-9854
                 URL: https://issues.apache.org/jira/browse/YARN-9854
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: amrmproxy, resourcemanager, webapp
            Reporter: Wang, Xinglong
            Assignee: Wang, Xinglong


The RM proxies URL requests for [http://rm:port/proxy/application_xxxxx] to the
AM or the related history server.

We recently hit https://issues.apache.org/jira/browse/SPARK-26961, which can
cause a Spark AM to hang forever.

We also run a monitoring tool that accesses
[http://rm:port/proxy/application_xxxxx] periodically. Every proxied connection
to the hung Spark AM therefore hangs forever as well, because
WebAppProxyServlet does not set a socket connection timeout when it
initializes the httpclient it uses to reach the Spark AM.

 

The jetty server that hosts the RM servlets has a limited number of threads.
Each such request leaves one thread hung, waiting for a response from the
Spark AM. Eventually all jetty threads serving HTTP traffic are hung, and all
RM web links become unresponsive.
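
The exhaustion described above can be reproduced in miniature. The sketch
below is illustrative only: it uses a plain java.util.concurrent fixed thread
pool as a stand-in for jetty's thread pool, and a latch that is never released
as a stand-in for the hung Spark AM. The class and method names are
hypothetical and do not come from Hadoop.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

class PoolExhaustionSketch {

    /**
     * Submits hungRequests tasks that block forever, then one quick task.
     * Returns true if the quick task completes within waitMs.
     */
    static boolean quickRequestServed(int poolSize, int hungRequests, long waitMs)
            throws Exception {
        // Stand-in for the limited set of server threads.
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        // Never counted down: models a backend that never responds.
        CountDownLatch neverAnswered = new CountDownLatch(1);
        try {
            for (int i = 0; i < hungRequests; i++) {
                pool.submit(() -> {
                    try {
                        neverAnswered.await(); // blocks with no timeout
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                });
            }
            // An unrelated, cheap request, like any other RM web link.
            Future<String> quick = pool.submit(() -> "ok");
            try {
                quick.get(waitMs, TimeUnit.MILLISECONDS);
                return true;
            } catch (TimeoutException e) {
                return false; // every worker is stuck, nothing served it
            }
        } finally {
            pool.shutdownNow(); // interrupt the stuck workers so the JVM can exit
        }
    }
}
```

With as many hung requests as threads, the quick request is never picked up;
with one thread to spare, it is served immediately.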

 

If we configure a timeout on the httpclient, this issue goes away.
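
As a rough illustration of the effect of such a timeout, the sketch below uses
the JDK's HttpURLConnection rather than the httpclient that
WebAppProxyServlet actually uses, so it is not the proposed patch. A local
server socket that never replies stands in for the hung Spark AM. The class
name, method names and timeout values are all assumptions made for the
example.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.ServerSocket;
import java.net.SocketTimeoutException;
import java.net.URI;

class ProxyTimeoutSketch {

    /** Fetches the HTTP status code, giving up if the backend stalls. */
    static int fetchStatus(String url, int connectTimeoutMs, int readTimeoutMs)
            throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(url).toURL().openConnection();
        conn.setConnectTimeout(connectTimeoutMs); // bound on establishing the connection
        conn.setReadTimeout(readTimeoutMs);       // bound on waiting for response data
        try {
            return conn.getResponseCode();
        } finally {
            conn.disconnect();
        }
    }

    /** Returns true if a request to a backend that never replies times out. */
    static boolean timesOutAgainstHungBackend(int readTimeoutMs) throws IOException {
        // The OS completes the TCP handshake from the listen backlog, but
        // nothing ever accepts the connection or writes a response.
        try (ServerSocket hung = new ServerSocket(0)) {
            String url = "http://127.0.0.1:" + hung.getLocalPort() + "/";
            try {
                fetchStatus(url, 1000, readTimeoutMs);
                return false;
            } catch (SocketTimeoutException e) {
                return true; // the calling thread is released instead of hanging
            }
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println("timed out: " + timesOutAgainstHungBackend(500));
    }
}
```

Without the two timeout setters, the same call would block for as long as the
backend stays silent, which is the behaviour reported here.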



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

