Jonathan Hurley created AMBARI-16131:
----------------------------------------

             Summary: Prevent Views From Causing a Loss of Service For Ambari
                 Key: AMBARI-16131
                 URL: https://issues.apache.org/jira/browse/AMBARI-16131
             Project: Ambari
          Issue Type: Bug
    Affects Versions: 2.0.0
            Reporter: Jonathan Hurley
            Assignee: Jonathan Hurley
            Priority: Critical
             Fix For: 2.4.0


The underlying problem is that views are accessed off of the REST endpoint 
({{/api/v1/views}}). This means that the Ambari REST API connector is going to 
handle the request from its own threadpool. There is no way to configure Jetty 
to use a different threadpool for the same connector. As a result, if a request 
to load a view holds the Jetty thread hostage, eventually we will see thread 
starvation and loss of service.

An example of this situation is a view which makes an innocent request to a 
remote resource. If the view's request has a timeout of 60 seconds, then the 
Jetty thread is going to be held for that amount of time. With concurrent users 
and multiple instances of that view deployed, the Jetty threadpool can becomes 
exhausted quickly.

Although there are more graceful ways of handling this situation, they mostly 
involve substantial re-architecture and design:
- The use of a new connector and threadpool would require binding to another 
port for view requests. This will cause problems with "local" views and their 
assumption that if they run on the Ambari server they can share the same 
session.
- The use of a 
[Continuation|https://wiki.eclipse.org/Jetty/Feature/Continuations] in Jetty 
which can suspend the incoming request. We would need the ability for views to 
signal that they have completed their work in order to proceed with the 
suspended request.

A quicker and far less invasive fix would be to create a filter which 
intercepts requests for views. It will determine how many executing view 
requests exist and decide if it will allow the new request through. For 
example, if configured to allow a maximum of 10 concurrent view requests, then 
the 11th request would be denied with an {{HTTP 503 - Service Unavailable}}. 
Although the thread is temporarily used while the filter is processing, it's 
quickly returned to the Jetty pool when it's determined there are too many 
other running view requests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to