[EMAIL PROTECTED] wrote:
Hi
I'm going to be a real pain, but it makes no sense now...

Let's see :)

The email has been a team effort in our offices. We have included some
diagrams to help illustrate our understanding, or lack thereof.

Using a simple example:

1/ Assume I have one httpd server (prefork) that can spawn a maximum of
200 children (through the httpd MaxClients directive).

2/ Assume I have 1 Tomcat server that can handle 200 threads.

If I connect Apache to Tomcat with mod_jk (lb), I can, in theory,
handle 200 concurrent connections.

Now, if I change the figures

1/ Assume I have one httpd server (prefork) that can spawn a maximum of
200 children (through the httpd MaxClients directive).

2/ Assume I have 4 Tomcat servers that can handle 200 threads each.

In this case each Apache child opens a connection to each Tomcat server,
so I have reached the maximum number of connections each Tomcat can
handle. What I cannot understand is that by increasing the Tomcats to 4
I now have 800 possible connections, but with the above config I can only
use 200 of them. If I set Apache to 800 (through the httpd MaxClients
directive) I will open more connections to each Tomcat than it can
handle.

Is the above scenario correct? And if it is, then we are not getting more
throughput by adding more Tomcats, and it would be better to access the
Tomcats directly.

Your considerations are correct. Since you can't influence which Apache httpd process handles requests for which Tomcat instance, and since the processes don't share connections, this design doesn't scale to a huge farm with a simple 1:N (1 httpd, N >> 1 Tomcats) setup.
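
To make the arithmetic concrete, here is a minimal sketch of such a 1:4 setup (host names and ports are made up for illustration). With the prefork MPM each child keeps its own pool of one connection per backend worker, so each of the 4 Tomcats can end up holding one connection per child, i.e. up to MaxClients = 200 connections, while the whole farm still never sees more than 200 concurrent requests:

    # httpd.conf (prefork MPM): at most 200 children
    MaxClients 200

    # workers.properties: one lb worker balancing over 4 AJP workers
    worker.list=lb
    worker.lb.type=lb
    worker.lb.balance_workers=tomcat1,tomcat2,tomcat3,tomcat4

    worker.tomcat1.type=ajp13
    worker.tomcat1.host=tomcat1.example.com
    worker.tomcat1.port=8009
    # (tomcat2 ... tomcat4 defined analogously)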

What can you do:

0) For a relatively small farm the problem is usually not very big, because under high load the costly resource is CPU power, not the memory and switching overhead of having too many threads.

1) You can use the APR connector for Tomcat. This decouples the thread from the connection as long as there is no request in flight on it. That way you only need threads for the real request parallelism on each backend Tomcat. The number of connections will stay high though, so you can't scale to a hundred Tomcats with a thousand connections each.
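
For example (a hedged sketch: the protocol class name below is the APR-based AJP connector from Tomcat 6 and is an assumption here, your Tomcat version may differ):

    <!-- server.xml: APR-based AJP connector. A thread is only bound
         to a connection while a request is being processed on it. -->
    <Connector port="8009"
               protocol="org.apache.coyote.ajp.AjpAprProtocol"
               maxThreads="200" />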

2) You can use the worker MPM, because there a configurable number of threads shares the same connection pool. For N Tomcats, on average only Threads_per_Process/N threads will need a connection to any one Tomcat instance. Of course in reality the number will be higher, but for bigger N and enough threads per process you should notice a relevant decrease in connections. Maybe not 1/N but something like 2/N, depending on how much session affinity breaks ideal balancing. If you get close to some factor C/N for a not too large constant C, you are back in the scaling business.
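
A sketch with illustrative numbers: 4 processes of 64 threads each still give you 256 concurrent requests, but the 64 threads of each process share one connection pool per backend, so for N balanced Tomcats only about 64/N connections per process are busy for any one of them on average:

    # httpd.conf (worker MPM)
    <IfModule worker.c>
        ServerLimit      4
        ThreadsPerChild  64
        MaxClients       256
    </IfModule>

    # workers.properties: mod_jk normally derives the pool size from
    # ThreadsPerChild by itself; shown explicitly only for clarity
    worker.tomcat1.connection_pool_size=64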

3) For huge designs you'll need to partition it into M:N (M apache httpd, N Tomcat), where the quotient N/M doesn't get too big.
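
One possible reading, as a purely illustrative sketch (whether you split the backends disjointly or overlap the groups is a design choice): with M = 2 httpds and N = 4 Tomcats, each httpd could balance over only its own pair, so each Tomcat sees connections from just one httpd's children:

    # workers.properties on httpd-A
    worker.lb.balance_workers=tomcat1,tomcat2

    # workers.properties on httpd-B
    worker.lb.balance_workers=tomcat3,tomcat4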

4) If your balancing breaks, or, much more likely, if something in your system gets slow, then your expectations concerning the parallelism no longer hold. You can't fix that without fixing the reason for the slowness. What is important, though, is to configure the idleness timeouts for the Tomcat thread pool and the jk connection pool, such that once the original reason for the slowness is gone, the connections and threads drop back below the critical level.
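
A hedged example of lining the two sides up (values illustrative; note that jk's connection_pool_timeout is given in seconds, while Tomcat's connectionTimeout is in milliseconds):

    # workers.properties: drop pool connections idle for 10 minutes
    worker.tomcat1.connection_pool_timeout=600

    <!-- server.xml: let Tomcat close AJP connections idle for the
         same 10 minutes (600000 ms), so both ends agree -->
    <Connector port="8009" protocol="AJP/1.3"
               maxThreads="200" connectionTimeout="600000" />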

The situation you experienced was most likely coming from a slowness in the backend applications (remember the need for a Java thread dump?). Then any throughput system will soon get filled up from the back to the front. The best you can do is answer the overload requests quickly with an error, so that the backend systems have a chance to become stable again. For this you need timeouts and other load-limiting configuration.
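
For instance (attribute names from mod_jk 1.2.x, values illustrative), the jk timeouts let the balancer return an error quickly instead of queueing behind a stuck backend:

    # workers.properties
    # probe the backend with cping/cpong after connect and before
    # each request (milliseconds):
    worker.tomcat1.connect_timeout=10000
    worker.tomcat1.prepost_timeout=10000
    # maximum ms to wait for the next packet of a response:
    worker.tomcat1.reply_timeout=60000
    # don't multiply the load on a sick farm by retrying often:
    worker.lb.retries=2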

When doing sizing considerations, you always need to be clear with yourself whether you are talking about the normal situation, or trying to find out what will happen during times of overload.

So, using a ridiculous example: if you have 100 Tomcat boxes connecting
to one httpd server, the limit for the number of spawned children would
still only be 200, even though you should be able to handle 100x200
concurrent connections. Even if you take into account that for each
request per second received the request will take 4 seconds to process,
it still does not seem an effective use of the Tomcat resources.


A few other resulting questions:
If child1, child2, child3, etc. each have a connection to each Tomcat,
does each child also do its own load balancing, or do all the children
share information to do the load balancing?

Fortunately they share the balancing state. This was introduced about 10 JK releases ago by means of a shared memory segment.

You could ask why the processes don't share the connection pool. They might do this some time in the future. Historically the pool came before the shared memory for balancing. We prefer to stabilize JK 1.2.x now and then start working on a major next release. Switching to a shared pool would likely lead to a couple of releases with a couple of bugs, so I don't expect that to happen in 1.2.x.

Regards,

Rainer



