Hi Tim,
[EMAIL PROTECTED] wrote:
> Setup: TC 4.1.18, IIS5, mod_jk 1.2.25, JDK 1.3.1, Windows 2000
> 1 LB pointing to 1 worker TC, same server as IIS. Because of 3rd party
> license issues, we're stuck with using 1 TC only.
> Usually the Tomcat runs with ~ 280 threads, but soon after a bounce it's
> getting 400+ threads, and pretty soon I get errors such as this -
> [Error] ThreadPool- - Caught exception executing
> org.apache.jk.common.SocketAcceptor... <java.lang.OutOfMemoryError
> unable to create new native thread>
OK, an OutOfMemoryError during thread creation usually also means that
your Tomcat doesn't accept any new connections any more.

With the old-style connector, whenever a new connection comes in, Tomcat
accepts it on the thread that waits for new connections and lets this
thread work on the requests coming in via this connection. It then looks
for another thread to take over accepting new connections. If the pool
is empty and the maximum number of threads has not been reached, it will
try to create a new thread. If there is not enough memory for this,
thread creation fails, and from then on no thread is left to accept new
connections. This effectively means that Tomcat will still answer
requests on the already open connections, but no new requests are
handled - until Tomcat is restarted.
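To illustrate the failure mode, here is a minimal, generic sketch of a
thread-per-connection accept loop in plain Java. It is not Tomcat's
connector code, and it simplifies the handoff (the accept loop
dispatches to a fresh thread instead of handing over the accept), but
the effect is the same: once thread creation fails, nothing calls
accept() any more, while already running threads keep serving their
connections.

  // Generic thread-per-connection accept loop, only to illustrate the
  // failure mode described above. NOT Tomcat's connector code.
  import java.io.IOException;
  import java.net.ServerSocket;
  import java.net.Socket;

  public class AcceptLoopDemo {
      public static void main(String[] args) throws IOException {
          ServerSocket server = new ServerSocket(8009);
          while (true) {
              final Socket socket = server.accept(); // wait for a connection
              try {
                  // Hand the connection to a fresh thread (a real pool
                  // would reuse threads instead).
                  new Thread(new Runnable() {
                      public void run() {
                          handle(socket); // serve requests on this connection
                      }
                  }).start();
              } catch (OutOfMemoryError e) {
                  // "unable to create new native thread": from now on no
                  // new connections are accepted, existing threads keep
                  // serving theirs.
                  socket.close();
                  break;
              }
          }
      }

      static void handle(Socket socket) {
          // request/response handling would go here
      }
  }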
That's the reason why you should always limit the number of threads in
the pool. It's better not to accept new connections once your threads
have reached the defined limit than to create too many threads and
thereby break the service unrecoverably.
How many threads will work? It depends on the OS. On Windows something
between 400 and 500, on Linux it depends on 32/64 Bits and the thread
library. On Solaris Sparc there is no such trouble. It's mostly a
question of memory adressing for thread stacks.
I would test, if 400 work, and then limit the pool to 400. How do you
test? Simply configure the pool to 400 minimum and maximum and see, if
you can start Tomcat and if all threads get created. Then you know, that
your platform can create that many threads inside the JVM.
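As a sketch of that test (the attribute names are an assumption on my
part and depend on the connector you actually run: the classic Tomcat
4.1 Connectors document them as minProcessors/maxProcessors, later
Tomcats call them minSpareThreads/maxThreads, and the org.apache.jk
connector may take its pool settings elsewhere, so please check the
configuration reference for your connector class):

  <!-- server.xml sketch: force the pool to 400 threads at startup,
       keeping all your other Connector attributes as they are -->
  <Connector port="8009"
             minProcessors="400"
             maxProcessors="400" />

If Tomcat starts and you can see all 400 threads, the platform limit is
not your problem at that size.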
So far this is totally independent of mod_jk/isapi plugin.
> Shortly after, mod_jk declares the worker to be in an Error state, and
> all my site serves is the 'Service Temporarily Unavailable' page. The
> isapi_redirect.log is dumping out hundreds of messages including 'all
> tomcat instances are busy or in error state'. This is like a couple of
> minutes after TC is restarted !
The fact that you get many such errors once your Tomcat isn't accepting
any more connections seems normal. The fact that *all* Tomcats are busy
seems normal too, since you only have one.

Why does mod_jk put your Tomcat into error state, although it still has
400 working connections (but cannot open a new one)? Because that's the
way it's implemented: if we try to open a new connection and get an
error (e.g. a network timeout), we declare the backend as broken (error
state).
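Side note: once a worker is in error state, mod_jk only retries it after
the recover time has passed (60 seconds by default). If you want your
single backend to be probed again sooner, you could lower that on the
load balancer, e.g. (replace "lb" with the name of your load balancer
worker; the value is just an example):

  # worker.properties sketch: retry an errored member after 30 seconds
  worker.lb.recover_time=30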
> worker.properties has
> worker.wkr.connection_pool_size=200
> worker.wkr.connection_pool_timeout=300
Hmmm. This would mean that the web server opens at most 200 connections
to Tomcat. If we assume that there is only one web server in front of
Tomcat and no traffic is going to Tomcat from somewhere else (other
connectors etc.), we should not see more than 200 and a few threads
inside Tomcat. This contradicts your observations.

It's possible that old connections are not closed correctly on the
Tomcat side, e.g. if a firewall between web server and Tomcat drops idle
connections. Usually one uses a connectionTimeout on the Tomcat side
together with the connection pool timeout on the mod_jk side.
See:
http://tomcat.apache.org/connectors-doc/generic_howto/timeouts.html
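To sketch the pairing with the values you already use: your
connection_pool_timeout is given in seconds, while the Tomcat-side
connectionTimeout is given in milliseconds, so the matching pair would
be 300 and 300000. Whether your 4.1 AJP connector accepts
connectionTimeout is something to verify against its docs; the howto
above covers the details.

  # worker.properties (mod_jk side, seconds) - you already have this
  worker.wkr.connection_pool_timeout=300

  <!-- server.xml (Tomcat side, milliseconds), keeping your other
       Connector attributes as they are -->
  <Connector port="8009" connectionTimeout="300000" />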
> - When mod_jk shows a state of ERR/FRC, what exactly does the forced
> recovery attempt to do?
If a balancer has to put all its members into error state, it could
either always return an error, or it could say: I've got no good
backend, so let's try to use the bad ones anyhow. Forced recovery means:
if no backend is OK, then use a bad one. In your case (one backend) it
means that although the backend goes into error state, we still send the
requests there.
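In toy code the decision looks roughly like this; it only illustrates
the logic above, it is not mod_jk's implementation:

  // Toy illustration of "forced recovery": prefer a member that is not
  // in error state; if all members are in error and forced recovery is
  // on, use a bad one anyway instead of answering with an error.
  public class ForceRecoveryDemo {

      static class Member {
          String name;
          boolean inError;
          Member(String name, boolean inError) {
              this.name = name;
              this.inError = inError;
          }
      }

      static Member pick(Member[] members, boolean forceRecovery) {
          for (int i = 0; i < members.length; i++) {
              if (!members[i].inError) {
                  return members[i]; // normal case: a healthy backend
              }
          }
          // every backend is in error state
          return (forceRecovery && members.length > 0) ? members[0] : null;
      }

      public static void main(String[] args) {
          // your situation: a single Tomcat that just went into error
          Member[] members = { new Member("wkr", true) };
          Member chosen = pick(members, true);
          System.out.println(chosen != null
                  ? "send request to " + chosen.name
                  : "Service Temporarily Unavailable");
      }
  }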
> - Our webapp in TC works hard on startup to cache frequent content into
> memory. That means bouncing the Tomcat is painful for us during the day.
Fix an upper thread limit. If your app behaves well, then you shouldn't
have to restart. Of course, if the problem is bad performance and your
resources are exhausted, that problem will only end when the load
decreases or the performance gets better. But usually the container will
then proceed as normal, without a restart.

In the case described above, when the container could not start a new
thread for the socket accept, it is broken and a restart is needed.
> I wonder anecdotally if the FRC messages from mod_jk are causing it
> hassle? Moreover in the first ~15 minutes of the webapp, response times
> will be longer than usual. Is this causing mod_jk to give up? If so
> what's the best tuning approach?
It will only give up on long response times when a reply_timeout is set.
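In other words, slow responses alone won't put the worker into error
state unless you have configured something like the following in your
worker.properties (the value is only an example, in milliseconds; with
no reply_timeout set, mod_jk waits for the reply indefinitely):

  # Example only: give up on a backend reply after 5 minutes
  worker.wkr.reply_timeout=300000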
> - Is there a simple way in mod_jk to throttle traffic incident on
> tomcat?
No. There are other modules, but most of them throttle traffic per
client IP. Search for httpd modules with "throttle" or "bandwidth". You
could code a servlet filter, though. Throttling inside Tomcat is less
stable than throttling in front of it, but there you can have some easy
logic about whom to send the throttle page (error page) to, e.g. when
too many users are logged on (or there are too many sessions) and the
request doesn't belong to a session, etc. Ooops, sorry, you are on IIS,
forgot about that - the httpd modules won't help you there.
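The servlet filter idea is independent of IIS, though. A rough sketch of
what I mean (all class, parameter and limit names here are made up for
illustration; counting sessions via an HttpSessionListener and sending a
503 are just one way to do it):

  // Sketch of a throttling servlet filter: refuse requests that do not
  // belong to an existing session once too many sessions are active.
  // Names ("ThrottleFilter", "maxSessions") are made up; register the
  // class as both <filter> and <listener> in web.xml.
  import java.io.IOException;
  import javax.servlet.*;
  import javax.servlet.http.*;

  public class ThrottleFilter implements Filter, HttpSessionListener {

      private static int activeSessions = 0;
      private static final Object LOCK = new Object();
      private int maxSessions = 500; // arbitrary default

      public void init(FilterConfig config) throws ServletException {
          String p = config.getInitParameter("maxSessions");
          if (p != null) {
              maxSessions = Integer.parseInt(p);
          }
      }

      public void doFilter(ServletRequest req, ServletResponse res,
                           FilterChain chain)
              throws IOException, ServletException {
          HttpServletRequest request = (HttpServletRequest) req;
          boolean hasSession = request.getSession(false) != null;
          if (!hasSession && getActiveSessions() >= maxSessions) {
              // New visitor while we are already "full": send the
              // throttle (error) page instead of letting him in.
              ((HttpServletResponse) res).sendError(
                      HttpServletResponse.SC_SERVICE_UNAVAILABLE,
                      "Too many users logged on, please try again later");
              return;
          }
          chain.doFilter(req, res);
      }

      public void destroy() {
      }

      // HttpSessionListener part: keeps the session count up to date.
      public void sessionCreated(HttpSessionEvent e) {
          synchronized (LOCK) { activeSessions++; }
      }

      public void sessionDestroyed(HttpSessionEvent e) {
          synchronized (LOCK) { activeSessions--; }
      }

      private static int getActiveSessions() {
          synchronized (LOCK) { return activeSessions; }
      }
  }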
> - Lastly, can the 'Service Temporarily Unavailable' page be customised?
> I'm presuming that it's being served by mod_jk internals rather than
> IIS?
Is there anything for that inside IIS itself? With Apache httpd it would
be the ErrorDocument directive.
> regards
> Tim
Regards,
Rainer