Sean Finkel wrote:
Hello,
First a brief background on the setup:
We are running Apache 1.3.31 utilizing mod_jk (not jk2). We are running two instances of Tomcat. Previously, both were version 4. Currently, we have one shared instance running the latest 5.x release (just compiled yesterday). We have one customer running a private 4.x instance.
With that said, we just moved to a new server (Dual Xeon, RHEL 3.0) and now the problem we are having is the customer who has this private instance has jsp pages that "hang". But, it does not hang all the time. What's weird, is most of the time, I can stop the shared instance and the customer's pages will stop hanging. I would like to fix the hanging completely, obviously.
Ok this is happening again right now. Here is some output from various programs:
From Apache Status (blanked out the VHost) - these Apache child processes have been running for 15+ minutes waiting on Tomcat. They are not servicing new requests and then dieing like they should be (Apache hands *everything* off for this domain to Tomcat, including images):
--------------------( snip )-------------------
*Srv* *PID* *Acc* *M* *CPU* *SS* *Req* *Conn* *Child* *Slot* *Host* *VHost* *Request*
*0-0* 24535 0/21/1803 *W* 0.23 3333 0 0.0 0.14 21.56 4.153.20.11 xxxx GET /images/pics/todayspic27.jpg HTTP/1.1
*1-0* 23257 0/39/2065 *W* 0.46 3452 0 0.0 0.27 12.33 4.153.20.11 xxxx GET /images/pics/todayspic16.jpg HTTP/1.1
*2-0* 23252 0/17/1868 *W* 0.20 3593 0 0.0 0.26 15.75 207.69.137.135 xxxx GET /images/pics/todayspic13.jpg HTTP/1.1
*3-0* 23377 0/22/1825 *W* 0.26 3445 0 0.0 0.34 20.29 4.153.20.11 xxxx GET /images/pics/todayspic21.jpg HTTP/1.1
*4-0* 23378 0/22/1839 *W* 0.38 3439 0 0.0 0.07 16.09 4.153.20.11 xxxx GET /images/pics/todayspic15.jpg HTTP/1.1
*5-0* 22810 0/43/1750 *W* 0.26 3584 0 0.0 0.54 19.31 207.69.137.135 xxxx GET /images/pics/todayspic12.jpg HTTP/1.1
*6-0* 23267 0/37/1788 *W* 0.68 3425 0 0.0 0.52 14.50 4.153.20.11 xxxx GET /images/pics/todayspic19.jpg HTTP/1.1
*7-0* 26919 0/33/1586 *W* 0.15 3143 0 0.0 0.27 12.33 4.153.20.11 xxxx GET /images/pics/todayspic36.jpg HTTP/1.1
*8-0* 23441 0/22/1532 *W* 0.28 3385 0 0.0 0.09 11.07 4.153.20.11 xxxx GET /images/pics/todayspic25.jpg HTTP/1.1
*9-0* 26920 0/30/1396 *W* 0.36 3147 0 0.0 0.17 8.23 4.153.20.11 xxxx GET /images/pics/todayspic34.jpg HTTP/1.1
-------------( /snip )-------------------------
From A thread dump of the JVM - obviously there were more than three, but I only included three for the sake of brevity:
---------------( snip )-----------------------
"Thread-20" daemon prio=1 tid=0x08192b68 nid=0x3560 in Object.wait() [a84db000..a84db87c]
at java.lang.Object.wait(Native Method)
- waiting on <0xab9bcba8> (a org.apache.tomcat.util.threads.ThreadPool$ControlRunnable)
at java.lang.Object.wait(Object.java:429)
at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:512)
- locked <0xab9bcba8> (a org.apache.tomcat.util.threads.ThreadPool$ControlRunnable)
at java.lang.Thread.run(Thread.java:534)
"Thread-19" daemon prio=1 tid=0x08195450 nid=0x3560 in Object.wait() [a855c000..a855c87c]
at java.lang.Object.wait(Native Method)
- waiting on <0xab9bcc10> (a org.apache.tomcat.util.threads.ThreadPool$ControlRunnable)
at java.lang.Object.wait(Object.java:429)
at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:512)
- locked <0xab9bcc10> (a org.apache.tomcat.util.threads.ThreadPool$ControlRunnable)
at java.lang.Thread.run(Thread.java:534)
"Thread-18" daemon prio=1 tid=0x0824b148 nid=0x3560 in Object.wait() [a85dd000..a85dd87c]
at java.lang.Object.wait(Native Method)
- waiting on <0xab7b9ae0> (a org.apache.tomcat.util.threads.ThreadPool$ControlRunnable)
at java.lang.Object.wait(Object.java:429)
at org.apache.tomcat.util.threads.ThreadPool$ControlRunnable.run(ThreadPool.java:512)
- locked <0xab7b9ae0> (a org.apache.tomcat.util.threads.ThreadPool$ControlRunnable)
at java.lang.Thread.run(Thread.java:534)
--------------------- ( /snip )--------------
So, as another user suggested earler, it seems some blocking is happening. Also as I mentioned, I am on RHEL 3.0 and JDK version: 1.4.2_04. Does anyone have any ideas as to how to prevent the threads from locking up? I have already tried the "LD_KERNEL_ASSUME=2.4" and that does not help.
-Sean
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
