Well, after much wailing and gnashing of teeth, this appears to be due to a bug in the way CF handles pooled SQL statements: we changed "Max Pooled Statements" from the default 1000 to 1, and the server has been running happily for over 24 hours now, where previously CF was restarting ever hour or so during busy periods. Hmm...
Bert ps and if anyone is looking to go on holiday/vacation they could do worse than hop over to http://www.holiday-rentals.com/ - over 10000 properties available for rent, and a stable server. :) On Mon, 10 Jan 2005 17:41:03 +0000, Bert Dawson <[EMAIL PROTECTED]> wrote: > I've got a couple of servers running enterprise and they keep crashing. > Usually they restart themselves, and the only downside is everyone > losing sessions, and delayMS and CPU go through the roof for a minute > or two. > Looking at memory usage and CPU they seem to be fine: CPU around > 10-20%, using 1.4gb memory being used on a 4gb box. > When it crashes the most common error reported is , and the stacktrace > is an empty array. > Sometimes it trows a few errors over the course of a few minutes, > while serving the same page with no errors. In other words, its > possible to go to a page, press refresh, and sometimes get an error, > other times not. When an error other than > java.lang.NullPointerException is reported it is always a non-sense > error, by which I mean it cannot be true. For example it will complain > about a recordset not having a particular column, but from the code > point of vieww the column *is* there, and the same request is > processed successfully most of the time. > > I'm guessing that there is some underlying problem with Jrun/jvm, but > am at a loss as to where to look next. > > Both are win2k running CFMX6.1 with updater. > One has 3.75GB RAM, with dual Xeon 3.06GHz, and the other 1GB RAM with > dual Xeon 2.40GHz. > Both boxes are running the same application, though there are slight > differences in that certain features will be slightly different. > The Db is on a separate box, SQLserver2k, dual xeon 3.2GHz, 4GB RAM. > (using 2GB, and bubbling around 20% CPU). > > We've been logging metrics for a while, and although they crash more > often when under load, there doesn't seem to be anything going wrong > immediately rpior to a crash/restart. > > Any pointers as to what to look for and where to go next would be very > much appreciated, and incase its any use, here's a couple of lines > from the cfusion-event.log just before it crashed earlier today. > > 10/01 14:42:15 metrics > jrpp.listenTh=1 jrpp.idleTh=19 jrpp.delayTh=0 jrpp.busyTh=3 > jrpp.totalTh=23 jrpp.delayRq=134 jrpp.droppedRq=0 > jrpp.handledRq=400 jrpp.handledMs=140979 jrpp.delayMs=15 > jrpp.bytesIn=399457 jrpp.bytesOut=11436820 freeMemory=306523 > totalMemory=882624 sessions=51 sessionsInMem=51 > scheduler.listenTh=12 scheduler.idleTh=0 scheduler.delayTh=0 > scheduler.busyTh=1 scheduler.totalTh=13 scheduler.delayRq=0 > scheduler.droppedRq=0 scheduler.delayMs=0 > 10/01 14:43:15 metrics > jrpp.listenTh=1 jrpp.idleTh=19 jrpp.delayTh=0 jrpp.busyTh=3 > jrpp.totalTh=23 jrpp.delayRq=107 jrpp.droppedRq=0 > jrpp.handledRq=399 jrpp.handledMs=134817 jrpp.delayMs=0 > jrpp.bytesIn=396272 jrpp.bytesOut=11027498 freeMemory=322098 > totalMemory=882624 sessions=74 sessionsInMem=74 > scheduler.listenTh=12 scheduler.idleTh=0 scheduler.delayTh=0 > scheduler.busyTh=1 scheduler.totalTh=13 scheduler.delayRq=0 > scheduler.droppedRq=0 scheduler.delayMs=0 > 10/01 14:44:15 metrics > jrpp.listenTh=1 jrpp.idleTh=18 jrpp.delayTh=0 jrpp.busyTh=4 > jrpp.totalTh=23 jrpp.delayRq=85 jrpp.droppedRq=0 jrpp.handledRq=405 > jrpp.handledMs=167584 jrpp.delayMs=0 jrpp.bytesIn=399513 > jrpp.bytesOut=10235531 freeMemory=333136 totalMemory=882624 > sessions=95 sessionsInMem=95 scheduler.listenTh=12 > scheduler.idleTh=0 scheduler.delayTh=0 scheduler.busyTh=1 > scheduler.totalTh=13 scheduler.delayRq=0 scheduler.droppedRq=0 > scheduler.delayMs=0 > > TIA > Bert > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| Discover CFTicket - The leading ColdFusion Help Desk and Trouble Ticket application http://www.houseoffusion.com/banners/view.cfm?bannerid=48 Message: http://www.houseoffusion.com/lists.cfm/link=i:10:5135 Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/10 Subscription: http://www.houseoffusion.com/lists.cfm/link=s:10 Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.10 Donations & Support: http://www.houseoffusion.com/tiny.cfm/54
