Well, after much wailing and gnashing of teeth, this appears to be due
to a bug in the way CF handles pooled SQL statements: we changed "Max
Pooled Statements" from the default 1000 to 1, and the server has been
running happily for over 24 hours now, where previously CF was
restarting every hour or so during busy periods.
Hmm...

Bert

ps and if anyone is looking to go on holiday/vacation they could do
worse than hop over to http://www.holiday-rentals.com/ - over 10000
properties available for rent, and a stable server.
:)


On Mon, 10 Jan 2005 17:41:03 +0000, Bert Dawson <[EMAIL PROTECTED]> wrote:
> I've got a couple of servers running enterprise and they keep crashing.
> Usually they restart themselves, and the only downside is everyone
> losing sessions, and delayMS and CPU go through the roof for a minute
> or two.
> Looking at memory usage and CPU they seem to be fine: CPU around
> 10-20%, with 1.4GB of memory in use on a 4GB box.
> When it crashes, the most common error reported is
> java.lang.NullPointerException, and the stacktrace is an empty array.
> Sometimes it throws a few errors over the course of a few minutes,
> while also serving the same page with no errors. In other words, it's
> possible to go to a page, press refresh, and sometimes get an error,
> other times not. When an error other than
> java.lang.NullPointerException is reported it is always a nonsense
> error, by which I mean it cannot be true. For example, it will complain
> about a recordset not having a particular column, but from the code
> point of view the column *is* there, and the same request is
> processed successfully most of the time.
> 
> I'm guessing that there is some underlying problem with Jrun/jvm, but
> am at a loss as to where to look next.
> 
> Both are win2k running CFMX6.1 with updater.
> One has 3.75GB RAM, with dual Xeon 3.06GHz, and the other 1GB RAM with
> dual Xeon 2.40GHz.
> Both boxes are running the same application, though a few features
> differ slightly between them.
> The Db is on a separate box, SQLserver2k, dual xeon 3.2GHz, 4GB RAM.
> (using 2GB, and bubbling around 20% CPU).
> 
> We've been logging metrics for a while, and although they crash more
> often when under load, there doesn't seem to be anything going wrong
> immediately prior to a crash/restart.
> 
> Any pointers as to what to look for and where to go next would be very
> much appreciated, and in case it's any use, here are a few lines
> from the cfusion-event.log just before it crashed earlier today.
> 
> 10/01 14:42:15 metrics
> jrpp.listenTh=1 jrpp.idleTh=19  jrpp.delayTh=0  jrpp.busyTh=3   
> jrpp.totalTh=23 jrpp.delayRq=134        jrpp.droppedRq=0        
> jrpp.handledRq=400      jrpp.handledMs=140979   jrpp.delayMs=15 
> jrpp.bytesIn=399457     jrpp.bytesOut=11436820  freeMemory=306523       
> totalMemory=882624      sessions=51     sessionsInMem=51        
> scheduler.listenTh=12   scheduler.idleTh=0      scheduler.delayTh=0     
> scheduler.busyTh=1      scheduler.totalTh=13    scheduler.delayRq=0     
> scheduler.droppedRq=0   scheduler.delayMs=0
> 10/01 14:43:15 metrics
> jrpp.listenTh=1 jrpp.idleTh=19  jrpp.delayTh=0  jrpp.busyTh=3   
> jrpp.totalTh=23 jrpp.delayRq=107        jrpp.droppedRq=0        
> jrpp.handledRq=399      jrpp.handledMs=134817   jrpp.delayMs=0  
> jrpp.bytesIn=396272     jrpp.bytesOut=11027498  freeMemory=322098       
> totalMemory=882624      sessions=74     sessionsInMem=74        
> scheduler.listenTh=12   scheduler.idleTh=0      scheduler.delayTh=0     
> scheduler.busyTh=1      scheduler.totalTh=13    scheduler.delayRq=0     
> scheduler.droppedRq=0   scheduler.delayMs=0
> 10/01 14:44:15 metrics
> jrpp.listenTh=1 jrpp.idleTh=18  jrpp.delayTh=0  jrpp.busyTh=4   
> jrpp.totalTh=23 jrpp.delayRq=85 jrpp.droppedRq=0
> jrpp.handledRq=405      jrpp.handledMs=167584   jrpp.delayMs=0
> jrpp.bytesIn=399513     jrpp.bytesOut=10235531  freeMemory=333136
> totalMemory=882624      sessions=95     sessionsInMem=95
> scheduler.listenTh=12   scheduler.idleTh=0      scheduler.delayTh=0
> scheduler.busyTh=1      scheduler.totalTh=13    scheduler.delayRq=0
> scheduler.droppedRq=0   scheduler.delayMs=0
> 
> TIA
> Bert
>
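
The metrics entries quoted above are a timestamp line followed by
whitespace-separated key=value pairs, so they can be pulled into a
script to look for trends before a restart. The following is only a
rough Python sketch under that assumption: the function names
parse_metrics and used_memory_fraction are made up for illustration,
and the layout is inferred from the three entries in this message
rather than from any documented log format.

```python
import re

# Matches key=value tokens such as "jrpp.delayMs=15" or "freeMemory=306523".
PAIR = re.compile(r"([A-Za-z.]+)=(\d+)")
# Matches an entry header such as "10/01 14:42:15 metrics".
HEADER = re.compile(r"(\d{2}/\d{2}) (\d{2}:\d{2}:\d{2}) metrics")


def parse_metrics(text):
    """Return one dict per metrics entry found in a log excerpt.

    Hypothetical helper: assumes each entry starts with a
    "DD/MM HH:MM:SS metrics" line and that values are integers.
    Leading "> " quote markers from email are stripped.
    """
    entries = []
    for line in text.splitlines():
        line = line.lstrip("> ").strip()
        header = HEADER.match(line)
        if header:
            entries.append({"date": header.group(1), "time": header.group(2)})
        if entries:
            for key, value in PAIR.findall(line):
                entries[-1][key] = int(value)
    return entries


def used_memory_fraction(entry):
    """Fraction of totalMemory not reported as freeMemory."""
    return 1 - entry["freeMemory"] / entry["totalMemory"]
```

Feeding it the whole cfusion-event.log and printing jrpp.delayRq,
jrpp.busyTh and used_memory_fraction per entry would make it easier to
see whether anything drifts in the minutes before a restart.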
