Bert, Start by making sure that your apps aren't producing any error entries in the logs files. Specifically, the application.log, exception.log and server.log. Start will with a clean set of log files to reduce the bulk so you can focus on what is happening.
Once you know that app can run cleanly without producing errors, get a load tester. It doesn't matter which one; Segue SilkPerformer, Mercury LoadRunner, OpenSTA (free), Microsoft WAS Tool (free) If there is a problem with a piece of the application under load, the scripts you use to load test the application will help you narrow down the problem area. If there is a problem to be found, good debugging technique will help reveal the stealthy gremlin. Regards, Rob Rusher --- Bert Dawson <[EMAIL PROTECTED]> wrote: > I've got a couple of servers running enterprise and they keep crashing. > Usually they restart themselves, and the only downside is everyone > losing sessions, and delayMS and CPU go through the roof for a minute > or two. > Looking at memory usage and CPU they seem to be fine: CPU around > 10-20%, using 1.4gb memory being used on a 4gb box. > When it crashes the most common error reported is , and the stacktrace > is an empty array. > Sometimes it trows a few errors over the course of a few minutes, > while serving the same page with no errors. In other words, its > possible to go to a page, press refresh, and sometimes get an error, > other times not. When an error other than > java.lang.NullPointerException is reported it is always a non-sense > error, by which I mean it cannot be true. For example it will complain > about a recordset not having a particular column, but from the code > point of vieww the column *is* there, and the same request is > processed successfully most of the time. > > I'm guessing that there is some underlying problem with Jrun/jvm, but > am at a loss as to where to look next. > > Both are win2k running CFMX6.1 with updater. > One has 3.75GB RAM, with dual Xeon 3.06GHz, and the other 1GB RAM with > dual Xeon 2.40GHz. > Both boxes are running the same application, though there are slight > differences in that certain features will be slightly different. > The Db is on a separate box, SQLserver2k, dual xeon 3.2GHz, 4GB RAM. > (using 2GB, and bubbling around 20% CPU). > > We've been logging metrics for a while, and although they crash more > often when under load, there doesn't seem to be anything going wrong > immediately rpior to a crash/restart. > > Any pointers as to what to look for and where to go next would be very > much appreciated, and incase its any use, here's a couple of lines > from the cfusion-event.log just before it crashed earlier today. > > 10/01 14:42:15 metrics > jrpp.listenTh=1 jrpp.idleTh=19 jrpp.delayTh=0 jrpp.busyTh=3 > jrpp.totalTh=23 > jrpp.delayRq=134 jrpp.droppedRq=0 jrpp.handledRq=400 > jrpp.handledMs=140979 > jrpp.delayMs=15 jrpp.bytesIn=399457 jrpp.bytesOut=11436820 > freeMemory=306523 > totalMemory=882624 sessions=51 sessionsInMem=51 > scheduler.listenTh=12 > scheduler.idleTh=0 scheduler.delayTh=0 scheduler.busyTh=1 > scheduler.totalTh=13 scheduler.delayRq=0 scheduler.droppedRq=0 > scheduler.delayMs=0 > 10/01 14:43:15 metrics > jrpp.listenTh=1 jrpp.idleTh=19 jrpp.delayTh=0 jrpp.busyTh=3 > jrpp.totalTh=23 > jrpp.delayRq=107 jrpp.droppedRq=0 jrpp.handledRq=399 > jrpp.handledMs=134817 > jrpp.delayMs=0 jrpp.bytesIn=396272 jrpp.bytesOut=11027498 > freeMemory=322098 > totalMemory=882624 sessions=74 sessionsInMem=74 > scheduler.listenTh=12 > scheduler.idleTh=0 scheduler.delayTh=0 scheduler.busyTh=1 > scheduler.totalTh=13 scheduler.delayRq=0 scheduler.droppedRq=0 > scheduler.delayMs=0 > 10/01 14:44:15 metrics > jrpp.listenTh=1 jrpp.idleTh=18 jrpp.delayTh=0 jrpp.busyTh=4 > jrpp.totalTh=23 > jrpp.delayRq=85 jrpp.droppedRq=0 jrpp.handledRq=405 > jrpp.handledMs=167584 > jrpp.delayMs=0 jrpp.bytesIn=399513 jrpp.bytesOut=10235531 > freeMemory=333136 > totalMemory=882624 sessions=95 sessionsInMem=95 > scheduler.listenTh=12 > scheduler.idleTh=0 scheduler.delayTh=0 scheduler.busyTh=1 > scheduler.totalTh=13 scheduler.delayRq=0 scheduler.droppedRq=0 > scheduler.delayMs=0 > > TIA > Bert > > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| Discover CFTicket - The leading ColdFusion Help Desk and Trouble Ticket application http://www.houseoffusion.com/banners/view.cfm?bannerid=48 Message: http://www.houseoffusion.com/lists.cfm/link=i:10:5088 Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/10 Subscription: http://www.houseoffusion.com/lists.cfm/link=s:10 Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.10 Donations & Support: http://www.houseoffusion.com/tiny.cfm/54
