Susan, Axton, Ben, etc...

We have the same issues.  Our server comes and goes as it pleases, and 
we've yet to track down exactly what the cause is.  We do know some of our 
problems stem from plugins, most of the time it's ardbcquery.  The easiest 
way to crash a thread is to open either the Overview or Problem console. 
Other times, the server just gives up, we get ARERR93's and end up having 
to restart the service.

I think things have gotten worse after patch002 and 003, but things have 
never been stable.

Here's an excerpt from a High priority ticket we have open -- with no 
solutions so far:

Faulting application arcmdbd.exe, version 2.0.1.3, faulting module 
arcmdbd.exe, version 2.0.1.3, fault address 0x0000a981. 

Mon Jul 02 12:45:05 2007  390695 : AR System server terminated when a 
signal/exception was received by the server (ARNOTE 20)
Mon Jul 02 12:45:05 2007     0xc0000005
Mon Jul 02 12:45:05 2007  390695 : AR System server terminated -- fatal 
error encountered (ARNOTE 21)
Mon Jul 02 12:46:12 2007  390695 : AR System server terminated when a 
signal/exception was received by the server (ARNOTE 20)
Mon Jul 02 12:46:12 2007     0xc0000005

We're running with lots of threads, but are on a 8 processor (dual core 
3ghz) box with 18gb of ram.  :-)  Remedy is still slow.  Our DB flies, and 
network is fine.

Private-RPC-Socket:  390601   2   8
Private-RPC-Socket:  390620   4   8
Private-RPC-Socket:  390621   2   8
Private-RPC-Socket:  390626   4  12
Private-RPC-Socket:  390627   2   8
Private-RPC-Socket:  390628   12 12
Private-RPC-Socket:  390629   2   8
Private-RPC-Socket:  390635   4   8
Private-RPC-Socket:  390690   1   6


Windows 2003R2 EE
7.0.0.1 patch 003
Remote Oracle 10gR2 RAC/Linux
ITSM7p004

-tony

-- 
Tony Worthington
[EMAIL PROTECTED]
262-703-5911



Ben Cantatore <[EMAIL PROTECTED]> 
Sent by: "Action Request System discussion list(ARSList)" 
<[email protected]>
07/11/2007 12:02 AM
Please respond to
[email protected]


To
[email protected]
cc

Subject
Re: ARS v7.0.1P2 LIST threads initiate inexplainably






** 
Axton and Susan, you're not alone with the stack errors.  I've been 
plagued with them since my launch back in Feb.  In my situation the server 
runs about 2 weeks and crashes and usually recovers on its own.  One 
problem I had which I could reproduce crashing the server was fixed by 
patch 3 so I think what Davies is saying applied to that.  The stack 
problems have gotten a little better, but still happens.  I have API, 
Escalation, Filter, SQL and Thread logging turned on and am currently 
waiting for the next crash.  So I'll post then with any interesting 
information that surfaces from that. 

Susan, sounds like you're having a worse problem than I am.  How often do 
the crashes happen, does the arserver recover on its own? 

Currently on Server 7.0.1 patch 2 and ITSM 7.0.2 patch 3 for all modules 
connecting to Oracle 10g db. 

Ben Cantatore
Remedy Administrator
Avon
(914) 935-2946 


"Davies, J.T." <[EMAIL PROTECTED]> 
Sent by: "Action Request System discussion list(ARSList)" 
<[email protected]> 
07/10/2007 05:38 PM 

Please respond to
[email protected]


To
[email protected] 
cc

Subject
Re: ARS v7.0.1P2 LIST threads initiate inexplainably








** 
Hi Susan, 
  
I've experienced these errors before, too, a long time ago: Signal 5 
termination on a specific thread. 
  
I was able to trace it back to a workflow problem, and with your jump from 
5.1.2 to 7.0, I'd imagine you might be in the same boat. 
  
What I found was with the Run Process commands. 
  
To give an example: 
  
You're calling the Application-Delete-Entry command.  It's expecting two 
inputs: Form and ID. 
  
The problem is akin to receiving a bad input value.  Perhaps the ID was 
missing, or the Form name was invalid...some wacky scenario where the 
command raised an error, and thus, the users execution died. 
  
I also found the errors were occurring if a command was expecting an 
integer value (say, the number of seconds to offset on a Business Time 
command), but the input was some character string ("ABC"). 
  
You might see if your logs (Filter and SQL) can narrow this down for you. 
You might have to mentally process some of these commands, because I do 
remember that it wasn't very apparent in the logs that it was failing on 
these. 
  
Hope this helps or provides a little more direction! 
  
J.T. 
New Edge Networks 
An Earthlink Company 

From: Action Request System discussion list(ARSList) 
[mailto:[EMAIL PROTECTED] On Behalf Of Susan Palmer
Sent: Tuesday, July 10, 2007 2:16 PM
To: [email protected]
Subject: ARS v7.0.1P2 LIST threads initiate inexplainably

** 
Since our upgrade from ARS 5.1.2 to 7.0.1P2 on 6/27/07 I have no life. Ok, 
that's the end of my rant. 
 
Everyday, with the exception of Sat/Sun even though there are some users 
on the system, we have a thread run wild issue.  It is with the LIST 
threads.  This manifested itself in a arerror94 - database timeout 
 
Of course everyone points to Remedy but it didn't feel like a Remedy 
issue.  I felt like network or database.  Well after 4 days of denial the 
other guys finally started looking at their sides and various changes were 
made.  Optimizations in oracle and some minor server changes.  Yesterday 
we actually ran the full high-production hours schedule without timing 
out.  Thought I was home free. 
 
Well on the way home the threads started to initiate again.  And finally 
by 10:30p I had to restart the services because we were at 14 threads and 
our max is 15.  Some sleep is required even after a v7 upgrade. 
 
Restarting the services clears the problem.  Yesterday we ran most of the 
day at 10 threads.  I check old statistics from May and that's what we 
were running at then.  I know, it seems high ... that's not the issue 
right now. 
 
We were having 343 errors appear after a restart which didn't seem all 
that bad but apparently are VERY bad.  Those got cleared away today and we 
are not seeing them anymore. 
 
So today, it's been a horrific day.  Four times we pushed the threads to 
the limit and had to restart services. 
 
Over the last week or so I've become intimately familiar with my sql logs 
again.   Looking for trends, patterns, hints.  Sometimes it seems like 
when a person logs in it causes a thread to stop and restart.  Or maybe 
it's the start of the thread buildup. 
 
But what I do see periodically are the following errors.  AND I think I'm 
see a pattern that after one of these there is a slow buildup of threads 
from 7-11 over 20-30 minutes then 12-15 is faster.  We start seeing 
hourglasses when I think we're at 14 on the way to 15. 
 
I'd like to know what the entry below is telling me.  I've asked bmc 
support but have received no answer.  I've looked in the sql logs at this 
time and don't see a real pattern to what the person was doing that would 
drive me in a certain direction. 
 
Any knowledge on this you can share would be appreciated. 
 
Thanks, 
Susan Palmer 
ShopperTrak 
 
ARS v7.0.1P1 
Oracle 10g 
Windows 2003 
Tue Jul 10 14:55:33 2007  390635 : AR System server terminated when a 
signal/exception was received by the server (ARNOTE 20)
Tue Jul 10 14:55:33 2007 
  Timestamp: Tue Jul 10 2007 14:55:34.1250 
  Thread Id: 2664 
  Version: 7.0.01 patch 002  200704021644 Apr  2 2007 20:12:39
  ServerName: remsrv 
  Database: SQL -- Oracle
  Hardware: Intel Pentium
  OS: Windows NT 5.2
  RPC Id: 13200
  RPC Call: 34 (EXP) 
  RPC Queue: 390635
  Client: User AdamsT from Remedy User (protocol 9) at IP address 
10.0.4.68
  Form: 
  Logging On: API SQL Thread
  Code: c0000005
  Operation: read 
  Access Addr: 736C6C69
  Stack Begin: 
  Stack End 
Tue Jul 10 14:55:33 2007  390635 : AR System server terminated when a 
signal/exception was received by the server (ARNOTE 20)
Tue Jul 10 14:55:33 2007     0xc0000005
Tue Jul 10 14:55:33 2007  390635 : AR System server terminated -- fatal 
error encountered (ARNOTE 21) 
Thread log that relates to the above error from a timing perspective: 
<THRD> /* Tue Jul 10 2007 14:15:16.3430 */ Thread Id 3108 (thread number 
21) application statistics thread started.
<THRD> /* Tue Jul 10 2007 14:55:34.3430 */ Thread Id 2664 (thread number 
16) on LIST queue died. 
<THRD> /* Tue Jul 10 2007 14:55:34.3430 */ Thread Id 4864 (thread number 
16) on LIST queue restarted.
<THRD> /* Tue Jul 10 2007 14:57:39.8750 */ Thread Id 5784 (thread number 
22) on LIST queue started. 
<THRD> /* Tue Jul 10 2007 14:59:05.8750 */ Thread Id 5640 (thread number 
23) on LIST queue started.
<THRD> /* Tue Jul 10 2007 14:59:08.0620 */ Thread Id 5568 (thread number 
24) on LIST queue started.
<THRD> /* Tue Jul 10 2007 14:59:27.4530 */ Thread Id 5652 (thread number 
25) on LIST queue started.
<THRD> /* Tue Jul 10 2007 15:02:14.5620 */ Thread Id 2332 (thread number 
26) on LIST queue started.
<THRD> /* Tue Jul 10 2007 15:03: 11.6560 */ Thread Id 5328 (thread number 
27) on LIST queue started.
<THRD> /* Tue Jul 10 2007 15:03:17.9370 */ Thread Id 4908 (thread number 
28) on LIST queue started.
<THRD> /* Tue Jul 10 2007 15:23:11.9370 */ Thread Id 5704 (thread number 
29) on LIST queue started.
<THRD> /* Tue Jul 10 2007 15:26:09.5460 */ Thread Id 964 (thread number 
30) on LIST queue started. 
 
 
This thread terror occurred today too.  First time FAST  threads have been 
involved: 
<THRD> /* Tue Jul 10 2007 13:11:11.7650 */ Thread Trace Log -- ON
<THRD> /* Tue Jul 10 2007 13:11:11.7810 */ Thread Id 5856 (thread number 
0) Thread Manager started.
<THRD> /* Tue Jul 10 2007 13:11: 11.7810 */ Thread Id 3236 (thread number  
1) timed call thread started.
<THRD> /* Tue Jul 10 2007 13:11:11.7810 */ Thread Id 5236 (thread number 
2) on ADMIN queue started.
<THRD> /* Tue Jul 10 2007 13:11: 37.7030 */ Thread Id 5740 (thread number  
3) on ALERT queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 5384 (thread number 
4) on ESCALATION queue started.
<THRD> /* Tue Jul 10 2007 13:11: 37.7030 */ Thread Id 5880 (thread number  
5) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 3256 (thread number 
6) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 3432 (thread number 
7) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 4812 (thread number 
8) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 3692 (thread number 
9) on FAST queue started. 
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 904 (thread number 
10) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 4508 (thread number 
11) on LIST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 4816 (thread number 
12) on LIST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 5792 (thread number 
13) on LIST queue started.
<THRD> /* Tue Jul 10 2007 13:11: 37.7030 */ Thread Id 4120 (thread number 
14) on LIST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 4216 (thread number 
15) on LIST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 2960 (thread number 
16) on LIST queue started.
<THRD> /* Tue Jul 10 2007 13:11:37.7030 */ Thread Id 2832 (thread number 
17) license monitor thread started.
<THRD> /* Tue Jul 10 2007 13:11: 37.7030 */ Thread Id 5104 (thread number 
18) archive thread started.
<THRD> /* Tue Jul 10 2007 13:11:37.7180 */ Thread Id 4960 (thread number 
19) computed group call thread started.
<THRD> /* Tue Jul 10 2007 13:11: 37.7180 */ Thread Id 5340 (thread number 
20) server statistics thread started.
<THRD> /* Tue Jul 10 2007 13:11:37.7180 */ Thread Id 4940 (thread number 
21) application statistics thread started.
<THRD> /* Tue Jul 10 2007 13:40: 36.1400 */ Thread Id 5880 (thread number  
5) on FAST queue died.
<THRD> /* Tue Jul 10 2007 13:40:36.1400 */ Thread Id 5372 (thread number 
5) on FAST queue restarted.
<THRD> /* Tue Jul 10 2007 13:40:36.1400 */ Thread Id 5372 (thread number 
5) on FAST queue died.
<THRD> /* Tue Jul 10 2007 13:40:36.1400 */ Thread Id 4196 (thread number 
5) on FAST queue restarted.
<THRD> /* Tue Jul 10 2007 13:40:36.1710 */ Thread Id 4812 (thread number 
8) on FAST queue died. 
<THRD> /* Tue Jul 10 2007 13:40:36.1710 */ Thread Id 884 (thread number 8) 
on FAST queue restarted.
<THRD> /* Tue Jul 10 2007 13:40:38.6710 */ Thread Id 3692 (thread number 
9) on FAST queue died.
<THRD> /* Tue Jul 10 2007 13:40:38.6710 */ Thread Id 1996 (thread number 
9) on FAST queue restarted.
<THRD> /* Tue Jul 10 2007 13:40:59.2500 */ Thread Id 4120 (thread number 
14) on LIST queue died.
<THRD> /* Tue Jul 10 2007 13:40: 59.2500 */ Thread Id 5860 (thread number 
14) on LIST queue restarted.
<THRD> /* Tue Jul 10 2007 13:41:39.4060 */ Thread Id 4216 (thread number 
15) on LIST queue died.
<THRD> /* Tue Jul 10 2007 13:41:39.4060 */ Thread Id 5108 (thread number 
15) on LIST queue restarted.
<THRD> /* Tue Jul 10 2007 13:41:42.9840 */ Thread Id 2960 (thread number 
16) on LIST queue died.
<THRD> /* Tue Jul 10 2007 13:41:42.9840 */ Thread Id 2700 (thread number 
16) on LIST queue restarted. 
<THRD> /* Tue Jul 10 2007 13:42:19.5150 */ Thread Id 3256 (thread number 
6) on FAST queue died.
<THRD> /* Tue Jul 10 2007 13:42:19.5150 */ Thread Id 2108 (thread number 
6) on FAST queue restarted.
<THRD> /* Tue Jul 10 2007 13:42:36.1710 */ Thread Id 3432 (thread number 
7) on FAST queue died.
<THRD> /* Tue Jul 10 2007 13:42:36.1710 */ Thread Id 4064 (thread number 
7) on FAST queue restarted.
<THRD> /* Tue Jul 10 2007 13:42: 46.6090 */ Thread Id 480 (thread number 
22) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:42:58.3590 */ Thread Id 4700 (thread number 
23) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:43:05.3430 */ Thread Id 4816 (thread number 
12) on LIST queue died.
<THRD> /* Tue Jul 10 2007 13:43:05.3430 */ Thread Id 5756 (thread number 
12) on LIST queue restarted.
<THRD> /* Tue Jul 10 2007 13:43:13.6710 */ Thread Id 356 (thread number 
24) on FAST queue started. 
<THRD> /* Tue Jul 10 2007 13:43:13.7180 */ Thread Id 904 (thread number 
10) on FAST queue died.
<THRD> /* Tue Jul 10 2007 13:43:13.7180 */ Thread Id 2812 (thread number 
10) on FAST queue restarted.
<THRD> /* Tue Jul 10 2007 13:43:25.6870 */ Thread Id 6140 (thread number 
25) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:43:31.7180 */ Thread Id 3960 (thread number 
26) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:43: 41.7810 */ Thread Id 2484 (thread number 
27) on FAST queue started.
<THRD> /* Tue Jul 10 2007 13:46:39.1870 */ Thread Id 5384 (thread number 
4) on ESCALATION queue died.
<THRD> /* Tue Jul 10 2007 13:46: 39.1870 */ Thread Id 3704 (thread number  
4) on ESCALATION queue restarted.
<THRD> /* Tue Jul 10 2007 13:48:45.8750 */ Thread Id 5792 (thread number 
13) on LIST queue died.
<THRD> /* Tue Jul 10 2007 13:48: 45.8750 */ Thread Id 824 (thread number 
13) on LIST queue restarted.
__20060125_______________________This posting was submitted with HTML in 
it___ 
__20060125_______________________This posting was submitted with HTML in 
it___ 
__20060125_______________________This posting was submitted with HTML in 
it___


CONFIDENTIALITY NOTICE: 
This is a transmission from Kohl's Department Stores, Inc.
and may contain information which is confidential and proprietary.
If you are not the addressee, any disclosure, copying or distribution or use of 
the contents of this message is expressly prohibited.
If you have received this transmission in error, please destroy it and notify 
us immediately at 262-703-7000.

CAUTION:
Internet and e-mail communications are Kohl's property and Kohl's reserves the 
right to retrieve and read any message created, sent and received.  Kohl's 
reserves the right to monitor messages to or from authorized Kohl's Associates 
at any time
without any further consent.

_______________________________________________________________________________
UNSUBSCRIBE or access ARSlist Archives at www.arslist.org ARSlist:"Where the 
Answers Are"

Reply via email to