I have a third-party app that is core dumping.  Some days I get 8 to 10 dumps.
No one (except me) is complaining about any problems using the application.
This has been going on for several months, so I can forget about trying to
identify any application changes that might point to the source.

I have a crontab set up to check /var/core once an hour and run the following 
mdb commands on any new dump files (the files are moved to a directory on an 
NFS mount to keep /var from filling):

mdb $bin $i <<EOA

  =nn"** Core file status **"
  ="------------------------"
  ::status

  =nn"** Thread stack **"
  ="----------------------"
  ::stack

  =nn"** regs **"
  ="----------------------"
  ::regs

  =nn"** Shared objects **"
  ="----------------------"
  ::objects

EOA
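
For context, the hourly job around that mdb call looks roughly like the sketch below. This is a simplified outline, not the exact script: the archive directory /nfs/coredumps, the function names, and the .txt report name are placeholders, and BIN is assumed to be the f90webm path that ::status reports. It is written for ksh or bash rather than the old Bourne /bin/sh.

```shell
# Sketch of the hourly cron job (ksh/bash). Paths other than /var/core
# and the f90webm binary are placeholders, not the real layout.
CORE_DIR=${CORE_DIR:-/var/core}
ARCHIVE_DIR=${ARCHIVE_DIR:-/nfs/coredumps}
BIN=${BIN:-/u01/app/oracle/product/10gASforms/9.0.4/bin/f90webm}

# Move every file found in CORE_DIR to ARCHIVE_DIR and print its new path.
# Anything still sitting in CORE_DIR is "new", because processed dumps
# have already been moved away.
archive_cores() {
    for f in "$CORE_DIR"/*; do
        [ -f "$f" ] || continue      # empty directory: glob matched nothing
        dest="$ARCHIVE_DIR/$(basename "$f")"
        mv "$f" "$dest" && echo "$dest"
    done
}

# Run a short mdb report against one archived core file.
report_core() {
    bin=$1
    i=$2
    mdb "$bin" "$i" <<EOA
  ::status
  ::stack
EOA
}

# Archive the new dumps, then write one report per dump next to it.
process_new_cores() {
    archive_cores | while read -r core; do
        report_core "$BIN" "$core" > "$core.txt" 2>&1
    done
}

process_new_cores
```

Moving the file first and analyzing the copy on the NFS mount keeps /var from filling even if the mdb step fails or hangs.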

::status almost always shows the following (the signal varies: sometimes it is
SIGBUS or SIGSEGV instead of SIGILL):

                ** Core file status **
                ------------------------
debugging core file of f90webm (32-bit) from kermit
executable file: /u01/app/oracle/product/10gASforms/9.0.4/bin/f90webm
initial argv:
/u01/app/oracle/product/10gASforms/9.0.4/bin/f90webm server webfile=HTTP-0,0,0,
threading model: multi-threaded
status: process terminated by SIGILL (Illegal Instruction)


::stack  almost always starts (ends?) with:

                ** Thread stack **
                ----------------------
0x1638adc(3, 1814490, ffbfa32c, 0, 0, 0)
ixwsru+0x114(12703e8, ffbfa394, 0, 0, 0, 0)
icgcud+0xe8(12703e8, ffbfa3f8, 100, 0, ffbfa464, 0)
ifzterm+0x1dc(12703e8, 1, 1, e, 81010100, ff0000)
siehjmpterm+0x2cc(e, 0, ffbfa698, 0, 0, 0)
libthread.so.1`__sighndlr+0xc(e, 0, ffbfa698, 1fab48, 0, 0)
libthread.so.1`call_user_handler+0x234(e, 0, ffbfa698, 0, 0, 0)
libthread.so.1`sigacthandler+0x64(e, 0, ffbfa698, 0, 0, 0)
libc.so.1`_read+0xc(6, 1290388, 400, 400, 7ffffff2, fece625c)
ixhgr_GenRead+0x6c(12703e8, 127dd78, 1290388, 400, ffbfab84, 0)
ixhrdh_ReadHeader+0x38(12703e8, 127dd78, 1290368, 10e3f54, 0, 0)
ixhhsm_HTTPStateMachine+0x250(12703e8, 127dd78, 1, fffffffe, 4, 0)
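
One detail in that trace: the first argument of the libthread handler frames (e) appears to be the signal number being delivered, printed in hex like the other frame arguments. A quick way to decode it, assuming a shell whose printf accepts 0x constants and whose kill supports -l with a number (ksh and bash both do, as far as I know):

```shell
# "e" is copied from the first argument of the sigacthandler frame above.
# Treating it as the delivered signal number is my assumption.
sig_hex=e

# Frame arguments are hexadecimal, so convert to decimal first.
sig_num=$(printf '%d' "0x$sig_hex")
echo "signal number: $sig_num"

# Ask the shell for the matching signal name.
kill -l "$sig_num"
```

If that reading is right, a signal handler interrupted the read() call, and the crash happened further up, inside the handler's own call chain.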

I have talked to the third-party support people and they ... (well, they don't
have anything useful to add).

Without any other information, is there something I could get from the dump 
that would help with making the error more "observable"?  (Well, at least to 
the casual observer [me])

The server is a V490 running Solaris 9 4/04.  The application is an Oracle AS 
based ERP.

Thanks for any ideas/insight/wisdom,
Glen Gunselman
 
 
This message posted from opensolaris.org
