Hi,
As python and dash dump core too, this looks like a machine problem
rather than an icinga problem.
Did you check dmesg and /var/log/messages for errors?
Could it be a memory problem?
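To make that comparison less manual, here is a minimal sketch that groups the kernel segfault messages by fault address and instruction pointer, and lists which binaries were hit. The regex, the function names and the sample list are my own illustration, not anything shipped with icinga. It assumes each wrapped log entry has been joined back onto one line. The error code decoding assumes, as far as I know, that the x86 kernel prints the page fault error code in hex, so "error 14" would mean 0x14.

```python
import re
from collections import defaultdict

# Hypothetical pattern for the kernel's "segfault at ..." message,
# applied to a log entry that has been joined onto a single line.
SEGFAULT_RE = re.compile(
    r"(?P<prog>\S+)\[(?P<pid>\d+)\]:\s+segfault at (?P<addr>[0-9a-f]+) "
    r"ip (?P<ip>[0-9a-f]+) sp (?P<sp>[0-9a-f]+) error (?P<err>[0-9a-f]+)"
    r"(?: in (?P<obj>[^\[\s]+)\[)?"
)

# x86 page fault error code bits (assumption: the code is printed in hex).
PF_BITS = [
    (0x01, "protection"),
    (0x02, "write"),
    (0x04, "user-mode"),
    (0x08, "reserved-bit"),
    (0x10, "instruction-fetch"),
]


def decode_error(code_hex):
    """Return the names of the bits set in a hex page fault error code."""
    code = int(code_hex, 16)
    return [name for bit, name in PF_BITS if code & bit]


def summarize(lines):
    """Map (fault address, ip) to the set of binaries that crashed there."""
    groups = defaultdict(set)
    for line in lines:
        m = SEGFAULT_RE.search(line)
        if m:
            ip = m["ip"].lstrip("0") or "0"
            groups[(m["addr"], ip)].add(m["obj"] or m["prog"])
    return dict(groups)


# Entries taken from the kern.log excerpt quoted below, unwrapped.
SAMPLE = [
    "Nov 16 11:56:24 deu-s0244 kernel: [74631.877751] check_snmp_ibm_[31048]: "
    "segfault at 38 ip 0000000000000038 sp 00007fffb5dd9b70 error 14 in bash[400000+d9000]",
    "Nov 16 11:56:52 deu-s0244 kernel: [74659.792505] icinga[31355]: "
    "segfault at 38 ip 0000000000000038 sp 00007fff6101be20 error 14 in icinga[400000+a4000]",
    "Nov 16 12:10:18 deu-s0244 kernel: [75465.734324] sh[4711]: "
    "segfault at 38 ip 0000000000000038 sp 00007fff88f2c960 error 14 in dash[400000+19000]",
]

if __name__ == "__main__":
    for (addr, ip), binaries in summarize(SAMPLE).items():
        print(f"fault at {addr}, ip {ip}: {sorted(binaries)}")
    print(decode_error("14"))
```

If unrelated binaries such as bash, dash and icinga all end up in the same group, with the same address and the same error code, the cause is more likely something they share (the machine, the kernel or libc) than a bug in any one of them.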
On 23/11/2012 10:02, jan.co...@gmail.com wrote:
Hi,
We installed icinga 1.7.0 on a Debian 6.0 virtual machine. After a
while we got segfaults, so we installed icinga 1.8.1 from the debmon
repository, but we got segfaults again.
What can we check, or where is the problem? Here is the information
from the logs:
From kern.log:
Nov 16 11:56:24 deu-s0244 kernel: [74631.877751]
check_snmp_ibm_[31048]: segfault at 38 ip 0000000000000038 sp
00007fffb5dd9b70 error 14 in bash[400000+d9000]
Nov 16 11:56:52 deu-s0244 kernel: [74659.792505] icinga[31355]:
segfault at 38 ip 0000000000000038 sp 00007fff6101be20 error 14 in
icinga[400000+a4000]
Nov 16 12:10:18 deu-s0244 kernel: [75465.734324] sh[4711]: segfault at
38 ip 0000000000000038 sp 00007fff88f2c960 error 14 in dash[400000+19000]
Nov 16 12:20:59 deu-s0244 kernel: [76106.825141] icinga[9252]:
segfault at 38 ip 0000000000000038 sp 00007fff61013d50 error 14 in
icinga[400000+a4000]
Nov 16 12:53:07 deu-s0244 kernel: [78034.792423] icinga[30323]:
segfault at 38 ip 0000000000000038 sp 00007fff6101be20 error 14 in
icinga[400000+a4000]
Nov 17 13:55:02 deu-s0244 kernel: [168149.650410]
check_snmp_ibm_[22135]: segfault at 38 ip 0000000000000038 sp
00007fffb1b061d0 error 14 in bash[400000+d9000]
Nov 17 17:46:39 deu-s0244 kernel: [182046.762417] icinga[20803]:
segfault at 38 ip 0000000000000038 sp 00007fffae2445a0 error 14 in
icinga[400000+a4000]
...
Nov 20 13:50:41 xxx kernel: [ 2880.448776] python[22380]: segfault at
38 ip 0000000000000038 sp 00007fff8a0068f0 error 14 in
python2.6[400000+21d000]
Nov 20 13:54:26 xxx kernel: [ 3104.548646]
icinga[1383]: segfault at 38 ip 0000000000000038 sp 00007fffa0b1eb00
error 14 in icinga[400000+a7000]
Nov 20 15:55:58 xxx kernel: [10397.302069] python[9209]: segfault at
38 ip 0000000000000038 sp 00007fffffffbb60 error 14 in
python2.6[400000+21d000]
From icinga.debug:
[1353498301.206147] [2048.1] [pid=11777] **** END MACRO PROCESSING
*************
[1353498301.206199] [001.0] [pid=11777] process_macros_r()
[1353498301.206251] [001.0] [pid=11777] process_macros_r()
[1353498301.206738] [001.0] [pid=11777] process_macros_r()
[1353498301.206793] [001.0] [pid=11777] process_macros_r()
[1353498301.206846] [001.0] [pid=11777] process_macros_r()
[1353498301.211711] [016.0] [pid=11778] running command
/usr/lib/nagios/plugins/check_snmp_storage.pl
-H xx.xx.xx.xx -C GET01SRVLAN -m ^[C]
-w 80% -c 90% 0 -f via popen
[1353498301.409464] [016.2] [pid=11778] Moving temp check result file
'/tmp/check8jCbM7' to queue file
'/var/lib/icinga/spool/checkresults/cbQhIIT'...
I tried gdb:
[Thread debugging using libthread_db enabled]
[New Thread 0x7ffff61e8700 (LWP 11946)]
Program received signal SIGSEGV, Segmentation fault.
#0  0x0000000000000038 in ?? ()
No symbol table info available.
#1  0x00007ffff73cd748 in ?? () from /lib/libc.so.6
No symbol table info available.
#2  0x00007fffffffd440 in ?? ()
No symbol table info available.
#3  0x0000000000000000 in ?? ()
No symbol table info available.
rax            0x2e01              11777
rbx            0x7fffffffd420      140737488344096
rcx            0x38                56
rdx            0x246               582
rsi            0x7ffff7feb9d0      140737354054096
rdi            0x2ea5              11941
rbp            0x7fffffffd480      0x7fffffffd480
rsp            0x7fffffffd420      0x7fffffffd420
r8             0x0                 0
r9             0x7fffffffd440      140737488344128
r10            0x0                 0
r11            0x124ace0           19180768
r12            0x0                 0
r13            0x7fffffffd440      140737488344128
r14            0x0                 0
r15            0x124ace0           19180768
rip            0x38                0x38
eflags         0x252cc2            [ ZF SF DF OF #13 RF AC ID ]
cs             0x33                51
ss             0x2b                43
ds             0x0                 0
es             0x0                 0
fs             0x0                 0
gs             0x0                 0
fctrl          0x37f               895
fstat          0x0                 0
ftag           0xffff              65535
fiseg          0x0                 0
fioff          0x0                 0
foseg          0x0                 0
fooff          0x0                 0
fop            0x0                 0
mxcsr          0x1fa0              [ PE IM DM ZM OM UM PM ]
0x38: Cannot access memory at address 0x38
Thread 2 (Thread 0x7ffff61e8700 (LWP 11946)):
#0  0x00007ffff7132c13 in poll () from /lib/libc.so.6
#1  0x00000000004457d6 in command_file_worker_thread (arg=<value
optimized out>) at utils.c:3784
#2  0x00007ffff73d68ca in start_thread () from /lib/libpthread.so.0
#3  0x00007ffff713db6d in clone () from /lib/libc.so.6
#4  0x0000000000000000 in ?? ()
Thread 1 (Thread 0x7ffff7feb700 (LWP 11941)):
#0  0x0000000000000038 in ?? ()
#1  0x00007ffff73cd748 in ?? () from /lib/libc.so.6
#2  0x00007fffffffd440 in ?? ()
#3  0x0000000000000000 in ?? ()
A debugging session is active.
Inferior 1 [process 11941] will be killed.
Quit anyway? (y or n) [answered Y; input not from terminal]
Let me know if you need more information.
Thanks,
jan
------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
icinga-users mailing list
icinga-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/icinga-users
--
Serge Noiraud
Free software is better: Debian, Firefox, LibreOffice, ...
Everything that is not free is lost.
Developer, gramps project: http://www.gramps-project.org/