Re: Sig 11 Segfault in net/net-snmp net-snmp-5.1.3p4 in 4.0/i386

2006-12-11 Thread Brian A. Seklecki


And with internal debugging:

[...]
snmp_agent: REMOVE session == 0x7d6bbc80
trace: free_agent_snmp_session(): snmp_agent.c, 1257:
snmp_agent: agent_session 0x7d6bbc80 released
trace: handle_snmp_packet(): snmp_agent.c, 1794:
snmp_agent: end of handle_snmp_packet, asp = 0x7d6bbc80
trace: snmp_sess_select_info(): snmp_api.c, 5630:
sess_select: for all sessions: 17 16 13 11 8
trace: _sess_read(): snmp_api.c, 5216:
sess_read: not reading 17 (fdset 0xcf7de6c0 set 0)
trace: _sess_read(): snmp_api.c, 5216:
sess_read: not reading 16 (fdset 0xcf7de6c0 set 0)
trace: netsnmp_callback_recv(): snmpCallbackDomain.c, 188:
transport_callback: hook_recv enter
trace: netsnmp_callback_recv(): snmpCallbackDomain.c, 214:
transport_callback: hook_recv exit
trace: _sess_process_packet(): snmp_api.c, 4898:
sess_process_packet: session 0x8abdafb0 fd 13 pkt 0x88f86000 length 1
trace: callback_debug_pdu(): snmpCallbackDomain.c, 91:
dump_recv_callback_transport: PDU: command = 162, errstat = 0, errindex = 
0

trace: callback_debug_pdu(): snmpCallbackDomain.c, 93:
dump_recv_callback_transport:   var 2:UCD-SNMP-MIB::prErrorFlag.1 = 
INTEGER: 0

trace: _sess_read(): snmp_api.c, 5216:
sess_read: not reading 11 (fdset 0xcf7de6c0 set 0)
trace: _sess_read(): snmp_api.c, 5216:
sess_read: not reading 8 (fdset 0xcf7de6c0 set 0)
trace: mte_get_response(): disman/mteTriggerTable.c, 3107:
mteTriggerTable: got a variables: UCD-SNMP-MIB::prErrorFlag.1 = INTEGER: 0
trace: mte_run_trigger(): disman/mteTriggerTable.c, 3375:
mteTriggerTable: received UCD-SNMP-MIB::prErrorFlag.1 = INTEGER: 0 (type 
2)

trace: mte_run_trigger(): disman/mteTriggerTable.c, 3528:
mteTriggerTable: value: 0 0 0 x: 0 0 0
trace: mte_run_trigger(): disman/mteTriggerTable.c, 3536:
mteTriggerTable: boolean result: x=0 != configured=0 = 0
trace: header_complex_add_data_by_oid(): header_complex.c, 417:
header_complex_add_data: adding something...
Segmentation fault (core dumped)


On Mon, 11 Dec 2006, Brian A. Seklecki wrote:



With debugging symbols:

#0  0x008b8d71 in memmove () from /usr/lib/libc.so.39.3
#1  0x0ecaaf7e in snmp_set_var_objid (vp=0x2c, objid=0x7d8c7018, 
name_length=11) at snmp_client.c:652
#2  0x0ecd1bac in snmp_varlist_add_variable (varlist=0x87618844, 
name=0x7d8c7018, name_length=11, type=5 '\005', value=0x0,

   len=0) at snmp_api.c:6259
#3  0x0ecd1b3c in snmp_pdu_add_variable (pdu=0x87618800, name=0x7d8c7018, 
name_length=11, type=5 '\005', value=0x0, len=0)

   at snmp_api.c:6232
#4  0x0ecaa87b in snmp_add_null_var (pdu=0x87618800, name=0x7d8c7018, 
name_length=11) at snmp_client.c:157
#5  0x01bcdb35 in mte_run_trigger (clientreg=1) at 
disman/mteTriggerTable.c:3309

#6  0x0ecea9c3 in run_alarms () at snmp_alarm.c:248
#7  0x1c003da5 in SnmpdCatchRandomSignal ()
#8  0x1c003204 in SnmpdCatchRandomSignal ()
#9  0x1c001ea0 in ?? ()
#10 0x0005 in ?? ()
#11 0xcf7f7df4 in ?? ()
#12 0xcf7f7e0c in ?? ()
#13 0x1c001e31 in ?? ()
#14 0xcf7f7f6c in ?? ()
#15 0xcf7f7dd0 in ?? ()
#16 0x1c001885 in ?? ()
#17 0x1c001e8e in ?? ()
#18 0x in ?? ()


Running it in -f -D -L now ~BAS

On Mon, 11 Dec 2006, Brian A. Seklecki wrote:



I've got a pretty consistent segfault:

[EMAIL PROTECTED]:/home/seklecki# gdb /usr/local/sbin/snmpd 
/home/seklecki/snmpd.core


Core was generated by `snmpd'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /usr/local/lib/libnetsnmpagent.so.6.3...(no debugging 
symbols found)...done.

Loaded symbols for /usr/local/lib/libnetsnmpagent.so.6.3
Reading symbols from /usr/local/lib/libnetsnmpmibs.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmpmibs.so.6.3
Reading symbols from /usr/local/lib/libnetsnmphelpers.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmphelpers.so.6.3
Reading symbols from /usr/lib/libwrap.so.4.0...done.
Loaded symbols for /usr/lib/libwrap.so.4.0
Reading symbols from /usr/local/lib/libnetsnmp.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmp.so.6.3
Reading symbols from /usr/lib/libkvm.so.8.0...done.
Loaded symbols for /usr/lib/libkvm.so.8.0
Reading symbols from /usr/lib/libz.so.4.1...done.
Loaded symbols for /usr/lib/libz.so.4.1
Reading symbols from /usr/lib/libcrypto.so.13.0...done.
Loaded symbols for /usr/lib/libcrypto.so.13.0
Reading symbols from /usr/lib/libm.so.2.3...done.
Loaded symbols for /usr/lib/libm.so.2.3
Reading symbols from /usr/lib/libc.so.39.3...done.
Loaded symbols for /usr/lib/libc.so.39.3
Reading symbols from /usr/libexec/ld.so...done.
Loaded symbols for /usr/libexec/ld.so

#0  0x089b3d71 in memmove () from /usr/lib/libc.so.39.3
(gdb) bt
#0  0x089b3d71 in memmove () from /usr/lib/libc.so.39.3
#1  0x0cf6bede in snmp_set_var_objid () from 
/usr/local/lib/libnetsnmp.so.6.3
#2  0x0cf92b0c in snmp_varlist_add_variable () from 
/usr/local/lib/libnetsnmp.so.6.3
#3  0x0cf92a9c in snmp_pdu_add_variable () from 
/usr/local/lib/libnetsnmp.so.6.3
#4  0x0cf6b7db in snmp_add_null_var () from 
/usr/local/lib/libnetsnmp.so.6.3
#5  0x08757bf5 in mte_run_trigger (

Re: Sig 11 Segfault in net/net-snmp net-snmp-5.1.3p4 in 4.0/i386

2006-12-11 Thread Brian A. Seklecki


With debugging symbols:

#0  0x008b8d71 in memmove () from /usr/lib/libc.so.39.3
#1  0x0ecaaf7e in snmp_set_var_objid (vp=0x2c, objid=0x7d8c7018, 
name_length=11) at snmp_client.c:652
#2  0x0ecd1bac in snmp_varlist_add_variable (varlist=0x87618844, 
name=0x7d8c7018, name_length=11, type=5 '\005', value=0x0,

len=0) at snmp_api.c:6259
#3  0x0ecd1b3c in snmp_pdu_add_variable (pdu=0x87618800, name=0x7d8c7018, 
name_length=11, type=5 '\005', value=0x0, len=0)

at snmp_api.c:6232
#4  0x0ecaa87b in snmp_add_null_var (pdu=0x87618800, name=0x7d8c7018, 
name_length=11) at snmp_client.c:157
#5  0x01bcdb35 in mte_run_trigger (clientreg=1) at 
disman/mteTriggerTable.c:3309

#6  0x0ecea9c3 in run_alarms () at snmp_alarm.c:248
#7  0x1c003da5 in SnmpdCatchRandomSignal ()
#8  0x1c003204 in SnmpdCatchRandomSignal ()
#9  0x1c001ea0 in ?? ()
#10 0x0005 in ?? ()
#11 0xcf7f7df4 in ?? ()
#12 0xcf7f7e0c in ?? ()
#13 0x1c001e31 in ?? ()
#14 0xcf7f7f6c in ?? ()
#15 0xcf7f7dd0 in ?? ()
#16 0x1c001885 in ?? ()
#17 0x1c001e8e in ?? ()
#18 0x in ?? ()


Running it in -f -D -L now ~BAS

On Mon, 11 Dec 2006, Brian A. Seklecki wrote:



I've got a pretty consistent segfault:

[EMAIL PROTECTED]:/home/seklecki# gdb /usr/local/sbin/snmpd 
/home/seklecki/snmpd.core

Core was generated by `snmpd'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /usr/local/lib/libnetsnmpagent.so.6.3...(no debugging 
symbols found)...done.

Loaded symbols for /usr/local/lib/libnetsnmpagent.so.6.3
Reading symbols from /usr/local/lib/libnetsnmpmibs.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmpmibs.so.6.3
Reading symbols from /usr/local/lib/libnetsnmphelpers.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmphelpers.so.6.3
Reading symbols from /usr/lib/libwrap.so.4.0...done.
Loaded symbols for /usr/lib/libwrap.so.4.0
Reading symbols from /usr/local/lib/libnetsnmp.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmp.so.6.3
Reading symbols from /usr/lib/libkvm.so.8.0...done.
Loaded symbols for /usr/lib/libkvm.so.8.0
Reading symbols from /usr/lib/libz.so.4.1...done.
Loaded symbols for /usr/lib/libz.so.4.1
Reading symbols from /usr/lib/libcrypto.so.13.0...done.
Loaded symbols for /usr/lib/libcrypto.so.13.0
Reading symbols from /usr/lib/libm.so.2.3...done.
Loaded symbols for /usr/lib/libm.so.2.3
Reading symbols from /usr/lib/libc.so.39.3...done.
Loaded symbols for /usr/lib/libc.so.39.3
Reading symbols from /usr/libexec/ld.so...done.
Loaded symbols for /usr/libexec/ld.so

#0  0x089b3d71 in memmove () from /usr/lib/libc.so.39.3
(gdb) bt
#0  0x089b3d71 in memmove () from /usr/lib/libc.so.39.3
#1  0x0cf6bede in snmp_set_var_objid () from /usr/local/lib/libnetsnmp.so.6.3
#2  0x0cf92b0c in snmp_varlist_add_variable () from 
/usr/local/lib/libnetsnmp.so.6.3
#3  0x0cf92a9c in snmp_pdu_add_variable () from 
/usr/local/lib/libnetsnmp.so.6.3

#4  0x0cf6b7db in snmp_add_null_var () from /usr/local/lib/libnetsnmp.so.6.3
#5  0x08757bf5 in mte_run_trigger () from 
/usr/local/lib/libnetsnmpmibs.so.6.3

#6  0x0cfab923 in run_alarms () from /usr/local/lib/libnetsnmp.so.6.3
#7  0x1c003da5 in SnmpdCatchRandomSignal ()
#8  0x1c003204 in SnmpdCatchRandomSignal ()
#9  0x1c001ea0 in ?? ()
#10 0x0005 in ?? ()
#11 0xcf7df148 in ?? ()
#12 0xcf7df160 in ?? ()
#13 0x1c001e31 in ?? ()
#14 0xcf7df2bc in ?? ()
#15 0xcf7df124 in ?? ()
#16 0x1c001885 in ?? ()
#17 0x1c001e8e in ?? ()
#18 0x in ?? ()
(gdb)

This system 4.0-stable as of last Wednesday with a slightly modified kernel 
(RAIDFrame enabled):


[EMAIL PROTECTED]:/home/seklecki# uname -a
OpenBSD br0 4.0 GENERIC+RAIDFrame#2 i386 (full dmesg below)

I will try to recompile Net-SNMP from source to see if the mismatch between 
GENERIC and the "recommended" 4.0 binaries is the cause.  I will also try to 
figure out why it's only partially being built with debugging symbols.


~BAS

l8*
-lava (Brian A. Seklecki - Pittsburgh, PA, USA)
   http://www.spiritual-machines.org/

"...from back in the heady days when "helpdesk" meant nothing, "diskquota"
meant everything, and lives could be bought and sold for a couple of pages
of laser printout - and frequently were."

OpenBSD 4.0-stable (GENERIC+RAIDFrame) #2: Wed Dec  6 22:26:09 EST 2006
   [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC+RAIDFrame
cpu0: Intel(R) Xeon(TM) CPU 3.00GHz ("GenuineIntel" 686-class) 3 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,

SSE3,MWAIT,DS-CPL,CNXT-ID,CX16
real mem  = 534921216 (522384K)
avail mem = 479571968 (468332K)
using 4256 buffers containing 26849280 bytes (26220K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(00) BIOS, date 08/18/05, BIOS32 rev. 0 @ 0xffe90, 
SMBIOS rev. 2.3 @ 0xf0450 (92 entries)

bios0: Dell Inc. PowerEdge SC1420
apm0 at bios0: Power Management spec V1.2
apm0: AC on, battery charge unknown
apm0: flags 30102 dobusy 0 doidle 1
pcibios0 at bio

Sig 11 Segfault in net/net-snmp net-snmp-5.1.3p4 in 4.0/i386

2006-12-11 Thread Brian A. Seklecki


I've got a pretty consistent segfault:

[EMAIL PROTECTED]:/home/seklecki# gdb /usr/local/sbin/snmpd 
/home/seklecki/snmpd.core


Core was generated by `snmpd'.
Program terminated with signal 11, Segmentation fault.
Reading symbols from /usr/local/lib/libnetsnmpagent.so.6.3...(no debugging 
symbols found)...done.

Loaded symbols for /usr/local/lib/libnetsnmpagent.so.6.3
Reading symbols from /usr/local/lib/libnetsnmpmibs.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmpmibs.so.6.3
Reading symbols from /usr/local/lib/libnetsnmphelpers.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmphelpers.so.6.3
Reading symbols from /usr/lib/libwrap.so.4.0...done.
Loaded symbols for /usr/lib/libwrap.so.4.0
Reading symbols from /usr/local/lib/libnetsnmp.so.6.3...done.
Loaded symbols for /usr/local/lib/libnetsnmp.so.6.3
Reading symbols from /usr/lib/libkvm.so.8.0...done.
Loaded symbols for /usr/lib/libkvm.so.8.0
Reading symbols from /usr/lib/libz.so.4.1...done.
Loaded symbols for /usr/lib/libz.so.4.1
Reading symbols from /usr/lib/libcrypto.so.13.0...done.
Loaded symbols for /usr/lib/libcrypto.so.13.0
Reading symbols from /usr/lib/libm.so.2.3...done.
Loaded symbols for /usr/lib/libm.so.2.3
Reading symbols from /usr/lib/libc.so.39.3...done.
Loaded symbols for /usr/lib/libc.so.39.3
Reading symbols from /usr/libexec/ld.so...done.
Loaded symbols for /usr/libexec/ld.so

#0  0x089b3d71 in memmove () from /usr/lib/libc.so.39.3
(gdb) bt
#0  0x089b3d71 in memmove () from /usr/lib/libc.so.39.3
#1  0x0cf6bede in snmp_set_var_objid () from 
/usr/local/lib/libnetsnmp.so.6.3
#2  0x0cf92b0c in snmp_varlist_add_variable () from 
/usr/local/lib/libnetsnmp.so.6.3
#3  0x0cf92a9c in snmp_pdu_add_variable () from 
/usr/local/lib/libnetsnmp.so.6.3
#4  0x0cf6b7db in snmp_add_null_var () from 
/usr/local/lib/libnetsnmp.so.6.3
#5  0x08757bf5 in mte_run_trigger () from 
/usr/local/lib/libnetsnmpmibs.so.6.3

#6  0x0cfab923 in run_alarms () from /usr/local/lib/libnetsnmp.so.6.3
#7  0x1c003da5 in SnmpdCatchRandomSignal ()
#8  0x1c003204 in SnmpdCatchRandomSignal ()
#9  0x1c001ea0 in ?? ()
#10 0x0005 in ?? ()
#11 0xcf7df148 in ?? ()
#12 0xcf7df160 in ?? ()
#13 0x1c001e31 in ?? ()
#14 0xcf7df2bc in ?? ()
#15 0xcf7df124 in ?? ()
#16 0x1c001885 in ?? ()
#17 0x1c001e8e in ?? ()
#18 0x in ?? ()
(gdb)

This system 4.0-stable as of last Wednesday with a slightly modified 
kernel (RAIDFrame enabled):


[EMAIL PROTECTED]:/home/seklecki# uname -a
OpenBSD br0 4.0 GENERIC+RAIDFrame#2 i386 (full dmesg below)

I will try to recompile Net-SNMP from source to see if the mismatch 
between GENERIC and the "recommended" 4.0 binaries is the cause.  I will 
also try to figure out why it's only partially being built with debugging 
symbols.


~BAS

l8*
-lava (Brian A. Seklecki - Pittsburgh, PA, USA)
   http://www.spiritual-machines.org/

"...from back in the heady days when "helpdesk" meant nothing, "diskquota"
meant everything, and lives could be bought and sold for a couple of pages
of laser printout - and frequently were."

OpenBSD 4.0-stable (GENERIC+RAIDFrame) #2: Wed Dec  6 22:26:09 EST 2006
[EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC+RAIDFrame
cpu0: Intel(R) Xeon(TM) CPU 3.00GHz ("GenuineIntel" 686-class) 3 GHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,

SSE3,MWAIT,DS-CPL,CNXT-ID,CX16
real mem  = 534921216 (522384K)
avail mem = 479571968 (468332K)
using 4256 buffers containing 26849280 bytes (26220K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(00) BIOS, date 08/18/05, BIOS32 rev. 0 @ 
0xffe90, SMBIOS rev. 2.3 @ 0xf0450 (92 entries)

bios0: Dell Inc. PowerEdge SC1420
apm0 at bios0: Power Management spec V1.2
apm0: AC on, battery charge unknown
apm0: flags 30102 dobusy 0 doidle 1
pcibios0 at bios0: rev 2.1 @ 0xf/0x1
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfeb00/256 (14 entries)
pcibios0: PCI Interrupt Router at 000:31:0 ("Intel 82801EB/ER LPC" rev 
0x00)

pcibios0: PCI bus #6 is the last bus
bios0: ROM list: 0xc/0x8000 0xc8000/0x800 0xc8800/0x1800! 
0xca000/0x2000

cpu0 at mainbus0
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel E7520 MCH" rev 0x09
"Intel E7520 MCH ERR" rev 0x09 at pci0 dev 0 function 1 not configured
ppb0 at pci0 dev 2 function 0 "Intel MCH PCIE" rev 0x09
pci1 at ppb0 bus 1
ppb1 at pci1 dev 0 function 0 "Intel PCIE-PCIE" rev 0x00
pci2 at ppb1 bus 2
vga1 at pci2 dev 12 function 0 "ATI Mach64 GO" rev 0x27
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
ppb2 at pci1 dev 0 function 2 "Intel PCIE-PCIE" rev 0x00
pci3 at ppb2 bus 3
xl0 at pci3 dev 13 function 0 "3Com 3c905C 100Base-TX" rev 0x74: irq 10, 
address 00:50:da:28:37:7f

bmtphy0 at xl0 phy 24: Broadcom 3C905C internal PHY, rev. 6
em0 at pci3 dev 14 function 0 "Intel PRO/1000MT (82545GM)" rev 0x04: irq 
11, address 00: