On Fri, Sep 23, 2022 at 02:38:59PM +0000, mukund via Bird-users wrote:
>  
>  Hi,
> We are using bird version 2.0.9.  We have IXIA traffic which is sending 16K 
> OSPF  routes to our App which we send to Bird. Issue is seen when we restart 
> our App. As part of our restart app handling, we restart Bird as well.We see 
> a crash  with following BT
> VT33-VT33_PRI:~/backup/cores/coredump_2022-09-20.20.31.33# gdb bird 
> core.bird.30975GNU gdb (GDB) Red Hat Enterprise Linux 
> 7.6.1-120.0.1.el7Copyright (C) 2013 Free Software Foundation, Inc.License 
> GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>This is 
> free software: you are free to change and redistribute it.There is NO 
> WARRANTY, to the extent permitted by law.  Type "show copying"and "show 
> warranty" for details.This GDB was configured as 
> "x86_64-redhat-linux-gnu".For bug reporting instructions, please 
> see:<http://www.gnu.org/software/gdb/bugs/>...Reading symbols from 
> /home/talariuser/backup/cores/coredump_2022-09-20.20.31.33/bird...done.[New 
> LWP 30975][Thread debugging using libthread_db enabled]Using host 
> libthread_db library "/lib64/libthread_db.so.1".Core was generated by 
> `/home/talariuser/bird/sbin/bird -f'.Program terminated with signal 8, 
> Arithmetic exception.#0  0x000000000047262e in sl_free (s=0x783170, 
> oo=0x7ffff6e1fc60) at /tn-build/src/thir!
 d_party/bird/lib/slab.c:315315     /tn-build/src/third_party/bird/lib/slab.c: 
No such file or directory.Missing separate debuginfos, use: debuginfo-install 
glibc-2.17-325.0.1.el7_9.x86_64 openssl-libs-1.0.2k-22.el7_9.x86_64 
zlib-1.2.7-19.el7_9.x86_64(gdb) bt#0  0x000000000047262e in sl_free 
(s=0x783170, oo=0x7ffff6e1fc60) at 
/tn-build/src/third_party/bird/lib/slab.c:315#1  0x0000000000462e33 in 
fib_delete (f=0x75f508, E=0x7ffff6e1fc60) at 
/tn-build/src/third_party/bird/nest/rt-fib.c:479#2  0x0000000000420915 in 
rt_sync (p=0x75f350) at /tn-build/src/third_party/bird/proto/ospf/rt.c:2104#3  
0x00000000004218da in ospf_rt_spf.part.31 (p=0x75f350) at 
/tn-build/src/third_party/bird/proto/ospf/rt.c:1721#4  0x000000000041f403 in 
ospf_rt_spf (p=0x75f350) at 
/tn-build/src/third_party/bird/proto/ospf/rt.c:1692#5  0x000000000042fb8f in 
ospf_disp (timer=0x782f10) at 
/tn-build/src/third_party/bird/proto/ospf/ospf.c:482#6  0x00000000004736cf in 
timers_fire (loop=0x74e0e0 <main_timeloop>) !
 at /tn-build/src/third_party/bird/lib/timer.c:235#7  0x0000000000408d0a in 
io_loop () at /tn-build/src/third_party/bird/sysdep/unix/io.c:2260#8  
0x0000000000405af6 in main (argc=2, argv=0x7fffffffeb68) at 
/tn-build/src/third_party/bird/sysdep/unix/main.c:952
> Since all routes are newly passed and OSPF SPF calculation is done from 
> scratch, not sure why delete / free has an issue.
> Also issue is not seen when there are less routes say ~200. Seems to be 
> specific to huge number.If any pointers to this will be helpful
> Thanks in advance .

Hi

Could you send us core dump and bird binary? Also could you try 2.0.10?

-- 
Elen sila lumenn' omentielvo

Ondrej 'Santiago' Zajicek (email: [email protected])
OpenPGP encrypted e-mails preferred (KeyID 0x11DEADC3, wwwkeys.pgp.net)
"To err is human -- to blame it on a computer is even more so."

Reply via email to