Opensips 2.1.2 occasionally crashes when connection to database fails when
using db_virtual module. In this case, there is only one backend.
It's a rare problem - usually when the query fails, it doesn't cause a crash.
But the crash has happened multiple times, on different machines connected to
different databases.
This query is generated by a timer route that runs every minute. I'm waiting
for it to happen again so I can get a core dump file.
The configuration for db_virtual is as follows:
modparam("db_virtual", "db_urls", "define set1 FAILOVER")
modparam("db_virtual", "db_urls", "postgres://xxx:[email protected]:5433/xxx")
modparam("db_virtual", "db_max_consec_retrys", 5)
modparam("db_virtual", "db_probe_time", 10)
And the log:
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_postgres:db_postgres_submit_query: 0x7f054ab9a770 PQsendQuery failed:
could not rec
eive data from server: Connection timed out#012 Query: select
ip,grp,mask,port,proto,pattern,context_info,id from opensips_permissions
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_postgres:free_query: PQclear(0xce33f0) result set
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
ERROR:db_postgres:db_postgres_submit_query: 0x7f054ab9a770 PQsendQuery Error:
could not re
ceive data from server: Connection timed out#012 Query: select
ip,grp,mask,port,proto,pattern,context_info,id from opensips_permissions
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]: ERROR:core:db_do_query:
error while submitting query - [select
ip,grp,mask,port,proto,pattern,context_info,id from opensips_permissions ]
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_virtual:db_virtual_query: failover call failed
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]: DBG:core:pool_remove:
removing connection from the pool
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_postgres:db_postgres_free_connection: PQfinish(0xcdaf50)
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_postgres:db_postgres_free_connection: pkg_free(0x7f054ab9a898)
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_virtual:db_virtual_query: curent_con = 0
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_virtual:db_virtual_query: flags2 = 2
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10815]:
DBG:db_virtual:db_virtual_query: curent_con = 0
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]:
NOTICE:core:io_wait_loop_epoll: EPOLLIN(read) event: epollwait() set event
EPOLLHUP - connection closed by the remote peer!
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]:
DBG:core:handle_tcp_worker: dead tcp worker 6 (pid 10815, no 0) (shutting down?)
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]: DBG:core:io_watch_del:
[TCP_main] io_watch_del op on index 25 32 (0x84b1e0, 32, 25, 0x0,0x1) fd_no=27
called
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]:
NOTICE:core:io_wait_loop_epoll: EPOLLIN(read) event: epollwait() set event
EPOLLHUP - connection closed by the remote peer!
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]: CRITICAL:core:receive_fd:
EOF on 35
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]: DBG:core:handle_worker:
dead child 17, pid 10815 (shutting down?)
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]: DBG:core:io_watch_del:
[TCP_main] io_watch_del op on index 17 35 (0x84b1e0, 35, 17, 0x0,0x1) fd_no=26
called
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:core:handle_sigs:
status = 139
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: INFO:core:handle_sigs:
child process 10815 exited by a signal 11
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: INFO:core:handle_sigs:
core was generated
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: INFO:core:handle_sigs:
terminating due to SIGCHLD
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10817]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10816]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10814]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10813]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10812]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10811]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10810]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10809]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10808]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10807]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10806]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10802]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10801]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10803]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10804]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10800]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10799]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10805]: INFO:core:sig_usr: signal
15 received
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: INFO:core:cleanup: cleanup
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: INFO:pike:pike_exit:
destroying...
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:tm:tm_shutdown:
tm_shutdown : start
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]:
DBG:tm:unlink_timer_lists: emptying DELETE list for set 0
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:tm:tm_shutdown:
emptying hash table
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:tm:tm_shutdown:
releasing timers
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:tm:tm_shutdown:
removing semaphores
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:tm:tm_shutdown:
destroying callback lists
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:tm:tm_shutdown:
tm_shutdown : done
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]:
NOTICE:db_virtual:destroy: destroying module...
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:core:shm_mem_destroy:
destroying the shared memory lock
Oct 24 18:32:10 rst-lab-gw /usr/sbin/opensips[10798]: DBG:core:handle_sigs:
terminating due to SIGCHLD
Thanks,
Jeremiah
_______________________________________________
Users mailing list
[email protected]
http://lists.opensips.org/cgi-bin/mailman/listinfo/users