[SR-Users] Re: [EXTERNAL] Issue: SIP retransmissions and UDP packet drops under high load (Kamailio + RTPEngine co-located)

Richard Fuchs via sr-users Fri, 15 May 2026 07:24:38 -0700

On 15/05/2026 08.47, xf han via sr-users wrote:

1.  Since `tcpdump` sees the packets but Kamailio doesn't, is this a classic 
case of OS-level UDP receive buffer (`rmem`) overflow

That is very likely the case, but you can confirm this by inspecting theRX queue length of the affected sockets during peak times, e.g. with`ss` or `netstat`. I believe there are also system-wide counters forpacket drops due to RX queue overflows.

Note that these conditions can be very short lived and might only beapparent when you look at the queue length at the exact right moment.

possibly exacerbated by CPU context switching between Kamailio and RTPEngine?

Context switches are just another kind of CPU load, so would show up inthe regular CPU stats.

2.  With `children=40` for Kamailio and RTPEngine also running on the same 
56-core machine, is this configuration leading to significant CPU contention or 
an imbalanced distribution of workload, especially at 3000 CC? Should 
Kamailio's `children` count be adjusted (e.g., increased closer to 56, or 
perhaps fewer to leave more for RTPEngine and OS network processing)?

CPU load distribution is certainly one aspect, but another aspect tokeep in mind is that each worker process/thread can only do one thing ata time. That one thing might be crunching numbers on the CPU, but it canalso be just waiting for something. If every worker ends up waiting forsomething, then the CPU would be idle, but no new request can be processed.

18 ms processing time per request means every worker can process 55requests per second, and that's assuming this is evenly spread outwithout any peaks. How does that compare to the CPS you're seeing(keeping in mind that not every request is a new call)?

3.  Does this scenario typically imply that the host CPU is 
exhausted/interrupt-bound, or is there a specific Kamailio/OS tuning I am 
missing?

CPU exhaustion would be quite obvious in the system stats. IRQ issuescan probably be discounted if the packets show up in tcpdump.

There are lots of other things that can block a process. I/O is a usualsuspect. Swapping due to memory pressure can be deadly, and logging isanother likely culprit. Depending on how logging is done, if the loggingsystem can't keep up, then processes might end up having to wait towrite their logs, blocking everything else in the meantime.


Communications to an external service (DB...) are another possible cause.

Cheers

__________________________________________________________
Kamailio - Users Mailing List - Non Commercial Discussions -- 
[email protected]
To unsubscribe send an email to [email protected]
Important: keep the mailing list in the recipients, do not reply only to the 
sender!

[SR-Users] Re: [EXTERNAL] Issue: SIP retransmissions and UDP packet drops under high load (Kamailio + RTPEngine co-located)

Reply via email to