Hi Mohammad,

thanks for the data. Looking at it I do not see anything wrong, apart from 
being ~50% slower than what we typically get with this kind of HW.
Just to rule out any other issue, could you try to run your test without your 
plugin and just doing simple IPv4 forwarding? That would allow to better 
compare with CSIT tests.

best
ben
________________________________________
From: [email protected] <[email protected]> on behalf of mohammad mahdi 
nemati haravani via lists.fd.io <[email protected]>
Sent: Wednesday, December 17, 2025 15:19
To: [email protected]
Subject: Re: [vpp-dev] Huge number of "dTLB-load-misses" when assigning more 
than 1 rx descriptor in VPP

Hi,
these are the outputs

great
mohammad

On Wed, Dec 17, 2025 at 4:51 PM Benoit Ganne (bganne) via 
lists.fd.io<http://lists.fd.io> 
<[email protected]<mailto:[email protected]>> wrote:
Hello,

can you share the output of the following commands while traffic is running:
 - vppctl cle run && vppctl sh run
 - vppctl cle err && vppctl sh err
 - vppctl cle hard && vppctl sh hard

And also the output of:
 - vppctl sh log
 - vppctl sh pci
 - vppctl sh thr

Best
ben

________________________________________
From: [email protected]<mailto:[email protected]> 
<[email protected]<mailto:[email protected]>> on behalf of mohammad mahdi 
nemati haravani via lists.fd.io<http://lists.fd.io> 
<[email protected]<mailto:[email protected]>>
Sent: Wednesday, December 17, 2025 11:20
To: [email protected]<mailto:[email protected]>
Subject: [vpp-dev] Huge number of "dTLB-load-misses" when assigning more than 1 
rx descriptor in VPP

Hi,
I was testing VPP on a system with CPU Intel® Xeon® Gold 6248R Processor
and I have ICE 810-C one hundred gig
when i try to assign more queue to worker threads i see DTLB miss rise in vtune 
and perf
i even set system to drop all packet but it seems there is a problem with VPP 
code or configuration
for reference i use dpdk testpmd and base on test pmd result i have this

VPP
1 queue 8160 rx descriptor -> rx miss & 100% DTLB miss
1 queue 4096 rx descriptor -> no miss & no DTLB miss
2 queue 4096 rx descriptor -> no miss & no DTLB miss (but vpp loop per second 
drop drastically)
4 queue 4096 rx descriptor -> rx miss & 100% DTLB miss

DPDK testpmd
1 queue 8160 rx descriptor ->  no miss & no DTLB miss
1 queue 4096 rx descriptor -> no miss & no DTLB miss
2 queue 4096 rx descriptor ->  no miss & no DTLB miss
4 queue 4096 rx descriptor ->  no miss & no DTLB miss
4 queue 8160 rx descriptor ->  no miss & no DTLB miss
8 queue 8160 rx descriptor ->  no miss & no DTLB miss

for both test i use one thread
i want to know what is the problem that causes this since DPDK testpmd is ok 
but vpp is not
also i use 1G huge page for both test



-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#26663): https://lists.fd.io/g/vpp-dev/message/26663
Mute This Topic: https://lists.fd.io/mt/116824309/21656
Group Owner: [email protected]
Unsubscribe: https://lists.fd.io/g/vpp-dev/leave/14379924/21656/631435203/xyzzy 
[[email protected]]
-=-=-=-=-=-=-=-=-=-=-=-

Reply via email to