Hi Maciek,
Couple of question.
1. Any idea why the bad plugins are bad - is there anything common in
their design that you suspect might be a common thread in the
performance regressions.
2. When I compare the CSIT 1710 to the 1801 test report from DPDK, it
suggests improved performance of x520? , < 24 mpps compared to > 26 mpps
with VPP 1801. Can you point us at the specific data demonstrating the
performance regression in VPP 1801.
https://docs.fd.io/csit/rls1710/report/dpdk_performance_tests/packet_throughput_graphs/l2.html#ndr-throughput
https://docs.fd.io/csit/rls1801/report/dpdk_performance_tests/packet_throughput_graphs/l2.html#ndr-throughput
Thanks,
Ray K
On 19/03/2018 23:14, Maciek Konstantynowicz (mkonstan) wrote:
Update on the four rls1801 performance regressions reported in CSIT rls1801
report[1]:
1. CSIT-925 Plugin induced NDR/PDR regression
a. All plugins loaded (VPP default) shows throughput variability of 3..5%
for all NIC2NIC tests;
b. Initial regression tests isolated a group of 6 "bad" plugins causing
variability:
- "bad" plugins: gtpu, ila, ixge, kubeproxy, l2e, sixrd;
- "good" plugins: dpdk, acl, flowprobe, ioam, lb, memif, nat, pppoe and
stn;
c. Further tests isolated it further to:
- gtpu plugin: worst case >25% performance drop on some testbed, less on
others;
- other "bad" plugins: affected by dpdk1711 + x520 NIC regression, see
CSIT-926;
d. Conclusions and next steps:
- Due to a number of issues with "bad" plugins CSIT is now loading VPP
plugins on the need basis;
- startup.conf used to load needed plugins per test;
- gtpu: requires deeper perf analysis e.g. "perf top";
2. CSIT-926 Generic NDR/PDR regression
a. Generic regression of up to -3% vs. rls1710;
b. Re-analysis of VPP daily performance trending jobs isolated the trouble
period to between two patches;
c. Further bisecting confirm problem is related to introduction of dpdk1711
for x520 NICs;
d. Conclusions and next steps:
- dpdk1711 introduced up to 3% generic performance regression to VPP for
x520 NICs
- affects some packet paths more than others;
- also affects some plugin loads, see "bad" plugins in 1b.
- x710/i40e NICs seem not be affected;
3. CSIT-927 vhost-user vring size of 1024 - NDR regression
a. Much lower NDR for vhostvr1024 tests, up to -42% vs. rls1710;
b. Root caused to bug in CSIT code, side effect of introducing Aarch64 Qemu;
c. CSIT bug fixed, https://gerrit.fd.io/r/#/c/10746/
4. CSIT-928 IPSec scale with HW QAT crypto-dev - NDR/PDR regression
a. NDR/PDR regression up to -15% vs. rls1710;
b. Root caused to bug in VPP code, QAT crypto-dev driver;
c. VPP patch by Radu Nicolau, under review/test verification.
- https://gerrit.fd.io/r/#/c/11082/
Comments?
Cheers,
-Maciek
(CSIT PTL)
[1]
https://docs.fd.io/csit/rls1801/report/vpp_performance_tests/csit_release_notes.html#known-issues
-=-=-=-=-=-=-=-=-=-=-=-
Links:
You receive all messages sent to this group.
View/Reply Online (#8607): https://lists.fd.io/g/vpp-dev/message/8607
View All Messages In Topic (2): https://lists.fd.io/g/vpp-dev/topic/15624779
Mute This Topic: https://lists.fd.io/mt/15624779/21656
New Topic: https://lists.fd.io/g/vpp-dev/post
Change Your Subscription: https://lists.fd.io/g/vpp-dev/editsub/21656
Group Home: https://lists.fd.io/g/vpp-dev
Contact Group Owner: [email protected]
Terms of Service: https://lists.fd.io/static/tos
Unsubscribe: https://lists.fd.io/g/vpp-dev/unsub
-=-=-=-=-=-=-=-=-=-=-=-