Re: Proposal for a regular upstream performance testing

Jason Wang Thu, 26 Nov 2020 00:24:49 -0800


On 2020/11/26 4:10 PM, Lukáš Doktor wrote:

Hello guys,
I have been around qemu on the Avocado-vt side for quite some time and a while ago I shifted my focus to performance testing. Currently I am not aware of any upstream CI that would continuously monitor the upstream qemu performance and I'd like to change that. There is a lot to cover so please bear with me.
Goal
====
The goal of this initiative is to detect system-wide performance regressions as well as improvements early, ideally pin-point the individual commits and notify people that they should fix things. All in upstream and ideally with the least human interaction possible.
Unlike the recent work of Ahmed Karaman's https://ahmedkrmn.github.io/TCG-Continuous-Benchmarking/ my aim is on the system-wide performance inside the guest (like fio, uperf, ...)
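As a rough illustration of the detection goal above, here is a minimal Python sketch that compares the samples of one metric between a baseline build and a candidate build. It is not taken from run-perf or pbench; the function name, the 5% tolerance and the sample values are assumptions made up for the example.

```python
from statistics import mean


def classify_change(baseline, candidate, tolerance=0.05, higher_is_better=True):
    """Classify a candidate build against a baseline for a single metric.

    baseline, candidate: lists of samples of the same metric
    (for example throughput in MB/s from repeated runs).
    tolerance: relative change that is still treated as noise
    (0.05 is an arbitrary value picked for this sketch).
    """
    base = mean(baseline)
    cand = mean(candidate)
    if base == 0:
        raise ValueError("baseline mean must be non-zero")
    # Relative change, oriented so that a positive value means "better".
    rel = (cand - base) / abs(base)
    if not higher_is_better:
        rel = -rel
    if rel < -tolerance:
        return "regression"
    if rel > tolerance:
        return "improvement"
    return "unchanged"


# Hypothetical throughput samples: the candidate is about 10% slower.
print(classify_change([100, 102, 98], [90, 91, 89]))
```

A real setup would likely also look at the spread of the samples, not only the mean, before notifying anyone.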
Tools
=====
In house we have several different tools used by various teams and I bet there are tons of other tools out there that can do that. I can not speak for all teams but over time many teams at Red Hat have come to like pbench https://distributed-system-analysis.github.io/pbench/ to run the tests and produce machine readable results and use other tools (Ansible, scripts, ...) to provision the systems and to generate the comparisons.
As for myself I used python for a PoC and over the last year I pushed hard to turn it into a usable and sensible tool which I'd like to offer: https://run-perf.readthedocs.io/en/latest/ anyway I am open to suggestions and comparisons. As I am using it downstream to watch regressions I do plan to keep developing the tool as well as the pipelines (unless a better tool is found that would replace it or its parts).

FYI, Intel has invested a lot in the 0-day Linux kernel automated performance regression test: https://01.org/lkp. It's being actively developed upstream.


It's powerful and tons of regressions were reported (and bisected).

I think it can use qemu somehow but I'm not sure. Maybe we can give it a try.

Thanks

How
===
This is a tough question. Ideally this should be a standalone service that would only notify the author of the patch that caused the change with a bunch of useful data so they can either address the issue or just be aware of this change and mark it as expected.
Ideally the community should also have a way to submit their custom builds in order to verify their patches so they can debug and address issues better than by just committing to qemu-master.
The problem with those is that we can not simply use travis/gitlab/... machines for running those tests, because we are measuring actual in-guest performance. We can't just stop the time when the machine decides to schedule another container/vm. I briefly checked the public bare-metal offerings like rackspace but these are most probably not sufficient either because (unless I'm wrong) they only give you a machine but it is not guaranteed that it will be the same machine the next time. If we are to compare the results we don't need just the same model, we really need the very same machine. Any change to the machine might lead to a significant difference (disk replacement, even firmware update...).
Solution 1
----------
Since I am already doing this for downstream builds I can start doing this for upstream as well. At this point I can offer a single pipeline watching only changes in qemu (downstream we are checking distro/kernel changes as well but that would require too much time at this point) on a single x86_64 machine. I can not offer public access to the testing machine, not even checking custom builds (unless someone provides me a publicly available machine(s) that I would use for this). What I can offer is running the checks on the latest qemu master, publishing the reports, bisecting issues and notifying people about the changes. An example of a report can be found here: https://drive.google.com/file/d/1V2w7QpSuybNusUaGxnyT5zTUvtZDOfsb/view?usp=sharing a documentation of the format is here: https://run-perf.readthedocs.io/en/latest/scripts.html#html-results I can also attach the raw pbench results if needed (as well as details about the tests that were executed and the params and other details).
Currently the covered scenarios would be a default libvirt machine with qcow2 storage and a tuned libvirt machine (cpus, hugepages, numa, raw disk...) running fio, uperf and linpack on the latest GA RHEL. In the future I can add/tweak the scenarios as well as the test selection based on your feedback.
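The "bisecting issues" step mentioned in Solution 1 could look roughly like the Python sketch below. It is only a generic binary search over an ordered commit list and does not describe how run-perf or the downstream pipeline actually bisects; the function name, the commit labels and the is_bad callback (which in practice would build qemu at that commit and run the benchmark) are hypothetical.

```python
def first_bad_commit(commits, is_bad):
    """Return the first commit for which is_bad(commit) is true.

    commits: list ordered from oldest to newest, where commits[0] is
    known to be good and commits[-1] is known to be bad.
    is_bad: callable that builds and benchmarks one commit and returns
    True if the regression is present (hypothetical, supplied by the caller).
    """
    if len(commits) < 2:
        raise ValueError("need at least one good and one bad commit")
    lo, hi = 0, len(commits) - 1  # lo is always good, hi is always bad
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_bad(commits[mid]):
            hi = mid
        else:
            lo = mid
    return commits[hi]


# Made-up history c0..c9 where the regression was introduced in c6.
history = [f"c{i}" for i in range(10)]
print(first_bad_commit(history, lambda c: int(c[1:]) >= 6))
```

Each probe costs a full build plus benchmark run, so halving the range on every step keeps the number of runs logarithmic in the number of commits.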
Solution 2
----------
I can offer documentation: https://run-perf.readthedocs.io/en/latest/jenkins.html and someone can fork it or take inspiration from it and set up the pipelines on their system, making it available to the outside world and adding custom scenarios and variants. Note the setup does not require Jenkins, it's just an example and could be easily turned into a cronjob or whatever you choose.
Solution 3
----------
You name it. I bet there are many other ways to perform system-wide performance testing.
Regards,
Lukáš
