If we make the changes proposed in the "On CI" thread, I think we solve
most of this for free. If a perf test fails, resubmitting should not
need to rebuild GHC because it is cached. I'd say at that point, it's
not even worth making Marge bot auto-accept extra improvements because
restarting with new perf windows should be so easy.
John
On 2/22/21 8:46 AM, Andreas Klebinger wrote:
This seems quite reasonable to me.
Not sure about the cost of implementing it (and the feasibility of it
if/when merge-trains arrive).
Andreas
On 21/02/2021 at 21:31, Richard Eisenberg wrote:
On Feb 21, 2021, at 11:24 AM, Ben Gamari <b...@well-typed.com> wrote:
To mitigate this I would suggest that we allow performance test failures
in marge-bot pipelines. A slightly weaker variant of this idea would
instead only allow performance *improvements*. I suspect the latter
would get most of the benefit, while eliminating the possibility that a
large regression goes unnoticed.
The value in making performance improvements a test failure is that it
informs patch authors of what they have done, so they can check that it
matches their expectations. This need can reasonably be satisfied
without stopping merging. That is, if Marge can accept performance
improvements while (say) posting to each MR involved that it may
have contributed to a performance improvement, then I think we've
done our job here.
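Concretely, the rule being proposed could be as simple as the following
sketch (the types, field names, and message wording here are purely
illustrative and hypothetical; this is not how Marge actually works):

    -- Purely illustrative: not Marge's actual code, nor GHC CI's metric names.
    data PerfChange = PerfChange
      { testName :: String
      , deltaPct :: Double  -- change vs. baseline; negative means an improvement
      }

    data Verdict
      = Accept (Maybe String)  -- merge, optionally posting a note to the MRs involved
      | Reject String          -- fail the pipeline with an explanation

    judge :: PerfChange -> Verdict
    judge pc
      | deltaPct pc < 0 = Accept (Just (testName pc
          ++ " improved by " ++ show (abs (deltaPct pc))
          ++ "%; please check this matches your expectations"))
      | deltaPct pc == 0 = Accept Nothing
      | otherwise = Reject (testName pc
          ++ " regressed by " ++ show (deltaPct pc) ++ "%")

    -- e.g. judge (PerfChange "T9872" (-3.2)) would accept and post a note,
    -- while judge (PerfChange "T9872" 3.2) would fail the pipeline.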
On the other hand, a performance degradation is a bug, just like,
say, an error message regression. Even if it's a combination of
commits that cause the problem (an actual possibility even for error
message regressions), it's still a bug that we need to either fix or
accept (balanced out by other improvements). The pain of debugging
this scenario might be mitigated if there were a collation of the
performance wibbles for each individual commit. This information is,
in general, available: each commit passed CI on its own, and so it
should be possible to create a little report with its rows being perf
tests and its columns being commits or MR #s; each cell in the table
would be a percentage regression. If we're lucky, the regression
Marge sees will be the sum(*) of the entries in one of the rows --
this means that we have a simple agglomeration of performance
degradation. If we're less lucky, the whole will not equal the sum of
the parts, and some of the patches interfere. In either case, the
table would suggest a likely place to look next.
(*) I suppose if we're recording percentages, it wouldn't necessarily
be the actual sum, because percentages compound rather than add: two
successive 10% regressions make a 21% regression overall, not 20%. But
you get my meaning.
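To make the table idea concrete, here is a rough Haskell sketch of the
collation (the Measurement type and the tab-separated rendering are my
own invention for illustration; no such tool exists in GHC's CI today):

    import qualified Data.Map.Strict as Map
    import Data.List (intercalate)

    -- Hypothetical sketch, not an existing GHC/CI tool.
    -- One measurement: for a given commit/MR and perf test, the percentage
    -- change relative to that commit's own baseline CI run.
    data Measurement = Measurement
      { commit :: String
      , test   :: String
      , pct    :: Double
      }

    -- Rows are perf tests, columns are commits; cells are percentage changes.
    collate :: [Measurement] -> Map.Map String (Map.Map String Double)
    collate = foldr insert Map.empty
      where
        insert m = Map.insertWith Map.union (test m) (Map.singleton (commit m) (pct m))

    -- Percentages compound rather than add, so the "expected" combined change
    -- for a row is the product of the per-commit factors, not their sum.
    compounded :: [Double] -> Double
    compounded ps = (product [1 + p / 100 | p <- ps] - 1) * 100

    -- Render one tab-separated line per perf test: the per-commit cells in the
    -- given column order, followed by the compounded total for that row.
    render :: [String] -> Map.Map String (Map.Map String Double) -> String
    render commits table = unlines
      [ intercalate "\t"
          (t : [maybe "-" show (Map.lookup c row) | c <- commits]
             ++ [show (compounded (Map.elems row))])
      | (t, row) <- Map.toList table ]

Comparing the compounded total of a row against what Marge measured on
the merge commit would show at a glance whether the regression is a
simple accumulation or an interaction between patches.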
Pulling this all together:
* I'm against the initial proposal of allowing all performance
failures by Marge. This will allow bugs to accumulate (in my opinion).
* I'm in favor of allowing performance improvements to be accepted by
Marge.
* To mitigate against the information loss of Marge accepting
performance improvements, it would be great if Marge could alert MR
authors that a cumulative performance improvement took place.
* To mitigate against the annoyance of finding a performance
regression in a merge commit that does not appear in any component
commit, it would be great if there were a tool to collect performance
numbers from a set of commits and present them in a table for further
analysis.
These "mitigations" might take work. If labor is impossible to
produce to complete this work, I'm in favor of simply allowing the
performance improvements, maybe also filing a ticket about these
potential improvements to the process.
Richard
_______________________________________________
ghc-devs mailing list
ghc-devs@haskell.org
http://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-devs