[
https://issues.apache.org/jira/browse/FLINK-29825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17685641#comment-17685641
]
Dong Lin commented on FLINK-29825:
----------------------------------
Thanks [~Yanfei Lei] for implementing and evaluating the algorithm!
[~pnowojski] Cool, I think we have agreed to make incremental improvements and
used the algorithm proposed in the above doc to detect regression for Flink
benchmarks.
We probably still have different understandings regarding the pros/cons of
these alternative choices. It will be great if you or someone else can help
implement an alternative choice and show that it can do better than the one we
are going to use. I probably won't have time to try the Hunter algorithm myself
in the near future.
> Improve benchmark stability
> ---------------------------
>
> Key: FLINK-29825
> URL: https://issues.apache.org/jira/browse/FLINK-29825
> Project: Flink
> Issue Type: Improvement
> Components: Benchmarks
> Affects Versions: 1.17.0
> Reporter: Yanfei Lei
> Assignee: Yanfei Lei
> Priority: Minor
>
> Currently, regressions are detected by a simple script which may have false
> positives and false negatives, especially for benchmarks with small absolute
> values, small value changes would cause large percentage changes. see
> [here|https://github.com/apache/flink-benchmarks/blob/master/regression_report.py#L132-L136]
> for details.
> And all benchmarks are executed on one physical machine, it might happen that
> hardware issues affect performance, like "[FLINK-18614] Performance
> regression 2020.07.13".
>
> This ticket aims to improve the precision and recall of the regression-check
> script.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)