Dan Smith created GEODE-6203:
--------------------------------
Summary: Include uncertainity values to anaylzeRun results
Key: GEODE-6203
URL: https://issues.apache.org/jira/browse/GEODE-6203
Project: Geode
Issue Type: Task
Components: benchmarks
Reporter: Dan Smith
It's hard to tell from the results of analyzeRun whether differences between
runs are statistically significant or not.
As first cut, we should try to include a confidence interval with each of our
measurements.
We can use 3 * the standard error (99.7% confidence interval) for the
throughput and mean latency measurements, and propagate the error to the ratio
between two runs. That should give us a confidence interval for the ratio as
well which will help us determine if the changes have a statistically
significant difference.
After making these changes, we should investigate to see if multiple runs on
the same hardware remain within this confidence interval. This methodology may
not work, since all of the measurements we take within a single run aren't
really independent. If the multiple runs don't stay within the interval, we may
need to consider running each benchmark multiple times like JMH to produce a
more accurate error estimate.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)