Re: what is new in talos, what is coming up

2015-07-27 Thread jmaher
It has been a while since we posted an update on Talos

Here are some new things:
* bug 1166132 - new tps test - tab switching
* e10s on all platforms, only runs on mozilla-central for pgo builds, broken 
tests, big regressions are tracked in bug 1144120
* perfherder is easier to use, some polish on test selection and the compare 
view, and most importantly we have found a few odd bugs that has caused 
duplicate data to show up, check it out: 
https://treeherder.mozilla.org/perf.html#/graphs

Here is what is upcoming:
* moving talos source code in-tree (bug 787200)
* starting to move android talos to autophone (bug 1170685)
* perfherder: easier to find it when pushing to try and more polish on 
selecting which revisions to compare against.
* automatic 5 retriggers for talos jobs on try server

As always if you have issues you can file bugs:
* talos: 
https://bugzilla.mozilla.org/enter_bug.cgi?product=Testingcomponent=Talos
* perfherder: 
https://bugzilla.mozilla.org/enter_bug.cgi?product=Tree%20Managementcomponent=Perfherder

Thanks for responding to regressions when pinged!  Expect another update 
sometime in late August or early September.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: what is new in talos, what is coming up

2015-06-05 Thread jmaher

 
  can we keep the
  snarkfest version running please until this is resolved?
 
 My main concern was because I inferred from the previous post that
 deprecation of snarkfest was scheduled on a timeline basis.
 
 Can we instead schedule on a when-its-ready basis, please?
 

yes- we are not doing this on a time basis- datazilla deprecation is on a time 
schedule- the rest is not.  Most likely this is an August/September thing- 
ideally within the next 4-6 weeks you will be using Perfherder happily for 
everything!

Thanks for bringing up suggestions and using the new tools.  
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: what is new in talos, what is coming up

2015-06-04 Thread Karl Tomlinson
William Lachance writes:

 Hi Karl,

 On 2015-06-04 12:30 AM, Karl Tomlinson wrote:
 jma...@mozilla.com  writes:

 We will deprecate those instances of compare-talos next quarter
 completely.

 The treeherder version seems to randomly choose which and how many
 of the results to load and so the comparison changes after
 reloads of the page.

 So this is a bug. :) Could you please file something here:

 https://bugzilla.mozilla.org/enter_bug.cgi?product=Tree%20Managementcomponent=Perfherder

 For best results, include reproduction steps and comparison with
 the existing snarkfest interface.

Thanks for the link.  Knowing the right component is half the
process of filing bugs.

https://bugzilla.mozilla.org/show_bug.cgi?id=1171707

I don't know how to compare with snarkfest, but comparing
treeherder with treeherder is sufficient to observe the bug.

 can we keep the
 snarkfest version running please until this is resolved?

My main concern was because I inferred from the previous post that
deprecation of snarkfest was scheduled on a timeline basis.

Can we instead schedule on a when-its-ready basis, please?

 If I may sneak in a request or two, then a number of results or an
 estimate of standard error in the mean would be helpful to know
 how to interpret the standard deviations.  Also, a way to see
 whether higher or lower is better even for those tests without
 enough data to detect a statistically significant change would be
 a bonus.

https://bugzilla.mozilla.org/show_bug.cgi?id=1171694
https://bugzilla.mozilla.org/show_bug.cgi?id=1171703

 I'm keen to find out what is in the Details links, but they
 currently just ask me to wait a minute.

 Likewise, that sounds like a bug. Clicking on details is supposed
 to open up a subtest summary.  Here it seems to work fine for me:

 https://treeherder.mozilla.org/perf.html#/compare?originalProject=mozilla-centraloriginalRevision=b2ad2f9f8e53newProject=mozilla-centralnewRevision=f21fac50a8cc

That link is loading for me, and that info can be very helpful,
thanks.  However, others links are not.

https://bugzilla.mozilla.org/show_bug.cgi?id=1171710
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: what is new in talos, what is coming up

2015-06-04 Thread William Lachance

Hi Karl,

On 2015-06-04 12:30 AM, Karl Tomlinson wrote:

jma...@mozilla.com  writes:


2) compare-talos is in perfherder
(https://treeherder.mozilla.org/perf.html#/comparechooser), other instances of
compare-talos have a warning message at the top indicating you should use
perfherder.  We will deprecate those instances of compare-talos next quarter
completely.

This is very helpful to present PGO separated and to know that
higher is better for canvasmark, but it is not yet ready to
replacehttp://perf.snarkfest.net/compare-talos/index.html

The treeherder version seems to randomly choose which and how many
of the results to load and so the comparison changes after
reloads of the page.


So this is a bug. :) Could you please file something here:

https://bugzilla.mozilla.org/enter_bug.cgi?product=Tree%20Managementcomponent=Perfherder

For best results, include reproduction steps and comparison with the 
existing snarkfest interface.



upcoming work:
2) continue polishing perfherder graphs, compare-view

Perhaps the above issue is already in this work and you know it
will be addressed by next quarter, but, if not, can we keep the
snarkfest version running please until this is resolved?

Today, the treeherder version is not loading enough results to do a
reasonable comparison, while the snarkfest version doesn't seem to
have the problem and presents results almost instantly.

If I may sneak in a request or two, then a number of results or an
estimate of standard error in the mean would be helpful to know
how to interpret the standard deviations.  Also, a way to see
whether higher or lower is better even for those tests without
enough data to detect a statistically significant change would be
a bonus.


These sound like good ideas. Can't remember whether we've filed issues 
for them yet. Feel free to have a look at the perfherder bug list and 
file a feature request if something isn't open yet:


https://wiki.mozilla.org/Auto-tools/Projects/Perfherder#Bug_Table


I'm keen to find out what is in the Details links, but they
currently just ask me to wait a minute.


Likewise, that sounds like a bug. Clicking on details is supposed to 
open up a subtest summary.  Here it seems to work fine for me:


https://treeherder.mozilla.org/perf.html#/compare?originalProject=mozilla-centraloriginalRevision=b2ad2f9f8e53newProject=mozilla-centralnewRevision=f21fac50a8cc

Will

___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: what is new in talos, what is coming up

2015-06-03 Thread jmaher
Things are still changing with Talos- many things are becoming easier, while 
others still have kinks to work out- here are a few things which have changed 
recently:

1) Android talos has been reduced to what is useful- it should run faster and 
sets us up for migrating to autophone next quarter
2) compare-talos is in perfherder 
(https://treeherder.mozilla.org/perf.html#/comparechooser), other instances of 
compare-talos have a warning message at the top indicating you should use 
perfherder.  We will deprecate those instances of compare-talos next quarter 
completely.
3) datazilla no longer collects talos data.  This has been stopped on all 
branches we care about and we will be turning datazilla off completely next 
month!
4) talos counters are now streamlined a bit and showing up in perfherder 
(https://treeherder.mozilla.org/perf.html#/graphs)
5) compare view in perfherder has a more realistic comparison algorithm to 
point out regressions/improvements.

upcoming work:
1) finish evaluating talos counters - collect only what is useful
2) continue polishing perfherder graphs, compare-view
3) start generating alerts from perfherder (in parallel to graph server)
4) document and work on a smoother method for getting enough data and comparing 
data points on try pushes, regressions on the tree, and running tests locally.

A lot of good ideas come in from various folks (usually folks who are 
investigating a regression or worried about a change they are making).  While 
in Whistler, we would love to show folks how these tools currently work, walk 
through the end game and ideal use cases, and brainstorm on things we are 
overlooking.

If you are interested, do look for a performance discussion on the schedule.  
It has yet to be scheduled.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: what is new in talos, what is coming up

2015-06-03 Thread Karl Tomlinson
Thanks, Joel.  I've benefited from being able to use
perf.html#/comparechooser and will look forward to the performance
discussion.

jma...@mozilla.com writes:

 2) compare-talos is in perfherder
 (https://treeherder.mozilla.org/perf.html#/comparechooser), other instances of
 compare-talos have a warning message at the top indicating you should use
 perfherder.  We will deprecate those instances of compare-talos next quarter
 completely.

This is very helpful to present PGO separated and to know that
higher is better for canvasmark, but it is not yet ready to
replace http://perf.snarkfest.net/compare-talos/index.html

The treeherder version seems to randomly choose which and how many
of the results to load and so the comparison changes after
reloads of the page.

 upcoming work:

 2) continue polishing perfherder graphs, compare-view

Perhaps the above issue is already in this work and you know it
will be addressed by next quarter, but, if not, can we keep the
snarkfest version running please until this is resolved?

Today, the treeherder version is not loading enough results to do a
reasonable comparison, while the snarkfest version doesn't seem to
have the problem and presents results almost instantly.

If I may sneak in a request or two, then a number of results or an
estimate of standard error in the mean would be helpful to know
how to interpret the standard deviations.  Also, a way to see
whether higher or lower is better even for those tests without
enough data to detect a statistically significant change would be
a bonus.

I'm keen to find out what is in the Details links, but they
currently just ask me to wait a minute.
___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform


Re: what is new in talos, what is coming up

2015-05-05 Thread Joel Maher
Great question Brian!

I believe you are asking if we would generate alerts based on individual
tests instead of the summary of tests.  The short answer is no.  In looking
at reporting the subtest results as new alerts, we found there was a lot of
noise (especially in svgx, tp5o, dromaeo, v8) as compared to the summary
alerts.  cart/tart has a bit of noise which might be more realistic, to
report specific pages on, we could investigate this more if there is a
strong need for it.

late last year we cleaned up our summary reporting to be a geometric mean
of all the pages/subtests which were run.  This means that we do a better
job of reporting a regression of the test summary when a specific test is
the cause.

One thing we have been working on is a compare mode to perfherder which
replaces compare-talos.  This is live (
https://treeherder.mozilla.org/perf.html#/comparechooser - although
changing rapidly) and will do a great job of telling you which specific
test caused a regression.

Do you have concerns about a lack of reporting subtests, or tooling to make
finding the results easier?  suggestions are welcome, this is something we
work on regularly and improve to make our lives easier!

-Joel


On Mon, May 4, 2015 at 2:08 PM, Brian Grinstead bgrinst...@mozilla.com
wrote:

 The upcoming changes sound great!  Is there currently a way (or plans to
 add a way) to track regressions / improvements for a single measurement
 within a test?  I see that in perfherder I can add these measurements to a
 graph (http://mzl.la/1E17Zyo) but it’s hard to distinguish between normal
 variation across runs and an actual regression by looking at the graph.

 Brian


___
dev-platform mailing list
dev-platform@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-platform