On Jul 7, 2009, at 4:19 PM, Peter Kasting wrote:

For example, the framework could compute both sums _and_ geomeans, if people thought both were valuable.

That's a plausible thing to do, but I think there's a downside: if you make a change that moves the two scores in opposite directions, the benchmark doesn't help you decide whether the change is good or not. Avoiding paralysis in the face of tradeoffs is part of the reason we look primarily at the total score, not the individual subtest scores. The whole point of a meta-benchmark like this is to force ourselves to simple-mindedly look at only one number.
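
To make that concrete, here is a toy illustration (the subtest times are invented, not taken from any real benchmark run):

  // Hypothetical times in ms for two subtests, before and after a change.
  var before = [100, 400];
  var after  = [ 50, 460];

  function sum(xs)     { var s = 0; for (var i = 0; i < xs.length; i++) s += xs[i]; return s; }
  function geomean(xs) { var p = 1; for (var i = 0; i < xs.length; i++) p *= xs[i]; return Math.pow(p, 1 / xs.length); }

  sum(before);     // 500
  sum(after);      // 510    (the sum calls this change a regression)
  geomean(before); // 200
  geomean(after);  // ~151.7 (the geomean calls it a speedup)

Reporting both numbers would leave a change like this with no verdict at all.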

We could agree on a way of benchmarking a representative sample of current sites to get an idea of how widespread certain operations currently are. We could talk with the maintainers of jQuery, Dojo, etc. to see what sorts of operations they think it would be helpful to make faster for future apps. We could instrument browsers to do some sort of (opt-in) sampling of real-world workloads, etc. Surely together we can come up with ways to make SunSpider even better, while keeping its current strengths in mind.

I think these are all good ideas. There's one way in which sampling the Web is not quite right, though: to some extent, what matters is not the average density of an operation but its peak density. An operation that's used a *lot* by a few sites and hardly at all by most sites may deserve a weighting above its average proportion of Web use. I would like to hear input on what is inadequately covered. I tend to think there should be more coverage of the following (a rough sketch of the kind of code I mean follows the list):

- property access, involving at least some polymorphic access patterns
- method calls
- object-oriented programming patterns
- GC load
- programming in a style that makes significant use of closures
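
To be clearer about what I mean, here is a rough sketch (hypothetical code, not proposed SunSpider content) of a single loop that exercises all five:

  // Two constructors with different shapes but the same method name.
  function Point2(x, y) { this.x = x; this.y = y; }
  Point2.prototype.norm = function () {
    return Math.sqrt(this.x * this.x + this.y * this.y);
  };

  function Point3(x, y, z) { this.x = x; this.y = y; this.z = z; }
  Point3.prototype.norm = function () {
    return Math.sqrt(this.x * this.x + this.y * this.y + this.z * this.z);
  };

  // Closure-heavy style: the hot loop calls through a closure that
  // captures "total".
  function makeAccumulator() {
    var total = 0;
    return function (p) { total += p.norm(); return total; };
  }

  var acc = makeAccumulator();
  for (var i = 0; i < 100000; i++) {
    // Alternating receivers make the p.norm() call site and the x/y
    // property accesses polymorphic; allocating a fresh object every
    // iteration generates steady GC load.
    var p = (i % 2) ? new Point2(i, i + 1) : new Point3(i, i + 1, i + 2);
    acc(p);
  }

Nothing here is exotic; it's roughly the object-oriented, closure-heavy style that current libraries encourage, which is part of why I think it's underrepresented.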

I think the V8 benchmark does a much better job of covering the first four of these things. I also think it overweights them, to the exclusion of most other considerations (*). As I mentioned before, I'd like to include some of V8's tests in a future SunSpider 2.0 content set.

It would be good to know what other things should be tested that are not sufficiently covered.

Regards,
Maciej

* - For example, Mozilla's TraceMonkey effort showed relatively little improvement on the V8 benchmark, even though it showed significant improvement on SunSpider and other benchmarks. I think the TraceMonkey speedups are real and significant, so this tends to undermine my confidence in the V8 benchmark's coverage. Note: I don't mean to start a side thread about whether the V8 benchmark is good or not; I just wanted to justify my remarks above.