Re: [webkit-dev] PreloadScanner aggressiveness

Maciej Stachowiak Thu, 07 Jan 2010 12:49:34 -0800


On Jan 7, 2010, at 12:09 PM, Mike Belshe wrote:

Hi -
I've been working on SPDY, but I think I may have found a good performance win for HTTP. Specifically, the PreloadScanner, which is responsible for scanning ahead within an HTML document to find subresources, is throttled today. The throttling is intentional and probably sometimes necessary. Nonetheless, un-throttling it may lead to a 5-10% performance boost in some configurations. I believe Antti is no longer working on this? Is there anyone else working in this area that might have data on how aggressive the PreloadScanner should be? Below I'll describe some of my tests.
The PreloadScanner throttling happens in a couple of ways. First, the PreloadScanner only runs when we're blocked on JavaScript (see HTMLTokenizer.cpp). But further, as it discovers resources to be fetched, it may delay or reject loading the subresource at all due to throttling in loader.cpp and DocLoader.cpp. The throttling is very important, depending on the implementation of the HTTP networking stack, because throwing too many resources (or the low-priority ones) into the network stack could adversely affect HTTP load performance. This latter problem does not impact my Chromium tests, because the Chromium network stack does its own prioritization and throttling (not too dissimilar from the work done by loader.cpp).

The reason we do this is to prevent head-of-line blocking by low-priority resources inside the network stack (mainly considering how CFNetwork / NSURLConnection works).
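
To make the head-of-line blocking concern concrete, here is a minimal sketch of a toy network stack. It is not how CFNetwork, NSURLConnection, or loader.cpp actually behave; the function name `start_slots`, the priority numbers, and the resource names are all hypothetical, and each request is assumed to occupy one connection for exactly one time slot.

```python
def start_slots(requests, max_connections, prioritize):
    """Return {name: slot index} for a toy network stack.

    requests is a list of (name, priority) in discovery order, where a
    lower priority number means more important. Each request is assumed
    to hold one connection for exactly one time slot.
    """
    queue = list(requests)
    if prioritize:
        # Stable sort: ties keep their discovery order.
        queue.sort(key=lambda r: r[1])
    return {name: i // max_connections for i, (name, _prio) in enumerate(queue)}


# Hypothetical page: four images are discovered before the script and stylesheet.
requests = [
    ("img1.png", 2), ("img2.png", 2), ("img3.png", 2), ("img4.png", 2),
    ("app.js", 0), ("style.css", 0),
]

fifo = start_slots(requests, max_connections=2, prioritize=False)
prio = start_slots(requests, max_connections=2, prioritize=True)

print(fifo["app.js"], prio["app.js"])  # 2 0
```

In the FIFO case the script waits behind the images; with ordering applied before the requests reach the stack, it starts immediately. A stack that does its own prioritization would not need the caller to do this.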

Theory:
The theory I'm working under is that when the RTT of the network is sufficiently high, the *best* thing the browser can do is to discover resources as quickly as possible and pass them to the network layer so that we can get started with fetching. This is not speculative - these are resources which will be required to render the full page. The SPDY protocol is designed around this concept - allowing the browser to schedule all resources it needs to the network (rather than being throttled by connection limits). However, even with SPDY enabled, WebKit itself prevents resource requests from fully flowing to the network layer in 3 ways:
a) loader.cpp orders requests and defers requests based on the state of the page load and a number of criteria.
b) HTMLTokenizer.cpp only looks for resources further in the body when we're blocked on JS
c) "preload" requests are treated specially (docloader.cpp); if they are discovered too early by the tokenizer, then they are either queued or discarded.

I think your theory is correct when SPDY is enabled, and possibly when using HTTP with pipelining. It may be true to a lesser extent with non-pipelining HTTP implementations when the network stack does its own prioritization and throttling, by reducing latency in getting the request to the network stack. This is especially so when issuing a network request to the network stack may involve significant latency due to IPC or cross-thread communication or the like.
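
The intuition behind the theory can be sketched with a rough lower bound that counts only round trips and ignores bandwidth, connection setup, and discovery time. The function name `request_rounds_ms` and the example figures (30 resources, 6 parallel connections, 100ms RTT) are illustrative assumptions, not measurements from this thread.

```python
import math


def request_rounds_ms(num_resources, rtt_ms, max_parallel=None):
    """Lower bound on the time to get every request answered.

    Counts round trips only. max_parallel=None models a stack that
    accepts every request at once (for example, a multiplexed protocol).
    """
    if num_resources == 0:
        return 0
    if max_parallel is None:
        rounds = 1
    else:
        rounds = math.ceil(num_resources / max_parallel)
    return rounds * rtt_ms


# Hypothetical page with 30 subresources on a 100ms RTT link.
limited = request_rounds_ms(30, rtt_ms=100, max_parallel=6)  # 5 rounds
unlimited = request_rounds_ms(30, rtt_ms=100)                # 1 round

print(limited, unlimited)  # 500 100
```

The gap between the two grows with RTT, which is consistent with the claim that early discovery matters most on high-latency networks. On a real link, bandwidth limits would narrow the gap.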

Test Case
Can aggressive preload scanning (e.g. always preload scan before parsing an HTML Document) improve page load time?
To test this, I'm calling the PreloadScanner basically as the first part of HTMLTokenizer::write(). I've then removed all throttling from loader.cpp and DocLoader.cpp. I've also instrumented the PreloadScanner to measure its effectiveness.
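
As an illustration of what "scanning ahead for subresources" means, here is a minimal sketch built on Python's standard `html.parser`. It is not WebKit's PreloadScanner; the class name `ToyPreloadScanner`, the `scan` helper, and the sample markup are hypothetical. It only collects external scripts, stylesheets, and images, which are the three categories counted in the results.

```python
from html.parser import HTMLParser


class ToyPreloadScanner(HTMLParser):
    """Collects (kind, url) pairs for scripts, stylesheets, and images."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "script" and attrs.get("src"):
            self.found.append(("script", attrs["src"]))
        elif (tag == "link"
              and (attrs.get("rel") or "").lower() == "stylesheet"
              and attrs.get("href")):
            self.found.append(("css", attrs["href"]))
        elif tag == "img" and attrs.get("src"):
            self.found.append(("image", attrs["src"]))


def scan(html):
    """Scan a whole chunk of markup up front, before any real parsing."""
    scanner = ToyPreloadScanner()
    scanner.feed(html)
    scanner.close()
    return scanner.found


sample = (
    '<html><head><link rel="stylesheet" href="a.css">'
    '<script src="a.js"></script></head>'
    '<body><img src="a.png"></body></html>'
)
print(scan(sample))
```

A scanner like this sees only what is in the markup, so resources that script later removes or never uses would still be reported, which is one way unreferenced preloads can arise.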
Benchmark Setup
Windows client (chromium).
Simulated network with 4Mbps download, 1Mbps upload, 100ms RTT, 0% packet loss.
I run through a set of 25 URLs, loading each 30 times; not recycling any connections and clearing the cache between each page.
These are running over HTTP; there is no SPDY involved here.


I'm interested in the following:

- What kind of results do you get in Safari?

- How much of this effect is due to more aggressive preload scanning and how much is due to disabling throttling? Since the test includes multiple logically independent changes, it is hard to tell which are the ones that had an effect.
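
One way to separate the two changes is to measure all four combinations rather than only "neither" and "both". The sketch below enumerates those configurations and computes each change's saving from whatever page load times are measured. The flag names and the `effects` helper are hypothetical, and no measurements are assumed here: only the first and last configurations correspond to runs reported in this thread.

```python
from itertools import product

# The four test configurations: (aggressive_scan, throttling_disabled).
configs = list(product([False, True], repeat=2))


def effects(plt_ms):
    """Savings in ms relative to the unmodified build.

    plt_ms maps (aggressive_scan, throttling_disabled) to a measured
    mean page load time in milliseconds for that configuration.
    """
    base = plt_ms[(False, False)]
    return {
        "scan_only": base - plt_ms[(True, False)],
        "unthrottle_only": base - plt_ms[(False, True)],
        "both": base - plt_ms[(True, True)],
    }


print(configs)
```

If the "both" saving differs noticeably from the sum of the two individual savings, the changes interact and cannot be credited independently.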

Results:
Metric                            Baseline (without my changes)  Unthrottled
Average PLT                       2377ms                         2239ms
Time spent in the PreloadScanner  1160ms                         4540ms
Preload Scripts discovered        2621                           9440
Preload CSS discovered            348                            1022
Preload Images discovered         11952                          39144
Preload items throttled           9983                           0
Preload Complete hits             3803                           6950
Preload Partial hits              1708                           7230
Preload Unreferenced              42                             130

Notes:
Average PLT: +5.8% latency reduction.
Time spent in the PreloadScanner: As expected, we spend about 4x more time in the PreloadScanner. In this test, we loaded 750 pages, so it is about 6ms per page. My machine is fast, though.
Preload Scripts discovered: 4x more scripts discovered
Preload CSS discovered: 3x more CSS discovered
Preload Images discovered: 3x more images discovered
Preload Complete hits: This is the count of items which were completely preloaded before WebKit even tried to look them up in the cache. This is pure goodness.
Preload Partial hits: These are partial hits, where the item had already started loading, but not finished, before WebKit tried to look them up.
Preload Unreferenced: These are bad and the count should be zero. I'll try to find them and see if there isn't a fix - the PreloadScanner is just sometimes finding resources that are never used. It is likely due to clever JS which changes the DOM.
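
The rounded figures in the notes can be rechecked from the raw numbers in the results. This short sketch does only that arithmetic; the variable names are my own, and every input value is taken from the results and benchmark setup above.

```python
baseline_plt_ms, unthrottled_plt_ms = 2377, 2239
plt_reduction_pct = 100 * (baseline_plt_ms - unthrottled_plt_ms) / baseline_plt_ms

pages = 25 * 30                      # 25 URLs, each loaded 30 times
scanner_ms_per_page = 4540 / pages   # unthrottled scanner time per page

# Unthrottled value divided by baseline value for each metric.
ratios = {
    "scanner time": 4540 / 1160,
    "scripts": 9440 / 2621,
    "css": 1022 / 348,
    "images": 39144 / 11952,
}

print(round(plt_reduction_pct, 1))    # 5.8
print(round(scanner_ms_per_page, 1))  # 6.1
for name, ratio in ratios.items():
    print(name, round(ratio, 1))
```

The ratios come out to roughly 3.9x, 3.6x, 2.9x, and 3.3x, so the "4x" and "3x" in the notes are rounded.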
Conclusions:
For this network speed/client processor, more aggressive PreloadScanning clearly is a win. More testing is needed for slower machines and other network types. I've tested many network types; the aggressive preload scanning seems to always be either a win or a wash; for very slow network connections, where we're already at capacity, the extra CPU burning is basically free. For super fast networks, with very low RTT, it also appears to be a wash. The networks in the middle (including mobile simulations) see nice gains.
Next Steps and Questions:
I'd like to land my changes so that we can continue to gather data. I can enable these via macro definitions or I can enable these via dynamic settings. I can then try to do more A/B testing.


I'd like answers to my questions above before we consider that.

Are there any existing web pages which the WebKit team would like tested under these configurations? I don't see a lot of testing that I can leverage from the initial great work Antti did for verifying that I'm not breaking anything.
Is there any other information or data from the original PreloadScanner work which I should read?


There's the original blog announcement of preload scanning:

http://webkit.org/blog/166/optimizing-page-loading-in-web-browser/

It might be a good idea to try replicating those results with the proposed changes.


Regards,
Maciej

