Re: [webkit-dev] HTML5 tokenizer landing soon

David Hyatt Mon, 14 Jun 2010 14:09:36 -0700

On Jun 14, 2010, at 4:05 PM, Adam Barth wrote:

> On Mon, Jun 14, 2010 at 1:53 PM, David Hyatt <[email protected]> wrote:
>> On Jun 14, 2010, at 3:48 PM, Adam Barth wrote:
>>> We ended up using the same algorithm as the old tokenizer to manage
>>> insertion points, however, we moved all the work into a separate
>>> InputStream data structure:
>>> 
>>> http://trac.webkit.org/browser/trunk/WebCore/html/HTML5DocumentParser.h#L75
>>> 
>>> The old code was actually pretty clever once I figured out what it was
>>> doing.  We're considering moving InputStream into its own file instead
>>> of keeping it as an inner class of the document parser.
>> 
>> If you're talking about the segmented string stuff, I added that to the 
>> existing tokenizer. :)
> 
> Yeah, well, I already knew you were a clever guy.  :)


In all seriousness, though, those are the kinds of optimizations we need to
make sure move over.  I remember other optimizations we did around quick
comparisons, some AtomicString stuff, and the SegmentedString stuff.  As long
as those moved over, I would think you should see comparable performance.  We
just need to make sure not to lose that work in the transition (and it sounds
like we haven't, so good).

dave

_______________________________________________
webkit-dev mailing list
[email protected]
http://lists.webkit.org/mailman/listinfo.cgi/webkit-dev
