Updates:
        Status: WontFix

Comment #1 on issue 1068 by jasvir: Consider using nekohtml instead of htmlparser
http://code.google.com/p/google-caja/issues/detail?id=1068

htmlparser was recommended by Ian Hickson, author of large chunks of the HTML5 spec, as conforming closely to the spec. Nekohtml is indeed quite fast, but when parsing tag soup htmlparser does a better job of producing the kind of DOM that you would get in an
actual browser (which is what we're trying to codify).

Mike Samuel looked at nekohtml more recently (primarily to see if we could benefit from neko's faster parsing) and improved our own parsing speed to the point where it
is comparable to neko's.  I am not sure I fully follow the benefit of removing
the dependency on icu4j.
