Updates:
Status: WontFix
Comment #1 on issue 1068 by jasvir: Consider using nekohtml instead of
htmlparser
http://code.google.com/p/google-caja/issues/detail?id=1068
htmlparser was recommended by Ian Hickson, author of large chunks of the
HTML5 spec
as conforming closely to the spec. Nekohtml is indeed quite fast but
htmlparser does
a better job of more accurately producing the kind of DOM that you would
get in an
actual browser (which is what we're trying to codify) when parsing tag soup.
Mike Samuel looked at nekohtml more recently (primarily to see if we could
benefit
from faster parsing by neko) and improved our own parsing speed to a point
where it
is comparable to neko. I am not sure I fully follow the benefit of removing
dependency on icu4j.
--
You received this message because you are listed in the owner
or CC fields of this issue, or because you starred this issue.
You may adjust your issue notification preferences at:
http://code.google.com/hosting/settings