On Dec 22, 2008, at 19:31, Edward Z. Yang wrote:
The condition here is reaaaally long. Is there any way we can make it shorter?
Not really, but it's possible to flatten out the lookahead by adding states so that the condition in each state becomes simpler. (In fact, it's possible to remove lookahead from the tokenizer altogether by adding more states.) See Tokenizer.java in the Validator.nu HTML Parser.
SVN URL: http://svn.versiondude.net/whattf/htmlparser/trunk/ -- Henri Sivonen [email protected] http://hsivonen.iki.fi/
