On Mon, 13 Feb 2006 13:50:57 +0600, dolphinling <[EMAIL PROTECTED]> wrote:

So, will the HTML 5 parsing section be of use here? Will it be of use to things other than browsers? Are there small differences needed because what's being parsed is a document fragment instead of a document? And when it's re-serialized, how closely will today's browsers interpret the original and the new?

A HTML parser is defenitely a thing which isn't only used by browsers. Search engines, archivation and comparison tools, web page translators -- they all need a parser.

About websites like forums and blogs, it can be a bit trickier: many of them introduce their own markup (BBcode, LiveJournal tags) in addition to allowing some HTML, so they'd need a modified version of HTML 5 parser.


-- Opera M2 9.0 TP2 on Debian Linux 2.6.12-1-k7
* Origin: X-Man's Station at SW-Soft, Inc. [ICQ: 115226275] <[EMAIL PROTECTED]>

Reply via email to