On Mon, 13 Feb 2006 13:50:57 +0600, dolphinling
<[EMAIL PROTECTED]> wrote:
> So, will the HTML 5 parsing section be of use here? Will it be of use to
> things other than browsers? Are there small differences needed because
> what's being parsed is a document fragment instead of a document? And
> when it's re-serialized, how closely will today's browsers interpret the
> original and the new?
An HTML parser is definitely not something that only browsers use.
Search engines, archiving and comparison tools, web page translators:
they all need a parser.
Websites like forums and blogs can be a bit trickier: many of them
introduce their own markup (BBcode, LiveJournal tags) in addition to
allowing some HTML, so they'd need a modified version of the HTML 5 parser.
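
To make the forum case concrete, here is a rough sketch of what such a
"modified" pipeline could look like: translate the site's own markup to
HTML first, then run the result through a parser and re-serialize only
what is allowed. Everything specific here is an assumption for
illustration: the two-tag BBcode subset, the ALLOWED_TAGS whitelist and
the names bbcode_to_html, FragmentFilter and render_comment are made up,
and Python's html.parser is only a lenient stand-in, not an
implementation of the HTML 5 parsing algorithm. It is also not meant as
a security-grade sanitizer.

```python
import re
from html import escape
from html.parser import HTMLParser

# Hypothetical whitelist of HTML tags a forum might allow in comments.
ALLOWED_TAGS = {"b", "i", "em", "strong"}

# Hypothetical BBcode subset: only [b]...[/b] and [i]...[/i].
BBCODE_TAG = re.compile(r"\[(/?)(b|i)\]", re.IGNORECASE)


def bbcode_to_html(text):
    """Rewrite the supported BBcode tags as the equivalent HTML tags."""
    return BBCODE_TAG.sub(
        lambda m: "<%s%s>" % (m.group(1), m.group(2).lower()), text
    )


class FragmentFilter(HTMLParser):
    """Re-serialize a fragment, keeping only whitelisted tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []        # pieces of the serialized output
        self.open_tags = []  # whitelisted tags that are still open

    def handle_starttag(self, tag, attrs):
        # Tags outside the whitelist are dropped; attributes always are.
        if tag in ALLOWED_TAGS:
            self.open_tags.append(tag)
            self.out.append("<%s>" % tag)

    def handle_endtag(self, tag):
        # Close intervening tags too, so the output stays well nested.
        if tag in self.open_tags:
            while self.open_tags:
                top = self.open_tags.pop()
                self.out.append("</%s>" % top)
                if top == tag:
                    break

    def handle_data(self, data):
        # Character references were decoded by the parser; escape again.
        self.out.append(escape(data))

    def result(self):
        self.close()  # flush any text the parser is still buffering
        while self.open_tags:
            self.out.append("</%s>" % self.open_tags.pop())
        return "".join(self.out)


def render_comment(text):
    """Convert a comment with mixed BBcode and HTML to filtered HTML."""
    parser = FragmentFilter()
    parser.feed(bbcode_to_html(text))
    return parser.result()


if __name__ == "__main__":
    print(render_comment("[b]hi[/b] <a href='x'>link</a> <i>x"))
```

Note that the input is a fragment, not a document, and that the
unclosed <i> gets closed on output: the re-serialized markup is not
byte-for-byte the original, which is exactly where the question of how
browsers interpret "the original and the new" comes in.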
-- Opera M2 9.0 TP2 on Debian Linux 2.6.12-1-k7
* Origin: X-Man's Station at SW-Soft, Inc. [ICQ: 115226275]
<[EMAIL PROTECTED]>