On Wed, 29 Nov 2006 17:37:52 +0100, Sam Ruby <[EMAIL PROTECTED]> wrote:
The bug would request that Wordpress doesn't try to output XML for the text/html media type. That seems to be the problem here.

If the code for Wordpress fit on a page, that suggestion would be easy to implement.

As it stands now, it appear that several hundred lines of code would need to change. And in each case, the code would need to be aware of the content type in effect. In some cases, that information may not be available. In fact, that may not have been determined yet.

Why would the code need to be aware of that? Atom supports type="html".


One way cross-cutting concerns such as this one are often handled is to simple capture the output and post-process it. Latchlan opted to do so with the WHATWG Blog. The first pass for things like this generally takes the form of simple pattern matching and regular expressions.

The solution used on the blog was a simple find (/>) and replace (>).


Often this evolves. What would be better is something that could take that string and produce a DOM, from which a correct serialization can take place.

Shouldn't this be done throughout the code instead of at the end?


Now, what type of parser would you use? HTML5's rules come tantalizingly close to handling this situation, except for a few cases involving tags that are self-closing...

Those cases are covered as well. You seem to think that "parse error" means the same as it does in XML or something. It does not. It merely means there's a syntax error and that you can continue parsing following the specified rules.


--
Anne van Kesteren
<http://annevankesteren.nl/>
<http://www.opera.com/>

Reply via email to