On Wed, Mar 03, 2010 at 09:22:52AM -0500, Joshua Marantz wrote:
> In https://bugzilla.gnome.org/show_bug.cgi?id=611655 I reported what looked
> like a bug where a tag like:
> 
>   <option selected>
> 
> would be transformed, in the parser to
> 
>   <option selected="selected">
> 
> This is consistent with
> http://www.w3.org/TR/html4/intro/sgmltut.html#h-3.3.4.2 but that spec also
> warns:
> 
>    Authors should be aware that many user agents *only* recognize the
> minimized form of boolean attributes and not the full form.
> 
> By making this transformation in the parser, it is not possible to use
> libxml2 to process HTML without potentially breaking behavior in some
> browsers.
> 
> 
> Currently this transformation is implemented in HTMLparser.c in the static
> function htmlParseAttribute, based on htmlIsBooleanAttr(name) .  Daniel
> Veillard explains that some downstream tools expect this transformation to
> be done. However, I would like to propose that this transformation be moved
> out of the parser and done in a later phase.  Daniel suggested "the SAX2.c
> module building the tree". This would be OK for my purposes, as I am using
> my own SAX bindings. and not relying on the tree-building code.
> 
> So I'm proposing this change to see if there are objections.

  Okay, I just did this as I think this makes sense overall. Change is
in git head.

Daniel

-- 
Daniel Veillard      | libxml Gnome XML XSLT toolkit  http://xmlsoft.org/
[email protected]  | Rpmfind RPM search engine http://rpmfind.net/
http://veillard.com/ | virtualization library  http://libvirt.org/
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to