On Tue, Sep 3, 2013 at 8:51 PM, Alex Shinn <[email protected]> wrote:

> html-parser processes entities, but the default for html->sxml
> is just to leave the encoded as-is.  I'm not sure if that's the best
> default,

I'm not going to suggest that this is a major problem, especially
since you are not claiming html-parser conforms to any particular
standard, and the docs clearly indicate its pragmatic focus. But just
for the record, if you wanted to be an XML-1.1-conformant processor,
you would have to normalize attribute values, which includes
dereferencing character entities:

http://www.w3.org/TR/xml11/#AVNormalize

As for the non-XML varieties of HTML, well ... life is too short to go
digging into all that hoary SGML stuff. Did that once upon a time ...
but I was younger then, and thought markup languages were the greatest
thing since sliced bread ;-)

--
Matt Gushee

_______________________________________________
Chicken-users mailing list
[email protected]
https://lists.nongnu.org/mailman/listinfo/chicken-users

Reply via email to