On Tue, Sep 3, 2013 at 8:51 PM, Alex Shinn <[email protected]> wrote:
> html-parser processes entities, but the default for html->sxml > is just to leave the encoded as-is. I'm not sure if that's the best > default, I'm not going to suggest that this is a major problem, especially since you are not claiming html-parser conforms to any particular standard, and the docs clearly indicate its pragmatic focus. But just for the record, if you wanted to be an XML-1.1-conformant processor, you would have to normalize attribute values, which includes dereferencing character entities: http://www.w3.org/TR/xml11/#AVNormalize As for the non-XML varieties of HTML, well ... life is too short to go digging into all that hoary SGML stuff. Did that once upon a time ... but I was younger then, and thought markup languages were the greatest thing since sliced bread ;-) -- Matt Gushee _______________________________________________ Chicken-users mailing list [email protected] https://lists.nongnu.org/mailman/listinfo/chicken-users
