Our web pages validate to XHTML 1.0 Transitional.

When returning results in XML Format, entities which are encoded using valid XHTML entity references are causing the xml results to be invalid, and therefore unparseable.

For example a page with "©" (the copyright symbol) is being returned in the xml results with "\xa9" (the Hexadecimal character reference which is ©)

I have created an embedded DTD in my results for parsing, and included the DTD fragments for defining Latin1 entities, but still no luck.

Has anyone seen this problem, and found a solution?


-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to