Our web pages validate to XHTML 1.0 Transitional.
When returning results in XML Format, entities which are encoded using
valid XHTML entity references are causing the xml results to be invalid,
and therefore unparseable.
For example a page with "©" (the copyright symbol) is being
returned in the xml results with "\xa9" (the Hexadecimal character
reference which is ©)
I have created an embedded DTD in my results for parsing, and included
the DTD fragments for defining Latin1 entities, but still no luck.
Has anyone seen this problem, and found a solution?
-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
ht://Dig general mailing list: <htdig-general@lists.sourceforge.net>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general