Hello, I would like to parse localy downloaded (via <xi:include parse="text">) html pages.
After download, <xi:include> give me an "escape" html file. I suppress <!Doctype ... > with regex, but now the unescape transformer throw this error : " Caused by: org.xml.sax.SAXParseException: The entity "nbsp" was referenced, but not declared." I found this on internet : "To allow the use off   in you stylesheet, you have to declare it first : <!DOCTYPE xsl:stylesheet [<!ENTITY nbsp " ">]> " How I can add this declaration in the java unescape transformer ? I think that I can remove all   with a regex, but I would like to more understand how work java transformer. Thanks and have a good day. Florent --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
