Hello, I'm using Digester 2.0 and trying to process XML that may include HTML entities and trying to get Digester to decode them when parsing.
For example, my XML contains: <name><![CDATA[Grüber]]></name> Currently, Digester is parses this as: Grüber But what I am really after is "GrĂ¼ber", so I am looking for a way to get this ü entity decoded by Digester. How do I tell Digester to decode HTML entities? Also, if I don't use CDATA, like this: <name>Grüber</name> Digester gives me: Grber Any help would be very appreciated. Thanks, Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
