Hello,

I'm using Digester 2.0 and trying to process XML that
may include HTML entities and trying to get Digester to decode them
when parsing.

For example, my XML contains:
  <name><![CDATA[Gr&uuml;ber]]></name>

Currently, Digester is parses this as:  Gr&uuml;ber

But what I am really after is "GrĂ¼ber", so I am looking for a way to get this 
&uuml; entity decoded by Digester.
How do I tell Digester to decode HTML entities?

Also, if I don't use CDATA, like this:
  <name>Gr&uuml;ber</name>

Digester gives me: Grber

Any help would be very appreciated.  Thanks,

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to