[Digester] HTML entity decoding?

Otis Gospodnetic Wed, 15 Apr 2009 15:07:02 -0700

Hello,

I'm using Digester 2.0 and trying to process XML that
may include HTML entities and trying to get Digester to decode them
when parsing.


For example, my XML contains:
  <name><![CDATA[Gr&uuml;ber]]></name>

Currently, Digester is parses this as:  Gr&uuml;ber

But what I am really after is "Grüber", so I am looking for a way to get this 
&uuml; entity decoded by Digester.
How do I tell Digester to decode HTML entities?

Also, if I don't use CDATA, like this:
  <name>Gr&uuml;ber</name>

Digester gives me: Grber

Any help would be very appreciated.  Thanks,

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[Digester] HTML entity decoding?

Reply via email to