Hi Christophe

It's very unlikely that the characters are the problem - all Unicode characters 
are allowed in XML - see e.g. http://www.xml.com/axml/testaxml.htm (section 
2.2) and hence in XmlBeans.

What is more likely is that the characters are not encoded (as bytes) in the 
way XmlBeans expects. By default XmlBeans assumes UTF-8 encoding. Yours are 
probably ISO8859_1 or some such thing. If you want to play around with 
character encoding have a look at XmlOptions.setCharacterEncoding().

Cheers,

Lawrence

> -----Original Message-----
> From: Christophe Bouhier (MC/ECM) [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, December 14, 2005 6:04 PM
> To: '[email protected]'
> Subject: Illegal characters, can xmlbeans be forgiving?
> 
> Hi,
> 
> My application parses XML from many different sources. (It's a RSS
> reader/Podcast receiver).
> Before I switched to XMLBeans I was using an xml parser called nanoXMl
> which didn't mind
> Some illegal characters especially when wrapped in CDATA.
> Now XMLBeans stumbles over the illegal chars below:(“) (Throws
> exception).
> 
> ....
> <description><![CDATA[
>       Miljenko “Mike� Grgich first gained international recognition at
> the celebrated “Paris Tasting� of 1976.  They had chosen Mike’s 1973
> Chateau Montelena Chardonnay as the finest white wine in the world.
>       Today, Mike oversees daily operations at his winery  Grgich Hills.
> His aim, year after year, is to improve the quality of their
> [...]]]></description>
> ......
> 
> Is there anyway I can set an option to ignore illegal chars and go on. For
> me this could be a deal-breaker. I unfortunatly can't expect all XML out
> on the web to be "nice and tidy".
> 
> Thanks for the help!
> Cheers / Christophe
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to