I used Castor Framework to generate automatically classes and Descriptor to marshal and unmarshall XML data. To be more precise, I managed dictionaries and the class model used to manage them is pretty complex. My problem is the following : when I try to Unmarshall data, the parser used by Castor (xerces) doesn�t load XML standard entites (ex : à ). Here�s the message returned when I try upload the file attached at the end of the message :The problem is that à and ï are not standard XML entities -- they are standard in HTML (and thus in XHTML). If you want to use them, you have to declare them in your XML document. A probably-better workaround if you want to use entities is to use numeric references like à and ï (or à and ï). You can find the full list of HTML 4.0 entities with their numeric equivalents at http://www.w3.org/TR/html401/sgml/entities.html.
At the end of the message is the file I try to load. You may notice that the character �� is handled by the parser � while it�s not an XML entity � when ï is not (my test�)�This is also a difference between XML and HTML. HTML requires that all characters outside of a specific subset of iso-8859-1 (I believe) be referenced with entities. XML does not -- to an XML parser, � and ç are exactly the same thing.
Rhett
=====================================================
| Rhett Sutphin
| Research Assistant (Software)
| Coordinated Laboratory for Computational Genomics
| and the Center for Macular Degeneration
| University of Iowa - Iowa City, IA 52242 - USA
| 4111 MEBRF - email: [EMAIL PROTECTED]
=====================================================
----------------------------------------------------------- If you wish to unsubscribe from this mailing, send mail to
[EMAIL PROTECTED] with a subject of:
unsubscribe castor-dev
