Hi Edwin,
I have an enquiry regarding unicode again...
I have 2 xml files.
one of them encode as #x2014A manages to decode it as a dash when I print it out after using getText() to get my data.
however the second file with #x00E9
is not able to decode this character... May I know how can I decode my character & display them appropriately? I do not need to have any explicit declarations to display unicodes right? does that mean it's the xml file problem... or the unicode is not defined? how do i know if the unicode char is compatible with UTF-8 or UTF-16? thanks alot!
regards,
Jac |
Edwin Dankert <[EMAIL PROTECTED]> wrote:Dom4j fully supports unicode characters.
However in XML you can specify unicode characters in 2
different ways:
- Use the character as is and use a unicode compatible
encoding (UTF-8 or UTF-16).
- Use a character reference.
A character reference would look like '–’'.
Note: when you use character-references the XML parser will
substitute the character-reference for its unicode character.
Regards,
Edwin
-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
dom4j-user mailing list
dom4j-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dom4j-user
Yahoo! Mobile
- Download the latest ringtones, games, and more!