Hi Edwin,
 
I have another question about Unicode.
I have two XML files.
One of them contains the character reference &#x2014;, and it is decoded as a dash when I print the text I get from getText().
The second file, however, contains &#x00E9;,
and that character is not displayed correctly.
How can I decode these characters and display them properly?

I do not need any explicit declarations to display Unicode characters, right? Does that mean the problem is in the XML file, or that the character is not defined? How do I know whether a Unicode character is compatible with UTF-8 or UTF-16?
 
Thanks a lot!
 
Regards,
Jac


Edwin Dankert <[EMAIL PROTECTED]> wrote:
dom4j fully supports Unicode characters.

However, in XML you can specify Unicode characters in two
different ways:

- Use the character as is, and save the file in a Unicode
encoding (UTF-8 or UTF-16).

- Use a character reference.

A character reference would look like '&#x2014;'.

Note: when you use character references, the XML parser will
replace each character reference with the Unicode character it denotes.
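
As a rough illustration of that substitution, here is a minimal sketch. It
uses the JDK's built-in DOM parser rather than dom4j, so that it runs without
extra libraries; the class name CharRefDemo and the helper rootText are made
up for this example. It also prints through a PrintStream that is explicitly
set to UTF-8. This rests on an assumption: that a character which parses
correctly but prints wrongly is being mangled by the console's default
encoding, not by the parser.

```java
import java.io.ByteArrayInputStream;
import java.io.PrintStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

public class CharRefDemo {

    // Hypothetical helper: parses the XML (handed to the parser as UTF-8 bytes)
    // and returns the text content of the root element.
    public static String rootText(String xml) {
        try {
            byte[] bytes = xml.getBytes(StandardCharsets.UTF_8);
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(bytes));
            return doc.getDocumentElement().getTextContent();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) throws Exception {
        // Character references: the parser replaces them with the real characters.
        String dash = rootText("<a>&#x2014;</a>");
        String eAcute = rootText("<a>&#x00E9;</a>");

        // The same character written as is, in a document declared as UTF-8.
        String literal = rootText(
                "<?xml version=\"1.0\" encoding=\"UTF-8\"?><a>\u00E9</a>");

        // Print code points in hex, which is independent of console encoding.
        System.out.println(Integer.toHexString(dash.codePointAt(0)));   // 2014
        System.out.println(Integer.toHexString(eAcute.codePointAt(0))); // e9
        System.out.println(eAcute.equals(literal));                     // true

        // Assumption: display problems come from the output encoding, so
        // write the characters through an explicitly UTF-8 PrintStream.
        PrintStream out = new PrintStream(System.out, true, "UTF-8");
        out.println(dash + " " + eAcute);
    }
}
```

If the hex code points come out right but the last line still looks wrong,
the terminal or file viewer is probably not interpreting the output as UTF-8.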

Regards,
Edwin


_______________________________________________
dom4j-user mailing list
dom4j-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dom4j-user
