Hi,

I have a very basic question regarding encoded text in an XML Text node.

I am using the DOM parser
javax.xml.parsers.DocumentBuilder
to parse some XML. The XML has some international characters which are encoded 
as decimal Unicode character references, e.g. 'ó' is '&#243;'.

The parser takes the encoded character out of the word, for example 'jóbb', and 
the Text node is left with 'jbb' as the node value. If I use getData() on the 
Text node it also returns 'jbb'. The character is not lost though: a sibling 
Node, an Entity Reference Node, has the decimal value and an index. The index 
value corresponds to where this character should be inserted. In this case, for 
example, it would have an index of 1 and a value of '#243'.
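
To make the behaviour concrete, here is a minimal sketch of how the children of 
the parent element can be listed. The class name ChildDump, the method 
describeChildren, and the sample element <w> are made up for illustration. On a 
parser that expands character references itself, this should report just a 
single Text node.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class ChildDump {
    // Returns one line per child of the root element:
    // numeric node type, node name, node value.
    static List<String> describeChildren(String xml) throws Exception {
        DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));
        NodeList children = doc.getDocumentElement().getChildNodes();
        List<String> lines = new ArrayList<>();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            lines.add(child.getNodeType() + " " + child.getNodeName()
                    + " " + child.getNodeValue());
        }
        return lines;
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical sample input; &#243; is the decimal reference for 'ó'.
        for (String line : describeChildren("<w>j&#243;bb</w>")) {
            System.out.println(line);
        }
    }
}
```

Node type 3 is a Text node and node type 5 is an Entity Reference node, so the 
output shows at a glance how the parser has split the content.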

This suggests that there must be a simple way of accessing the value of the 
Text node, or of its parent, in such a way as to get 'jóbb'. It seems unlikely 
that I need to reconstruct this myself by looking up the character for the 
encoded value and then inserting it into the string for the Text node. Does 
anyone know how this works?
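
For reference, the manual reconstruction described above would look roughly 
like the sketch below. It assumes the reference arrives as a string such as 
'#243' (decimal) or '#xF3' (hex) together with an insertion index; how those 
two values are read from the Entity Reference Node is not shown, since that 
depends on the parser. CharRefInsert, decodeCharRef, and insertAt are 
hypothetical names.

```java
public class CharRefInsert {
    // Turns "#243" (decimal) or "#xF3" (hex) into the character it stands for.
    static String decodeCharRef(String ref) {
        String digits = ref.substring(1); // drop the leading '#'
        int codePoint = (digits.startsWith("x") || digits.startsWith("X"))
                ? Integer.parseInt(digits.substring(1), 16)
                : Integer.parseInt(digits, 10);
        // toChars also copes with code points outside the Basic Multilingual Plane.
        return new String(Character.toChars(codePoint));
    }

    // Inserts the decoded character into the text at the given index.
    static String insertAt(String text, int index, String ref) {
        return new StringBuilder(text).insert(index, decodeCharRef(ref)).toString();
    }

    public static void main(String[] args) {
        // Text node value "jbb", index 1, reference "#243".
        System.out.println(insertAt("jbb", 1, "#243"));
    }
}
```

Note that a Java String is held as UTF-16 internally, so no explicit conversion 
to UTF-8 is needed at this step; an encoding only matters when the string is 
written out.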

Sorry for asking what is probably a really basic question, but I can't find the 
answer.

Thank you for any help.

Regards,
Julius.

