Hi all,
It is a good idea to use a different encoding (like
US-ASCII) to convert all the special characters into their entity references.
But, the problem is there are many places in the
application the xml gets processed using XML processors (JDOM), after the
transformation is done. We use encodings like ISO-8859-1 and UTF-8 at those
places. And, the transformed xml is now US-ASCII encoded. Do you think
those entity references will remain as is (without again getting converted into
direct characters) during such XML processings? If it does, then that is the
best solution. The problem it takes huge time to
change the remaining part of the code (where XML processing is done), if it
doesn't. If it doesn't, is there any way to change the encoding back to the
original (without costly processing) and change those "®" to
"&174;"
And the main goal is to make the xml source have
entity references (like ®) for ALL special characters, so that
no XML processor fails because of encoding. Is that reasonable to do (If not the
smartest thing:-)?
Please suggest!
Thanks,
Pramodh.
|
- RE: replacing ALL chars in a string while transforming Andrew Welch
- Re: replacing ALL chars in a string while transformi... Pramodh Peddi
- RE: replacing ALL chars in a string while transformi... Andrew Welch
- Using Extensions Functions - its efficiency Pramodh Peddi
- RE: replacing ALL chars in a string while transformi... Joseph Kesselman
- Pramodh Peddi