IIRC, that funky A character is what you see when you look at UTF-8 data with a Latin-1 viewer. My guess is the document is loaded correctly, Mark Logic is returning it in UTF-8 encoding to your viewer, and it's just that your viewer is mistakenly using Latin-1 to display it to you.
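
If you want to see the effect for yourself, here is a minimal sketch in Python. It assumes the stray character sits next to a non-breaking space (U+00A0), which is only a guess about your data; the sample string and variable names are made up for illustration.

```python
# A non-breaking space (U+00A0) is a single byte, 0xA0, in ISO-8859-1.
original = "10\u00a0kg"
latin1_bytes = original.encode("iso-8859-1")

# Converted to UTF-8, the same character becomes two bytes: 0xC2 0xA0.
utf8_bytes = original.encode("utf-8")

# A viewer that wrongly assumes Latin-1 treats each byte as its own
# character, so 0xC2 shows up as an extra "Â" in front of the space.
misread = utf8_bytes.decode("iso-8859-1")

# Decoding the very same bytes as UTF-8 gives back the original text.
correct = utf8_bytes.decode("utf-8")

print(repr(misread))
print(repr(correct))
```

If the extra character disappears once the viewer is told to read UTF-8, the stored document is probably fine and only the display is off.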

-jh-

On Oct 26, 2009, at 6:52 AM, Chowdhury, Soumadri wrote:

Hi,
I have some files in ISO-8859-1 format. I loaded those documents into MarkLogic using the following code:

xdmp:document-load("d:\myFile.xml",
  <options xmlns="xdmp:document-load">
    <uri>/docs/myFile.xml</uri>
    <format>xml</format>
    <encoding>ISO-8859-1</encoding>
  </options>)

But when I look at the generated XML file, I see the “Â” character in many places where it was not present in the input XML. How can I ensure that these characters are not generated during the ISO-8859-1 to UTF-8 conversion?

_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general
