Hi Abhishek,


Did you try xdmp:unquote with repair-full option? There are also some
format option that might interest you.



http://community.marklogic.com/pubs/5.0/apidocs/Ext-5.html#xdmp:unquote



Kind regards,

Geert





*Van:* [email protected] [mailto:
[email protected]] *Namens *Abhishek53 S
*Verzonden:* maandag 25 juni 2012 16:01
*Aan:* MarkLogic Developer Discussion
*Onderwerp:* Re: [MarkLogic Dev General] UTF -8 Encoding Exception




Hi Geert,

Thanks for prompt reply! Is there any way to convert Non UTF 8 encoded file
to UTF -8 encoded through some different API? The downloaded text file has
invalid XML characters like  which needs to be pre-processed before
updating this to a XML file.

Thanks
Abhishek Srivastav
Systems Engineer
Tata Consultancy Services
Cell:- +91-9883389968
Mailto: [email protected]
Website: http://www.tcs.com
____________________________________________
Experience certainty.        IT Services
                       Business Solutions
                       Outsourcing
____________________________________________

From:

Geert Josten <[email protected]>

To:

MarkLogic Developer Discussion <[email protected]>

Date:

06/25/2012 06:41 PM

Subject:

Re: [MarkLogic Dev General] UTF -8 Encoding Exception

Sent by:

[email protected]


------------------------------




Hi Abhishek,

The encoding option is not to specify a target encoding for conversion, but
to specify the encoding of the file you try to download. So, you should
figure out which encoding file-location.txt itself has, and just specify
that..

Kind regards,
Geert

*Van:* [email protected] [mailto:
[email protected]] *Namens *Abhishek53 S*
Verzonden:* maandag 25 juni 2012 14:51*
Aan:* MarkLogic Developer Discussion*
Onderwerp:* [MarkLogic Dev General] UTF -8 Encoding Exception


Hi Folks,

I am having issue in downloading non UTF 8 encoded text file from file
server. I am using http-get method to download text files and then updating
the text inside XML documents.

How to convert non UTF 8 to UTF 8 encoded?

Sample Code
xdmp:http-get("file-location.txt",
        <options xmlns="xdmp:document-get">
                       <encoding>utf-8</encoding>
             </options>

)

Exception: XDMP-DOCUTF8SEQ: -- document is not UTF-8 encoded
Please let me know your suggestion

Thanks
Abhishek Srivastav
Systems Engineer
Tata Consultancy Services
Cell:- +91-9883389968
Mailto: [email protected]
Website: http://www.tcs.com
____________________________________________
Experience certainty.        IT Services
                       Business Solutions
                       Outsourcing
____________________________________________

=====-----=====-----=====
Notice: The information contained in this e-mail
message and/or attachments to it may contain
confidential or privileged information. If you are
not the intended recipient, any dissemination, use,
review, distribution, printing or copying of the
information contained in this e-mail message
and/or attachments to it are strictly prohibited. If
you have received this communication in error,
please notify us by reply e-mail or telephone and
immediately and permanently delete the message
and any attachments. Thank you
_______________________________________________
General mailing list
[email protected]
http://community.marklogic.com/mailman/listinfo/general
_______________________________________________
General mailing list
[email protected]
http://community.marklogic.com/mailman/listinfo/general

Reply via email to