Hi John,


According to the docs that should work. Are you testing this with CQ? You
might be fooled by the browser, displaying content with wrong encoding..



Kind regards,

Geert



*Van:* [email protected] [mailto:
[email protected]] *Namens *John Zhong
*Verzonden:* donderdag 3 november 2011 17:25
*Aan:* [email protected]
*Onderwerp:* [MarkLogic Dev General] encoding option "auto" in
xdmp:document-get function



Hi all,

I want to know how MarkLogic (I am using 4.2.7 version) determine the
encoding if setting "auto" option in xdmp:document-get function? for
example:

xdmp:document-get('D:\TOC-oe-17-26.xml',
       <options xmlns="xdmp:document-get">
           <encoding>*auto*</encoding>
       </options>)

I have many xml files in file system, but they are declaring different
encoding, like <?xml version="1.0" encoding="UTF-8"?>, <?xml version="1.0"
encoding="iso-8859-1"?>. So I want to use the "auto" encoding option to
read them without specifying encoding. When I tested to read some utf-8 xml
files by setting the "auto" option, some characters are messy. For example:

It returns the "*Brückner*" (should be *"Brückner"*), but when I set the
encoding option to "utf-8", it returns correctly.

<article author="*Brückner*" fpage="24334" lpage="24341"
msid="120315" type="Regular"/>

Thanks,
John
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to