I am not sure if this will work, but you can try using the <output-encoding> 
option to xdmp:quote.  Something like:

text { xdmp:quote(  xdmp:get-request-field("upload"),
         <options xmlns="xdmp:quote">
           <output-encoding>ASCII</output-encoding>
          </options>  )   }

-Danny

From: [email protected] 
[mailto:[email protected]] On Behalf Of Neil Bradley
Sent: Thursday, December 03, 2009 1:30 AM
To: [email protected]
Subject: [MarkLogic Dev General] Upload Data via Form - Invalid UTF-8 Escape 
Sequence

Hi,

I have a requirement to import data from spreadsheets and databases, using 
tab-separated text format, which I convert to XML. The problem I am having 
occurs when the source data comes from Excel and contains a pound symbol (or, I 
suspect, any character with an ASCII value above 127).

Initially, the problem was that the text file was not recognised by the browser 
as text, so it came in as "application/octet-stream" instead of "text/plain", 
but I solved that using the following technique:

  text {   xdmp:quote(  xdmp:get-request-field("upload")  )   }

That solved the problem when the pound symbol was not in the data, (and also 
works when the data arrives in "plain/text" format, so covers both scenarios). 

But when the pound symbols was present, I got the following error:

XDMP-UTF8SEQ: 
xdmp:quote(binary{"46756e64204e616d650944617465094e65742041737365742056616c75650944..."})
 -- Invalid UTF-8 escape sequence 
in /test/UploadData.xqy, on line 61 [1.0-ml] 

Now, I have opened the file I am uploading in TextPad, which tells me it is a 
PC format ANSI text file, so I guess that might explain the UTF-8 error. The 
document is NOT in UTF 8. So I think it converting from ANSI to UTF-8. Any idea 
how to do that in this form-upload scenario?

Thanks

Neil.

_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

Reply via email to