[ 
https://issues.apache.org/jira/browse/SHINDIG-46?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567226#action_12567226
 ] 

Artemy Tregubenko commented on SHINDIG-46:
------------------------------------------

I was told that Java internal encoding is UTF-16 
(http://java.sun.com/j2se/1.4.2/docs/api/java/nio/charset/Charset.html), and 
BOM for UTF-16 is 0xFEFF (http://en.wikipedia.org/wiki/Byte_order_mark). 

I do out.codePointAt(0) after I converted string to internal encoding. 

This works at least for particular BOM-marked widget, and it doesn't break some 
other widgets without BOM. 

> gadgets.io.makeRequest malfunctions on non-ASCII web sites.
> -----------------------------------------------------------
>
>                 Key: SHINDIG-46
>                 URL: https://issues.apache.org/jira/browse/SHINDIG-46
>             Project: Shindig
>          Issue Type: Bug
>          Components: Gadgets Server - Java
>            Reporter: Brian Eaton
>            Assignee: John Hjelmstad
>         Attachments: patch
>
>
> See this thread for background: 
> http://mail-archives.apache.org/mod_mbox/incubator-shindig-dev/200802.mbox/browser
> Short term, we should change the HTTP proxy code to always use UTF-8 as the 
> character set for converting remote content bytes to strings before returning 
> them to clients.  We should do this ASAP to prevent anyone from becoming 
> dependent on the current undefined behavior.
> Long term we might want to add some kind of character set detection, probably 
> via the HTTP content-type header.  IE style charset content sniffing would 
> probably not be a good idea.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to