Just a correction - GWT uses UTF-8 and not UTF-16. Also, you *can* do String.getBytes() and similar hacks to convert from ISO-8859-1 (oracle) to UTF-8 -- but in my opinion it is best to store data in UTF-8 in the database.
In general, you need to revisit all interfaces where data exchange happens, and ensure that a) both systems are using same encoding or b) One system re-encodes the data appropriately. (a) is always better than (b). --Sri On 6 May 2010 22:53, David Given <[email protected]> wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > On 06/05/10 16:08, undertow wrote: > > Thank you for confirming what i had suspected i would need to do. So > > the idea is, user enters a bunch of text into a textarea via typing it > > all in or cut and paste from somewhere (like Word, ugh and its mangled > > characters). when time comes to ship that text off to the server i > > would then pluck the string out of the textarea stick it in a transfer > > object of sorts. (this is where i am a little fuzzy) I would then > > take the input string do a getBytes() on it and then push that array > > of bytes into a blob. would i need to get the bytes with an encoding > > argument? > > I believe so. GWT ought to get the string from the browser in UTF-16 --- > as that's what Strings are defined to be. You can then ship it back to > the server, as a String, and it should Just Work. Then you get to do the > charset conversion on the server. > > > e.g. txt.getBytes("ISO-8859-1"). This method seems to work > > ok, but if user had pasted from ms word into the text box things still > > come out mangled. > > I'm quite willing to believe that there are web browser bugs with all > this. It may be worth verifying that GWT is actually getting a valid > string from the browser (by going through and listing all the codepoints > in the string). > > In addition, if Word is using all kinds of whacky non-ISO-8859-1 > characters such as unbreaking spaces and quotation marks, then > getBytes() might be replacing them with ? signs --- how is it being > mangled? > > - -- > ┌─── dg@cowlark.com ───── http://www.cowlark.com ───── > │ > │ life←{ ↑1 ⍵∨.^3 4=+/,¯1 0 1∘.⊖¯1 0 1∘.⌽⊂⍵ } > │ --- Conway's Game Of Life, in one line of APL > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.10 (GNU/Linux) > Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ > > iEYEARECAAYFAkvi+w4ACgkQf9E0noFvlziutgCghRKvCoszHy+H0ONz6CnbNtSw > FL4AoKv2Jo0V1wznq4awrMVgzzaNXDuQ > =+bWt > -----END PGP SIGNATURE----- > > -- > You received this message because you are subscribed to the Google Groups > "Google Web Toolkit" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]<google-web-toolkit%[email protected]> > . > For more options, visit this group at > http://groups.google.com/group/google-web-toolkit?hl=en. > > -- You received this message because you are subscribed to the Google Groups "Google Web Toolkit" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/google-web-toolkit?hl=en.
