We are getting the following error: java.io.CharConversionException: Not an ISO 8859-1 character: <EF><BF><83>. The character is not just <83>; sorry, I left out the first two bytes last time.
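To see which character the three logged bytes stand for, they can be decoded as UTF-8. This is only a minimal sketch: it assumes the log file itself was written in UTF-8, which the thread does not confirm, and the class and method names (DecodeLoggedBytes, decodeCodePoint, widen) are made up for illustration. Charset.forName is used instead of StandardCharsets so that it also compiles on Java 6.

```java
import java.nio.charset.Charset;

class DecodeLoggedBytes {
    // Decode bytes as UTF-8 and return the first code point of the result.
    static int decodeCodePoint(byte[] utf8Bytes) {
        String s = new String(utf8Bytes, Charset.forName("UTF-8"));
        return s.codePointAt(0);
    }

    // Cast a (signed) byte straight to char, the way careless byte-to-char code does.
    static char widen(byte b) {
        return (char) b;
    }

    public static void main(String[] args) {
        // The bytes shown in the log: <EF><BF><83>
        byte[] logged = {(byte) 0xEF, (byte) 0xBF, (byte) 0x83};
        int cp = decodeCodePoint(logged);
        System.out.printf("U+%04X%n", cp); // prints U+FFC3

        // ISO 8859-1 only covers U+0000..U+00FF.
        System.out.println(cp > 0xFF ? "outside ISO 8859-1" : "inside ISO 8859-1");

        // Casting the byte 0xC3 (a common UTF-8 lead byte) to char sign-extends it.
        System.out.printf("U+%04X%n", (int) widen((byte) 0xC3)); // prints U+FFC3
    }
}
```

Under that assumption the offending character is U+FFC3 and not U+0083. The last line shows one way such a value can arise in Java (a signed byte cast directly to char); whether that is what happens in this application is a guess, not something the log proves.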
We are working with Java 6. With Tomcat 5.5.23 it works, but we would like to use Tomcat 6.

awarnier wrote:
>
> pramodpm wrote:
>> We are facing an encoding issue in apache-tomcat-6.0.20. This is working
>> in Tomcat 5.5.23. We are trying to make a GET request to an external site.
>> The page contains some utf-8 characters.
>
> No. The page probably contains Unicode characters, all encoded in the
> UTF-8 encoding. What you probably mean is that some of these characters
> have a Unicode codepoint above 127 decimal, and are thus represented by
> 2 or more bytes in UTF-8.
>
>> When we access the page from the
>> application we are getting the following error.
>> Can you please help us to resolve this issue. Any help is appreciated.
>>
> From your log below, it does not look like you have problems when
> accessing the external page. Reading the page is fine, and the content
> of the page is being properly translated, from its original UTF-8
> encoding, into a Unicode string in Java (in your servlet).
>
> However, what happens next is that your servlet is trying to output this
> string to the servlet output stream, which is specified as having the
> ISO-8859-1 charset/encoding. And at least one of these internal Unicode
> characters does not have a valid representation in ISO-8859-1. So Java
> complains at the moment you are trying to write out this character,
> because it cannot translate it from the internal Unicode to the
> external desired ISO-8859-1 (because that particular character does not
> exist in ISO-8859-1, which contains only the 256 characters that are
> part of the latin-1 set, which covers only some Western European
> languages).
>
> Now having written all that, I am still a bit uneasy, if the <83> below
> represents the hexadecimal Unicode codepoint of this character. Because
> 0083 is a character known as "NBH", which looks like some kind of
> control character. So where would that one come from, in an HTML page?
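The point about characters that have no representation in ISO-8859-1 can be checked with the JDK's own encoder, without a servlet container. The following is a minimal sketch; the class and method names (Latin1Check, firstNonLatin1) and the sample strings are hypothetical and not taken from the application in this thread.

```java
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;

class Latin1Check {
    // Return the index of the first char ISO-8859-1 cannot represent, or -1 if none.
    static int firstNonLatin1(String text) {
        CharsetEncoder latin1 = Charset.forName("ISO-8859-1").newEncoder();
        for (int i = 0; i < text.length(); i++) {
            if (!latin1.canEncode(text.charAt(i))) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        System.out.println(firstNonLatin1("caf\u00e9"));      // -1: e-acute is in latin-1
        System.out.println(firstNonLatin1("price: 5\u20ac")); // 8: the euro sign is not
        System.out.println(firstNonLatin1("\u0083"));         // -1: U+0083 is encodable
        System.out.println(firstNonLatin1("\uffc3"));         // 0: U+FFC3 is not
    }
}
```

Note that U+0083 itself passes this check, which fits the unease expressed above: a bare <83> would not normally be rejected as "not ISO 8859-1". In a servlet, the usual remedy for text outside latin-1 is to declare a UTF-8 response, for example response.setContentType("text/html; charset=UTF-8"), and to write through response.getWriter() rather than ServletOutputStream.print(String). Whether that fits this application is an assumption.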
>
>> WARNING: Handler caused Not an ISO 8859-1 character: <83>
>> java.io.CharConversionException: Not an ISO 8859-1 character: <83>
>>     at javax.servlet.ServletOutputStream.print(ServletOutputStream.java:89)
>>     at
>>     at java.io.OutputStream.write(OutputStream.java:99)
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
> For additional commands, e-mail: users-h...@tomcat.apache.org

--
View this message in context: http://old.nabble.com/Tomcat-6-encoding-issue-tp26313150p26315285.html
Sent from the Tomcat - User mailing list archive at Nabble.com.