> Hi,
>
> While it is nice and pretty easy to support multilingual HTML pages by
> specifying a content type of "text/html; charset=utf-8", I have serious
> problems detecting the incoming character encoding for FORM submissions.
>
> Let's say you have a Unicode page with one FORM on it that contains a
> hidden input field with characters outside ISO-8859-1. On submission of
> the form, Tomcat (for example) sometimes reports the encoding as UTF-8
> and sometimes as ISO-8859-1 (as returned by getCharacterEncoding() on
> ServletRequest).
>
> I noticed that there seems to be no HTTP-based information on the
> character encoding of a form submission (at least IE doesn't send any).
> Therefore I assume that Tomcat is doing some guesswork here.
>
> In Servlet API 2.3 there is a setCharacterEncoding() method on the
> request, so you can force the web container to use a specific encoding.
> Unfortunately, most commercial web containers (as well as Tomcat 3.3) do
> not implement Servlet API 2.3, so that method is not available. Also, I
> assume that you will run into serious trouble if you force the web
> container to use UTF-8 but non-UTF-8 characters are coming in...
>
> Does anybody know of a good way to handle these problems? Any help would
> be appreciated. I also think it is quite an interesting problem...
>
> Regards,
> Henning
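
One workaround that is sometimes used on containers without setCharacterEncoding() is to re-decode the parameter value by hand. The sketch below is only an illustration, not a confirmed fix for the Tomcat behaviour described above. It assumes the container decoded the submitted bytes as ISO-8859-1 while the browser actually sent UTF-8, and it falls back to the unchanged value when the bytes are not valid UTF-8, which addresses the "non-UTF-8 characters coming in" concern. The class name FormEncodingFix and the method redecode() are hypothetical, and the code assumes a modern JDK (java.nio.charset.StandardCharsets, Java 7 or later).

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the Servlet API.
public class FormEncodingFix {

    /**
     * Re-decodes a parameter value that was (by assumption) decoded as
     * ISO-8859-1 although the browser sent UTF-8 bytes. Returns the value
     * unchanged if it cannot be such a mis-decoded string.
     */
    public static String redecode(String raw) {
        if (raw == null) {
            return null;
        }
        // A char above U+00FF cannot come from ISO-8859-1 decoding, so the
        // container must already have used some other charset: leave it alone.
        for (int i = 0; i < raw.length(); i++) {
            if (raw.charAt(i) > 0xFF) {
                return raw;
            }
        }
        // ISO-8859-1 maps chars 0..255 one-to-one to bytes, so this
        // recovers the bytes exactly as they arrived on the wire.
        byte[] wireBytes = raw.getBytes(StandardCharsets.ISO_8859_1);

        // Strict decoder: report malformed input instead of substituting U+FFFD.
        CharsetDecoder utf8 = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            return utf8.decode(ByteBuffer.wrap(wireBytes)).toString();
        } catch (CharacterCodingException e) {
            // Not valid UTF-8, so the ISO-8859-1 reading is kept as-is.
            return raw;
        }
    }

    public static void main(String[] args) {
        // Simulate a container that read UTF-8 bytes as ISO-8859-1.
        String sent = "Gr\u00fc\u00dfe \u20ac";
        String misdecoded = new String(
                sent.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1);
        System.out.println(sent.equals(redecode(misdecoded)));
    }
}
```

In a servlet this would be applied to the result of request.getParameter(...). Note that the check is a heuristic: a short ISO-8859-1 string whose bytes happen to form valid UTF-8 would be re-decoded incorrectly, so it is safest when the form page itself is known to be served as UTF-8.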
