https://issues.apache.org/bugzilla/show_bug.cgi?id=45957
Summary: UTF-8 POST request results in corrupted data
Product: Tomcat 5
Version: 5.5.27
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: Connector:HTTP
AssignedTo: [email protected]
ReportedBy: [EMAIL PROTECTED]
Currently in Tomcat 5 if a request is received containing UTF-8 content then
any accents or non-Roman characters are corrupted, since there is an assumption
the POST request is ISO-8895-1 (latin1). For example 'é' becomes 'é'. A
work around is specified on the following page:
http://wiki.apache.org/tomcat/Tomcat/UTF-8
in the form of a Filter class, called 'CharsetFilter'. IMHO, this is not the
most ideal solution for a site that is wanting to standardise on UTF-8.
It should be noted that in the same document, referenced above, there is an
option to specify the encoding of the URI. It would be handy to have something
equivalent for the POST content, maybe something like "POSTencoding",
defaulting to ISO-8859-1 as is used currently, and then providing the necessary
code to use UTF-8 instead.
This issue is valid for both Tomcat 5 and 6.
additional keywords: internationalisation, internationalization, i18n
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]