Holger Sunke created FILEUPLOAD-206:
---------------------------------------

             Summary: Encoding of parts incorrectly determined and assumed to be
                 Key: FILEUPLOAD-206
                 URL: https://issues.apache.org/jira/browse/FILEUPLOAD-206
             Project: Commons FileUpload
          Issue Type: Bug
            Reporter: Holger Sunke
             Fix For: 1.2.2


When user input form fields are submitted from a form with 
enctype="multipart/form-data", FileUpload assumes "ISO-8...-1" to be the 
default encoding for each part, without taking into account that 
request.getContentEncoding() returns "UTF-8".

The description of FILEUPLOAD-101 states: "Each individual part in the 
'multipart/form-data' encoded requests may have its own content type 
definition."

In my opinion, this assumption is wrong, as the multipart/form-data 
encoding algorithm states exactly the opposite for form fields that are not 
files:

"... The parts of the generated multipart/form-data resource that correspond to 
non-file fields must not have a Content-Type header specified. Their names and 
values must be encoded using the character encoding selected above (field names 
in particular do not get converted to a 7-bit safe encoding as suggested in RFC 
2388)..."
See: 
http://www.whatwg.org/specs/web-apps/current-work/multipage/association-of-controls-and-forms.html#multipart-form-data
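
To illustrate the quoted rule, here is a minimal sketch of how a browser might 
serialize one non-file field. The class and method names are hypothetical and 
are not part of FileUpload; quoting of special characters in the field name is 
omitted for brevity.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class NonFilePartSketch {
    // Hypothetical helper: serializes one non-file form field as a multipart part.
    // The part carries a Content-Disposition header but no Content-Type header;
    // name and value are encoded with the charset selected for the form.
    static byte[] encodeField(String boundary, String name, String value, Charset formCharset) {
        String part = "--" + boundary + "\r\n"
                + "Content-Disposition: form-data; name=\"" + name + "\"\r\n"
                + "\r\n"
                + value + "\r\n";
        return part.getBytes(formCharset);
    }

    public static void main(String[] args) {
        byte[] part = encodeField("xyz", "comment", "Gr\u00fc\u00dfe", StandardCharsets.UTF_8);
        // The receiver can only recover the value if it decodes with the same charset.
        System.out.print(new String(part, StandardCharsets.UTF_8));
    }
}
```

Since the part itself names no charset, the receiving side has to take the 
encoding from somewhere else, for example from the request.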

So for now, as a user of FileUpload, I don't see a way to tell FileUpload that 
my form fields are UTF-8 encoded, which results in corrupted user input.
Neither request.setContentEncoding("UTF-8"); nor <form ... 
accept-charset="UTF-8" ...> helps.
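
The corruption can be reproduced without FileUpload. The following is a 
minimal sketch using only the JDK; it assumes that the default named above is 
ISO-8859-1, and the class and method names are hypothetical.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class PartEncodingDemo {
    // Hypothetical helper: decodes the raw bytes of a form field part
    // with the given charset, as a multipart parser has to do.
    static String decodePart(byte[] rawPartBytes, Charset charset) {
        return new String(rawPartBytes, charset);
    }

    public static void main(String[] args) {
        String userInput = "Gr\u00fc\u00dfe"; // contains two non-ASCII characters
        // The browser sends the field value as UTF-8 bytes.
        byte[] onTheWire = userInput.getBytes(StandardCharsets.UTF_8);

        // Assumption: the parser falls back to ISO-8859-1 for the part.
        String wrong = decodePart(onTheWire, StandardCharsets.ISO_8859_1);
        // Decoding with the charset the form actually used.
        String right = decodePart(onTheWire, StandardCharsets.UTF_8);

        System.out.println(wrong.equals(userInput)); // false
        System.out.println(right.equals(userInput)); // true
    }
}
```

Each non-ASCII character takes two bytes in UTF-8, so the wrongly decoded 
string has two extra characters and no longer matches what the user typed.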

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
