[FileUpload] various issues - encoding, RFC compliance, and tweaks

A. Rothman Thu, 18 May 2006 04:23:06 -0700

Hi!



I'm considering moving to FileUpload for handling uploaded files.

I've gone over the code and found various issues (RFC compliance or just little implementation tweaks) that I figured I'd mention here before opening bugs:

1. According to the RFC (1522), non-ASCII headers use word encoding (=?...?= syntax). I didn't find this implemented in FileUpload (MultipartStream?).

2. FileUploadBase.parseHeaders() does not handle header folding (also in RFCs).

3. FileUploadBase.parseHeaders() calls header.indexOf(':') 3 times; it can call it once and save the value (each call iterates over the string characters again).

4. Where does the 1024 byte max header size limit come from (RFCs or just a reasonable value)?

5. Content encoding is not respected as defined in the RFC: if a request encoding (charset) is specified, it should be used in parsing all form values. Currently each FileItem value must be retrieved with the explicit encoding (which is taken from the request). I've seen this reported also within other Apache projects (http://myfaces.apache.org/tomahawk/xref/org/apache/myfaces/webapp/filter/MultipartRequestWrapper.html - the comment stands out).

6. Further, the charset does seem to be used in parsing the headers - isn't this non-RFC behavior? From what I understand, anything that's non-ASCII within the headers themselves should be word-encoded (see issue #1), and the content-type charset should be used on the content, not the headers...

7. MultipartStream.readHeaders() uses a one-byte array instead of a single byte, for no apparent reason.
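
To make points 2 and 3 concrete, here is a minimal sketch of header parsing that unfolds continuation lines first and looks up the colon only once per line. It is not FileUpload's actual code: the class and method names (HeaderSketch, unfold, parse) are made up for illustration, and it assumes the raw header block is already available as a String with CRLF line endings.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; not part of FileUpload.
class HeaderSketch {

    /** Joins folded lines: a CRLF followed by spaces or tabs continues the previous header. */
    static String unfold(String rawHeaders) {
        return rawHeaders.replaceAll("\r\n[ \t]+", " ");
    }

    /** Parses a header block into a map of lower-cased names to trimmed values. */
    static Map<String, String> parse(String rawHeaders) {
        Map<String, String> headers = new LinkedHashMap<>();
        for (String line : unfold(rawHeaders).split("\r\n")) {
            int colon = line.indexOf(':'); // searched once, then reused below
            if (colon <= 0) {
                continue; // skip blank or malformed lines
            }
            String name = line.substring(0, colon).trim().toLowerCase(Locale.ROOT);
            String value = line.substring(colon + 1).trim();
            headers.put(name, value);
        }
        return headers;
    }

    public static void main(String[] args) {
        // The first header is folded across two lines.
        String raw = "Content-Disposition: form-data;\r\n\t name=\"file\"\r\n"
                   + "Content-Type: text/plain\r\n\r\n";
        System.out.println(parse(raw));
    }
}
```

In this sketch a repeated header name simply overwrites the earlier value; a real implementation would need to decide how to combine duplicates.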

Please let me know which of these should have bugs opened, and/or point out what I've misunderstood :-)


Thanks,

Amichai


