On Thu, 2009-04-16 at 00:12 -0700, Graham Dumpleton wrote:
> > So, from where I sit, we have:
> >
> >  1. Many header values which are ASCII.
> >  2. A few header values which are ISO-8859-1 plus RFC 2047.
> >  3. A few header values which are URI's (no specified encoding) or
> IRI's
> > (UTF-8).
> >
> > I understand the desire to decode ASAP, and I agree with Guido that
> we
> > should use a default encoding which the app can override. Looking at
> the
> > above, ISO-8859-1 is the best encoding I know of for all three
> header
> > cases. ASCII can be used as a valid subset without transcoding;
> headers
> > which are ISO-8859-1 are decoded perfectly; URI/IRI headers can be
> > transcoded by the app if needed, but mangled opaquely by middleware.
> >
> > If we make *that* call, then IMO there's no reason not to do the
> > same to SCRIPT_NAME, PATH_INFO, and QUERY_STRING.
> 
> I am not sure we ended up with a final answer on all of this, but I
> don't want to hold up mod_wsgi 3.0, which includes Python 3.0 support,
> any longer. As such, am implementing things as per:
> 
>   http://www.wsgi.org/wsgi/Amendments_1.0
> 
> with exception that will not be attempting to do decoding per RFC
> 2047. Any CGI variables not related to HTTP headers will also be
> handled as latin-1, including SCRIPT_NAME, PATH_INFO and QUERY_STRING.
> This should be equivalent with what wsgiref does in Python 3.X and
> basically keeps the status quo.
> 
> If anyone has any last things to say on all of this, please speak up
> now.
> 
That sounds fine to me, Graham, and is what I'll be implementing in my
python3 branch for CherryPy barring any unforeseen impediments.


Robert Brewer
fuman...@aminus.org

_______________________________________________
Web-SIG mailing list
Web-SIG@python.org
Web SIG: http://www.python.org/sigs/web-sig
Unsubscribe: 
http://mail.python.org/mailman/options/web-sig/archive%40mail-archive.com

Reply via email to