Re: [Python-Dev] email package status in 3.X

P.J. Eby Mon, 21 Jun 2010 07:55:35 -0700

At 10:20 PM 6/21/2010 +1000, Nick Coghlan wrote:

For the idea of avoiding excess copying of bytes through multiple
encoding/decoding calls... isn't that meant to be handled at an
architectural level (i.e. decode once on the way in, encode once on
the way out)? Optimising the single-byte codec case by minimising data
copying (possibly through creative use of PEP 3118) may be something
that we want to look at eventually, but it strikes me as something of
a premature optimisation at this point in time (i.e. the old adage
"first get it working, then get it working fast").

The issue is, I'd like to have an idempotent incantation that I canuse to make the inputs and outputs to stdlib functions behave in atype-safe manner with respect to bytes, in cases where bytes arereally what I want operated on.

Note too that this is an argument for symmetry in wrapping the inputsand outputs, so that the code doesn't have to "know" what it's dealing with!

After all, right now, if a stdlib function might return bytes orunicode depending on runtime conditions, I can't even hardcode an.encode() call -- it would fail if the return type is a bytes.

This basically goes against the "tell, don't ask" pattern, and thePythonically idempotent approach. That is, Python builtins normallyreturn you back the same thing if it's already what you want -int(someInt)-> someInt, iter(someIter)->someIter, etc.

Since this incantation may need to be used often, and in places thatare not known to me in advance, I would like it to not impose newoverhead in unexpected places. (i.e., the usual argument broughtagainst making changes to the 'list' type that would change certainoperations from O(1) to O(log something)).

It's more about predictability, and having One *Obvious* Way To DoIt, as opposed to "several ways, which you need to think carefullyabout and restructure your entire architecture around ifnecessary". One obvious way means I can focus on the mechanicaleffort of porting *first*, without having to think.

So, the performance issue isn't really about performance *per se*, somuch as about the "mental UI" of the language. You could just aseasily lie and tell me that your bstr implementation is O(1), and Iwould probably be happy and never notice, because the issue was neverreally about performance as such, but about having to *think* aboutit. (i.e., breaking flow.)

Really, the entire issue can presumably be dealt with by some seriesof incantations - it's just code after all. But having to sit andthink about *every* situation where I'm dealing with bytes/unicodedistinctions seems like a torture compared to being able to say,"okay, so when dealing with this sort of API and this sort of data,this is the One Obvious Way to do the conversions."

It's One Obvious Way that I want, but some people seem to be arguingthat the One Obvious Way is to Think Carefully About It Every Time --and that seems to violate the "Obvious" part, IMO. ;-)


_______________________________________________
Python-Dev mailing list
Python-Dev@python.org
http://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: 
http://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com

Re: [Python-Dev] email package status in 3.X

Reply via email to