Re: [Email-SIG] fixing the current email module

Glenn Linderman Thu, 08 Oct 2009 15:39:38 -0700

On approximately 10/8/2009 6:00 AM, came the following characters fromthe keyboard of Barry Warsaw:

On Oct 8, 2009, at 3:29 AM, Glenn Linderman wrote:
And I agree that APIs to retrieve any MIME part as undecoded bytes isappropriate; and to retrieve it as decoded strings is appropriate fortext MIME parts. Not sure that non-text MIME parts need to supportbeing returned as strings.
I hate to open another can of worms, but I've been thinking about thisa lot too :). It's been discussed on list before, so nothing newhere. I think the parser and MIME classes need to be hookable fordecoding their contents. For example, if you have a text/* it mightwell make sense to support bytes() and str()/unicode() on the partinstance. But if it's image/* str() makes no sense. part.decode() orsomething similar makes sense, but this needs to be extensible becausethe email package will not know how to convert every content-type. Atbest it will only know how to decode content-types that Python'sstdlib knows about.


Seems like the following should be obtainable from a MIME parts:

1) wire format. Either what came in, in the parser case, or what wouldbe generated.

2) internal headers from the MIME part

3) decoded BLOB. This means that quopri and base64 are decoded, no moreand no less. This is bytes. No headers, only payload. ForContent-Transfer-Encoding: binary, this is mostly a noop.4) text/* parts should also be obtainable as str()/unicode(), payloadonly. This is where charset decoding is done.

I think your talk in the next paragraph about hooks and other objecttypes being produced is a generalization of 4, not 3, and generally noadditional decoding needs to be done, just conversion to the rightobject type (or file, or file-like object).

The problem is that if the bytes came off the wire, the parsercurrently can only attach the most basic MIME base class. It doesn'tknow that an image/png should create a MIMEImagePNG instance there.This is different from hacking the model directly because theapplication can instantiate the right class. So the parser either hasto have a hookable way for an application to go from content-type toclass, or the generic MIME base class needs to be hookable in its.decode() method.

So either the email package can stop at 3, and 4 only for text/* parts,or it could learn more types (registered types, with well-definedcorresponding objects could be potentially built-in to the emailpackage), and/or it could become hookable for application types. Ofcourse, for disposition to files, storing the BLOB in a file of theright name is adequate... to avoid the file, I agree that converting toa useful object type is handy. But maybe file-like objects wouldsuffice, for most of the types.


--
Glenn -- http://nevcal.com/
===========================
A protocol is complete when there is nothing left to remove.
-- Stuart Cheshire, Apple Computer, regarding Zero Configuration Networking

_______________________________________________
Email-SIG mailing list
[email protected]
Your options: 
http://mail.python.org/mailman/options/email-sig/archive%40mail-archive.com

Re: [Email-SIG] fixing the current email module

Reply via email to