This relates to the second part of Werner's message being scrubbed
with the notice:

    An embedded and charset-unspecified text was scrubbed...

Poking around in the email package (on Python 2.4.4) shows:

    def get_content_charset(self, failobj=None):
        """Return the charset parameter of the Content-Type header.

        The returned string is always coerced to lower case.  If there is no
        Content-Type header, or if that header has no charset parameter,
        failobj is returned.
        """

This seems to violate section 5.2 of RFC 2045, which says parts lacking
a Content-Type header should be assumed to be text/plain with a
charset of us-ascii.  The get_content_type method in email.Message
does mention RFC 2045 and falls back to text/plain if the Content-Type
is invalid.  Would it be appropriate to pass failobj="us-ascii" when
calling get_content_charset in Scrubber.py?
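
To make the difference concrete, here is a minimal sketch using only the
email package (this is not Mailman's actual Scrubber.py code).  The sample
message and the helper name effective_charset are made up for
illustration, and the print call assumes a current Python rather than 2.4:

```python
import email

# Made-up two-part message: the first part declares a charset, the second
# part has no Content-Type header at all.
RAW = (
    'MIME-Version: 1.0\n'
    'Content-Type: multipart/mixed; boundary="XXX"\n'
    '\n'
    '--XXX\n'
    'Content-Type: text/plain; charset="ISO-8859-1"\n'
    '\n'
    'first part\n'
    '--XXX\n'
    '\n'
    'second part, no Content-Type header\n'
    '--XXX--\n'
)


def effective_charset(part):
    """Hypothetical helper: fall back to the RFC 2045 default charset."""
    return part.get_content_charset('us-ascii')


msg = email.message_from_string(RAW)
# Only look at the leaf parts, not the multipart container itself.
leaf_parts = [p for p in msg.walk() if not p.is_multipart()]

for part in leaf_parts:
    # Columns: content type, charset with default failobj, charset with fallback
    print(part.get_content_type(),
          part.get_content_charset(),
          effective_charset(part))
```

For the second part, get_content_charset() with the default failobj
should come back as None, while the fallback version reports us-ascii.
The RFC default is only meaningful for text parts, so a real fix would
probably want to apply it just to those.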

BTW, if it's easier for you guys to track this in bugzilla, let me
know and I'll file one on SF.

-- 
Todd        OpenPGP -> KeyID: 0xBEAF0CE3 | URL: www.pobox.com/~tmz/pgp
======================================================================
The marvel of all history is the patience with which men and women
submit to burdens unnecessarily laid upon them by their governments.
    -- William H. Borah

------------------------------------------------------
Mailman-Users mailing list