New submission from Christian Tanzer:
For an email message with `Content-type: text/plain; charset=utf-8`, in Python
3.5, get_payload returns a bytes object encoded with `latin-1`. Python 2.7
returns a str object encoded with `utf-8` as expected.
Running the attached test script `email_get_payload__test.py` with Python 2.7
and 3.5 shows the difference.
Python 2.7::
2.7.10.final.0 *** utf8 ***
From: Christian Tanzer <[email protected]>
To: Christian Tanzer <[email protected]>
Content-type: text/plain; charset=utf-8
Sehr geehrte Damen und Herren,
...
Danke und mit freundlichen GrĂ¼ssen,
--
Christian Tanzer http://www.c-tanzer.at/
Python 3.5::
3.5.0.final.0 *** latin-1 ***
From: Christian Tanzer <[email protected]>
To: Christian Tanzer <[email protected]>
Content-type: text/plain; charset=utf-8
Sehr geehrte Damen und Herren,
...
Danke und mit freundlichen GrĂ¼ssen,
--
Christian Tanzer http://www.c-tanzer.at/
In both Python versions, `msg.get_content_charset()` returns None, which is not
correct, either.
----------
components: Library (Lib)
files: email_get_payload__test.py
messages: 253994
nosy: [email protected]
priority: normal
severity: normal
status: open
title: email.message.get_payload returns wrong encoding
type: behavior
versions: Python 3.5
Added file: http://bugs.python.org/file40934/email_get_payload__test.py
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue25545>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com