[issue10686] email.Generator should use unknown-8bit encoded words for headers with 8 bit data
R. David Murray
report at bugs.python.org
Sun Dec 12 19:01:49 CET 2010
New submission from R. David Murray <rdmurray at bitdance.com>:
This is a followon to Issue 4661. The fix for that issue introduced a way to parse messages containing 8bit bytes. When Generator is called on a model containing 8 bit bytes, it converts it to 7bit clean. There is, however, a bug in this conversion process: currently when encountering 8bit bytes in headers, it simply replaces then with ?. According to the RFCs[*], what it should do instead is to replace them with encoded words using the 'charset' "unknown-8bit".
[*] I'm specifically referring to RFC 1428...email is effectively acting as a translating gateway when requested to do the 8bit to 7bit conversion. Although that RFC does not explicitly say that the unknown-8bit charset should be used in encoded words, it does imply it strongly in its section 3 prescription.
stage: needs patch
title: email.Generator should use unknown-8bit encoded words for headers with 8 bit data
versions: Python 3.2
Python tracker <report at bugs.python.org>
More information about the Python-bugs-list