[issue2541] Unicode escape sequences not parsed in raw strings.

Marc-Andre Lemburg report at bugs.python.org
Tue Apr 8 22:03:29 CEST 2008


Marc-Andre Lemburg <mal at egenix.com> added the comment:

You can't change the codec - it's being used in other places as well,
e.g. for use cases where you need to have an 8-bit encoded readable
version of a Unicode object (which happens to be Latin-1 + Unicode
escapes for all non-Latin-1 characters, due to Unicode being a superset
of Latin-1).

Adding a new codec would be fine, though I don't know how this would map
raw Unicode strings with non-Latin-1 characters in them to an 8-bit
string. Perhaps this is not needed at all in Py3k.

__________________________________
Tracker <report at bugs.python.org>
<http://bugs.python.org/issue2541>
__________________________________


More information about the Python-bugs-list mailing list