[issue11461] Reading UTF-16 with codecs.readline() breaks on surrogate pairs

Amaury Forgeot d'Arc report at bugs.python.org
Thu Mar 10 13:19:31 CET 2011


Amaury Forgeot d'Arc <amauryfa at gmail.com> added the comment:

The utf16 incremental codec does not like incomplete surrogate pairs.
Patch attached.
I also plan to refactor all the test_partial() functions of test_codecs, to give them a common implementation.

----------
keywords: +patch
nosy: +amaury.forgeotdarc
Added file: http://bugs.python.org/file21071/partial_utf16.patch

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue11461>
_______________________________________


More information about the Python-bugs-list mailing list