[docs] [issue13997] Clearly explain the bare minimum Python 3 users should know about Unicode

STINNER Victor report at bugs.python.org
Sun Feb 12 10:55:49 CET 2012


STINNER Victor <victor.stinner at haypocalc.com> added the comment:

Why do you use Unicode with the ugly surrogateescape error handler in
this case? Bytes are just fine for such a use case.

The surrogateescape error handler produces unusual characters (lone
surrogates) in the range U+DC80-U+DCFF. They cannot be printed to a
console because sys.stdout uses the strict error handler, and
sys.stderr uses the backslashreplace error handler. If I remember
correctly, only the UTF-7 encoder allows lone surrogate characters.
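
A minimal sketch of the behaviour described above. The sample bytes and
variable names are made up for illustration, and it encodes to UTF-8
directly instead of writing to a real console:

```python
# Hypothetical input: Latin-1 encoded bytes, which are not valid UTF-8.
raw = b"caf\xe9"

# surrogateescape maps each undecodable byte 0x80-0xFF to U+DC80-U+DCFF.
text = raw.decode("utf-8", errors="surrogateescape")
lone = text[-1]  # the lone surrogate standing in for byte 0xE9

# The strict error handler refuses to encode a lone surrogate.
try:
    text.encode("utf-8", errors="strict")
    strict_ok = True
except UnicodeEncodeError:
    strict_ok = False

# backslashreplace emits an escape sequence instead of failing.
escaped = text.encode("utf-8", errors="backslashreplace")

# Encoding with surrogateescape again restores the original bytes.
roundtrip = text.encode("utf-8", errors="surrogateescape")
```

If the data is kept as bytes from the start, none of this is needed.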

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue13997>
_______________________________________

