[docs] [issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes

Andrew Svetlov report at bugs.python.org
Fri May 3 20:06:45 EDT 2019


Andrew Svetlov <andrew.svetlov at gmail.com> added the comment:

This is right for 99.99% cases: utf8 doesn't encode any character except explicit zero with zero bytes.

UTF-16 for example encodes 'a' as b'\xff\xfea\x00'

----------
nosy: +asvetlov

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue36789>
_______________________________________


More information about the docs mailing list