[docs] [issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes

Andrew Svetlov report at bugs.python.org
Fri May 3 20:06:45 EDT 2019

Andrew Svetlov <andrew.svetlov at gmail.com> added the comment:

This is right for 99.99% cases: utf8 doesn't encode any character except explicit zero with zero bytes.

UTF-16 for example encodes 'a' as b'\xff\xfea\x00'

nosy: +asvetlov

Python tracker <report at bugs.python.org>

More information about the docs mailing list