[docs] [issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes

mbiggs report at bugs.python.org
Sat May 4 21:27:28 EDT 2019


mbiggs <pythonbugs at doubleplum.net> added the comment:

So a correct statement would be "A UTF-8 string is turned into a sequence of bytes that contains embedded zero bytes only where they represent the NULL character (U+0000)."

I think it's important to correct this because the part about processing UTF-8 with C functions like strcpy() was wrong and could cause bugs.
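
To illustrate the point, here is a minimal sketch in Python. The helper names utf8_zero_byte_offsets() and c_string_view() are hypothetical and exist only for this example; c_string_view() only imitates how a NUL-terminated C routine would see the buffer and does not call strcpy() itself.

```python
def utf8_zero_byte_offsets(text):
    """Return the offsets of zero bytes in the UTF-8 encoding of text."""
    encoded = text.encode("utf-8")
    return [i for i, byte in enumerate(encoded) if byte == 0]


def c_string_view(data):
    """Imitate a NUL-terminated C function: keep bytes before the first zero."""
    return data.split(b"\x00", 1)[0]


# No U+0000 in the text, so the encoding contains no zero bytes,
# even though it includes 2-, 3- and 4-byte sequences.
clean = "caf\u00e9 \u20ac \U0001F600"
assert utf8_zero_byte_offsets(clean) == []

# An embedded U+0000 is encoded as a single zero byte.
sample = "caf\u00e9\u0000na\u00efve"
encoded = sample.encode("utf-8")
assert utf8_zero_byte_offsets(sample) == [5]

# A C-style consumer would stop at that byte and lose the rest.
truncated = c_string_view(encoded)
assert truncated == "caf\u00e9".encode("utf-8")
assert len(truncated) < len(encoded)
```

The last two assertions show the kind of silent truncation that can happen when such data is handed to code that treats a zero byte as the end of the string.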

----------

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue36789>
_______________________________________

