[docs] [issue36789] Unicode HOWTO incorrectly states that UTF-8 contains no zero bytes
report at bugs.python.org
Sat May 4 21:27:28 EDT 2019
mbiggs <pythonbugs at doubleplum.net> added the comment:
So a correct statement would be "A UTF-8 string is turned into a sequence of bytes that contains embedded zero bytes only where they represent the NULL character (U+0000)."
I think it's important to correct this because the part about processing UTF-8 with C functions like strcpy(), was wrong and could cause bugs.
Python tracker <report at bugs.python.org>
More information about the docs