This is an excellent enumeration of some of the concerns!
One minor comment about the introductory material:
On Mon, Nov 1, 2021 at 5:21 AM Petr Viktorin
Introduction ============
Python code is written in `Unicode`_ – a system for encoding and handling all kinds of written language.
Unicode specifies the mapping of glyphs to code points. Then a second mapping from code points to sequences of bytes is what is actually recorded by the computer. The second mapping is what programmers using Python will commonly think of as the encoding while the majority of what you're writing about has more to do with the first mapping. I'd try to word this in a way that doesn't lead a reader to conflate those two mappings. Maybe something like this? `Unicode`_ is a system for handling all kinds of written language. It aims to allow any character from any human natural language (as well as a few characters which are not from natural languages) to be used. Python code may consist of almost all valid Unicode characters.
While this allows programmers from all around the world to express themselves, it also allows writing code that is potentially confusing to readers.
-Toshio