Glyphs and graphemes [was Re: Cult-like behaviour]

Marko Rauhamaa marko at
Tue Jul 17 01:26:45 EDT 2018

Steven D'Aprano <steve+comp.lang.python at>:
> On Mon, 16 Jul 2018 22:51:32 +0300, Marko Rauhamaa wrote:
>> UTF-8 bytes can only represent the first 128 code points of Unicode.
> This is DailyWTF material. Perhaps you want to rethink your wording
> and maybe even learn a bit more about Unicode and the UTF encodings
> before making such statements.
> The idea that UTF-8 bytes cannot represent the whole of Unicode is not
> even wrong. Of course a *single* byte cannot, but a single byte is not
> "UTF-8 bytes".

So I hope that by now you have understood my point and been able to
decide if you agree with it or not.


More information about the Python-list mailing list