Latin-1 is the encoding that maps every byte (0-255) to the Unicode character with the same number. So it's special in that sense, and it gets used when mapping 8-bit bytes via Unicode "without encoding". Excuse my imprecise language here, I don't know the correct Unicode terms without going & looking them up. Paul On Thu, 2 Jul 2020 at 13:48, Barry Scott <barry@barrys-emacs.org> wrote:
On 29 Jun 2020, at 10:57, Victor Stinner <vstinner@python.org> wrote:
I would prefer to only have a fast-path for the most common encodings: ASCII, Latin1, UTF-8, Windows ANSI code page. That's all.
It's not obvious to me why the latin1 encoding is in this list as its just one of all the 8-bit char sets. Why is it needed?
Barry
_______________________________________________ Python-Dev mailing list -- python-dev@python.org To unsubscribe send an email to python-dev-leave@python.org https://mail.python.org/mailman3/lists/python-dev.python.org/ Message archived at https://mail.python.org/archives/list/python-dev@python.org/message/XGJ5NG4W... Code of Conduct: http://python.org/psf/codeofconduct/