[Python-3000] PEP: Supporting Non-ASCII Identifiers

Rauli Ruohonen rauli.ruohonen at gmail.com
Sun Jun 3 15:12:20 CEST 2007


(sorry about replying to so old mail, but I didn't find a better place
to put this)

On 5/1/07, "Martin v. Löwis" <martin at v.loewis.de> wrote:
> All identifiers are converted into the normal form NFC while parsing;

Actually, shouldn't the whole file be converted to NFC, instead of
only identifiers? If you have decomposable characters in strings and
your editor decides to normalize them to a different form than in the
original source, the meaning of the code will change when you save
without you noticing anything.

It's always better to be explicit when you want to make invisible
distinctions. In the rare cases anything but NFC is really needed you
can do explicit conversion or use escapes. Having to add normalization
calls around all unicode strings to code defensively is neither
convenient nor obvious.


More information about the Python-3000 mailing list