[Python-3000] Unicode strings, identifiers, and import
Jason Orendorff
jason.orendorff at gmail.com
Mon May 14 17:42:24 CEST 2007
On 5/14/07, Guido van Rossum <guido at python.org> wrote:
> Isn't normalization also going to be an issue with using non-ASCII in
> general? Does it mean that Python will have to use a normalization
> before comparing identifiers as equal? That's terrible, as it will
> vastly increase the amount needed to hash a string, too.
PEP 3131 addresses this. The tokenizer would normalize identifier
tokens to NFC. Because this happens so early, the rest of Python
would be unaffected.
-j
More information about the Python-3000
mailing list