Re: [Python-Dev] [Python-checkins] r87433 - python/branches/py3k/Doc/reference/lexical_analysis.rst

On 12/21/2010 8:37 PM, alexander.belopolsky wrote:
Author: alexander.belopolsky Date: Wed Dec 22 02:37:36 2010 New Revision: 87433
Log: Both PEP 3131 and the current implementation use NFKC normalization for identifiers. Fixed the documentation to agree.
Modified: python/branches/py3k/Doc/reference/lexical_analysis.rst
--- python/branches/py3k/Doc/reference/lexical_analysis.rst (original)
+++ python/branches/py3k/Doc/reference/lexical_analysis.rst Wed Dec 22 02:37:36 2010
@@ -309,8 +309,8 @@
 - *Nd* - decimal numbers
 - *Pc* - connector punctuations
-All identifiers are converted into the normal form NFC while parsing; comparison
-of identifiers is based on NFC.
+All identifiers are converted into the normal form NFKC while parsing; comparison
+of identifiers is based on NFKC.
 A non-normative HTML file listing all valid identifier characters for Unicode 4.1 can be found at
Has that file been updated for Unicode 6.0?
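
For readers following the NFC versus NFKC change in the commit above, here is a minimal sketch of the difference, using only the standard `unicodedata` module. The sample identifier (the "fi" ligature followed by "le") is chosen purely for illustration and is not taken from the thread.

```python
import unicodedata

# U+FB01 LATIN SMALL LIGATURE FI is a compatibility character:
# NFC leaves it untouched, while NFKC folds it into the letters "fi".
ligature_name = "\ufb01le"

nfc = unicodedata.normalize("NFC", ligature_name)
nfkc = unicodedata.normalize("NFKC", ligature_name)

# The parser normalizes identifiers with NFKC, so an assignment
# written with the ligature binds the plain ASCII name "file".
namespace = {}
exec("\ufb01le = 1", namespace)

print(nfc == ligature_name)   # NFC does not change the ligature
print(nfkc)                   # NFKC yields the ASCII spelling
print("file" in namespace)    # the bound name is the NFKC form
```

If the documentation had been right about NFC, the two spellings would have remained distinct names.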

On Tue, Dec 21, 2010 at 9:16 PM, Terry Reedy <tjreedy@udel.edu> wrote:
..
A non-normative HTML file listing all valid identifier characters for Unicode 4.1 can be found at http://www.dcl.hpi.uni-potsdam.de/home/loewis/table-3131.html
Has that file been updated for Unicode 6.0?
Apparently not. In Python 3.2:
>>> '\N{KANNADA SIGN JIHVAMULIYA}'.isidentifier()
True
$ curl -s http://www.dcl.hpi.uni-potsdam.de/home/loewis/table-3131.html | grep JIHVAMULIYA
$
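
The same check can be run from a script, alongside the Unicode database version the interpreter was built with. This is only a sketch of how one might compare an interpreter against a static table such as the one linked above; the variable names are illustrative.

```python
import unicodedata

# Version of the Unicode Character Database bundled with this interpreter.
ucd_version = unicodedata.unidata_version

# The character used in the check above, looked up by its Unicode name.
ch = "\N{KANNADA SIGN JIHVAMULIYA}"

# General category as reported by the bundled database, and whether the
# interpreter accepts the character as a one-character identifier.
category = unicodedata.category(ch)
is_identifier = ch.isidentifier()

print(ucd_version, hex(ord(ch)), category, is_identifier)
```

A static table generated for an older Unicode version can disagree with `str.isidentifier()` whenever the interpreter bundles a newer database.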
participants (2)
- Alexander Belopolsky
- Terry Reedy