[Python-3000] Unicode IDs -- why NFC? Why allow ligatures?

Jim Jewett jimjjewett at gmail.com
Thu Jun 7 01:29:09 CEST 2007


On 6/6/07, Stephen J. Turnbull <turnbull at sk.tsukuba.ac.jp> wrote:
> Jim Jewett writes:

>  > Depends on what you mean by technical symbols.  ... The math
>  > versions (generally 1D400 - 1DC7B) are included.  But
>  > http://unicode.org/reports/tr39/data/xidmodifications.txt suggests
>  > excluding them again.

> Eg, the letterlike symbols (DEGREE CELSIUS),

not an ID character

> the number forms (ROMAN NUMERAL ONE),

an ID_START (a letter), not excluded even by xidmodifications
No canonical equivalent.
Will be turned into the regular ASCII letters (only) by Kompatibility
canonicalization.

> and the APL set (2336--237A) in the BMP.

not ID characters

-jJ


More information about the Python-3000 mailing list