[Python-3000] Unicode IDs -- why NFC? Why allow ligatures?
Jim Jewett
jimjjewett at gmail.com
Thu Jun 7 01:29:09 CEST 2007
On 6/6/07, Stephen J. Turnbull <turnbull at sk.tsukuba.ac.jp> wrote:
> Jim Jewett writes:
> > Depends on what you mean by technical symbols. ... The math
> > versions (generally 1D400 - 1D7CB) are included. But
> > http://unicode.org/reports/tr39/data/xidmodifications.txt suggests
> > excluding them again.
> Eg, the letterlike symbols (DEGREE CELSIUS),
not an ID character
> the number forms (ROMAN NUMERAL ONE),
an ID_START (a letter), not excluded even by xidmodifications
It has no canonical equivalent, so NFC leaves it alone.
It will be turned into the regular ASCII letters (only) by Kompatibility
normalization (NFKC/NFKD).
> and the APL set (2336--237A) in the BMP.
not ID characters
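
For anyone who wants to check these three cases, here is a minimal sketch
using Python's standard unicodedata module. It assumes a Python 3
interpreter, where str.isidentifier() reflects the identifier rules as
they were eventually implemented, not necessarily the draft under
discussion here. The describe() helper and the choice of U+2336 as the
APL representative are mine, purely for illustration.

```python
import unicodedata

# One sample code point per group mentioned above.
DEGREE_CELSIUS = "\u2103"      # letterlike symbols
ROMAN_NUMERAL_ONE = "\u2160"   # number forms
APL_I_BEAM = "\u2336"          # first code point of the APL range 2336-237A


def describe(ch):
    """Return the general category, identifier status and normal forms of ch."""
    return {
        "category": unicodedata.category(ch),
        # True only if ch alone is a valid identifier (i.e. it can start one).
        "is_identifier": ch.isidentifier(),
        # Canonical composition: changes only canonically equivalent sequences.
        "nfc": unicodedata.normalize("NFC", ch),
        # Compatibility composition: also folds compatibility variants.
        "nfkc": unicodedata.normalize("NFKC", ch),
    }


if __name__ == "__main__":
    for ch in (DEGREE_CELSIUS, ROMAN_NUMERAL_ONE, APL_I_BEAM):
        print("U+%04X" % ord(ch), describe(ch))
```

On a current interpreter this should report ROMAN NUMERAL ONE as the
only one of the three usable as an identifier, unchanged by NFC but
folded to a plain "I" by NFKC.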
-jJ