[Python-3000] Unicode IDs -- why NFC? Why allow ligatures?

Hagen Fürstenau hfuerstenau at gmx.net
Wed Jun 6 08:01:04 CEST 2007


Stephen J. Turnbull writes:

>  > http://www.unicode.org/versions/corrigendum3.html suggests that many
>  > of the Hangul are either pronunciation guide variants or even exact
>  > duplicates (that were presumably missed when the canonicalization was
>  > frozen?)
> 
> I'll have to ask some Koreans what they would use.

The Windows Korean Input Method chooses between Unified Han and
Compatibility characters based on the reading you use to enter them. So
I guess most Koreans won't be aware of what variant they're using at any
given moment. Seems to me that NFKC would be essential here.




More information about the Python-3000 mailing list