Incorrect title case?
MRAB
google at mrabarnett.plus.com
Fri Jan 16 17:07:03 EST 2009
Python 2.6.1
I've just found that the following 4 Unicode characters/codepoints don't
behave as I'd expect: Dž (U+01C5), Lj (U+01C8), Nj (U+01CB), Dz (U+01F2).
For example, u"\u01C5".istitle() returns True and
unicodedata.category(u"\u01C5") returns "Lt", but u"\u01C5".title()
returns u'\u01C4', which is the uppercase equivalent. Are these mistakes
in the Unicode database?
More information about the Python-list
mailing list