[Python-Dev] unicodedata module is out of date

Andrew Miller A.J.Miller at bcs.org.uk
Fri Sep 6 11:54:45 CEST 2013


The unicodedata module only contains data up to Unicode 5.2 (October 2009),
so attempting to reference any character from a later version e.g:

unicodedata.lookup("TURKISH LIRA SIGN")

results in a KeyError.

Also, it seems to be limited to properties in the UnicodeData.txt file and
does not contain any data from the other files from the Unicode Character
Database (the perl library Unicode::UCD is far more complete).

Are there any plans to update this module to the latest Unicode version
(6.2, with 6.3 being released shortly), or is there another module that
provides more up to date information?

Thanks,

Andrew
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-dev/attachments/20130906/48935204/attachment.html>


More information about the Python-Dev mailing list