[New-bugs-announce] [issue16684] Unicode property value abbreviated names and long names
report at bugs.python.org
Fri Dec 14 18:33:12 CET 2012
New submission from Pander:
The unicodedata package supports looking up property values (general category, bidirectional class and East Asian width) for Unicode characters.
The abbreviated name of the specific category is returned. However, for certain applications it is important to be able to get from the abbreviated name to the long name and vice versa.
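To illustrate the current behaviour described above, here is a small sketch using the existing unicodedata functions. The variable names are only for illustration.

```python
import unicodedata

ch = "A"

# Each lookup returns only the abbreviated property value name.
general_category = unicodedata.category(ch)          # abbreviated general category
bidi_class = unicodedata.bidirectional(ch)           # abbreviated bidirectional class
east_asian_width = unicodedata.east_asian_width(ch)  # abbreviated East Asian width

print(general_category, bidi_class, east_asian_width)
```

Nothing in the module maps these abbreviations to their long names, which is the gap this issue is about.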
The data needed to do this can be found at
# General_Category (gc)
# Bidi_Class (bc)
# East_Asian_Width (ea)
Use only the second (abbreviated name) and third (long name) fields, ignoring other fields and any comments.
For general category, also support translation back and forth of the one-letter abbreviations, which are groups representing the two-letter general category abbreviations with the same initial letter.
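A rough sketch of the parsing described above might look like the following. It assumes the data is semicolon-separated with the property abbreviation (gc, bc, ea) in the first field and "#" starting a comment. The SAMPLE text is a tiny hand-written excerpt in that assumed layout, and parse_aliases and group_of are hypothetical helper names, not part of unicodedata.

```python
# Hand-written excerpt in the assumed "property ; abbreviated ; long" layout.
SAMPLE = """\
# General_Category (gc)
gc ; L  ; Letter   # Ll | Lm | Lo | Lt | Lu
gc ; Lu ; Uppercase_Letter
gc ; Ll ; Lowercase_Letter
gc ; Cc ; Control ; cntrl
# Bidi_Class (bc)
bc ; AL ; Arabic_Letter
bc ; L  ; Left_To_Right
# East_Asian_Width (ea)
ea ; Na ; Narrow
ea ; W  ; Wide
"""


def parse_aliases(text, properties=("gc", "bc", "ea")):
    """Return {property: {abbreviated name: long name}} for the given properties."""
    tables = {prop: {} for prop in properties}
    for line in text.splitlines():
        # Drop comments, then skip lines that are empty afterwards.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        fields = [field.strip() for field in line.split(";")]
        if len(fields) < 3 or fields[0] not in tables:
            continue
        # Second field is the abbreviated name, third is the long name;
        # any further fields are ignored.
        tables[fields[0]][fields[1]] = fields[2]
    return tables


def group_of(category_abbr):
    """Return the one-letter group of a two-letter general category."""
    return category_abbr[0]


tables = parse_aliases(SAMPLE)
print(tables["gc"]["Lu"])            # long name of a two-letter category
print(tables["gc"][group_of("Lu")])  # long name of its one-letter group
```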
Please extend this package with a way of translating back and forth between abbreviated name and long name for property values defined in Unicode for general category, bidirectional class and East Asian width. This functionality should be independent of retrieving the abbreviated names for Unicode characters, which is available now, and should be accessible via separate methods or dictionaries in which developers can perform lookups themselves.
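One possible shape for such dictionaries is sketched below. The names GC_LONG_NAMES and GC_ABBREVIATIONS are hypothetical and do not exist in unicodedata, and only a few entries are shown rather than the full data.

```python
import unicodedata

# Hypothetical lookup table: abbreviated name -> long name (partial).
GC_LONG_NAMES = {
    "Lu": "Uppercase_Letter",
    "Ll": "Lowercase_Letter",
    "Nd": "Decimal_Number",
}

# Reverse table: long name -> abbreviated name.
GC_ABBREVIATIONS = {long_name: abbr for abbr, long_name in GC_LONG_NAMES.items()}

# The tables are independent of character lookup, but can be combined with it.
abbr = unicodedata.category("7")
long_name = GC_LONG_NAMES[abbr]
print(abbr, long_name)
print(GC_ABBREVIATIONS["Uppercase_Letter"])
```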
Implementing the functionality requested in this issue would allow Python developers to get from an abbreviated property value to a meaningful property value name and vice versa without having to retrieve this information from the Unicode Consortium and/or ship it with their code, with the risk of using outdated information.
nosy: PanderMusubi, ezio.melotti
title: Unicode property value abbreviated names and long names
versions: Python 3.5
Python tracker <report at bugs.python.org>