[issue3811] Update Unicode database to 5.1.0
Martin v. Löwis
report at bugs.python.org
Thu Sep 11 08:05:23 CEST 2008
Martin v. Löwis <martin at v.loewis.de> added the comment:
> The control characters?
Indeed, also the private-use characters. test_unicode explicitly
comments that the test is about unassigned characters, although
I don't understand the purpose of that test (it then also tests
a surrogate character, which is also guaranteed to remain unassigned).
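As a rough illustration of the kinds of code points mentioned here, the following sketch asks the unicodedata module for the general category of one control, one private-use, and one surrogate code point. The sample code points and the `samples` and `categories` names are chosen for illustration only and are not taken from test_unicode; the snippet assumes a Python 3 interpreter.

```python
import unicodedata

# Illustrative code points only; these are not the ones used in test_unicode.
samples = {
    "control": "\u0000",      # U+0000
    "private-use": "\ue000",  # U+E000
    "surrogate": "\ud800",    # U+D800 (a lone surrogate, allowed in a Python 3 str)
}

# Map each label to the two-letter general category from the Unicode database.
categories = {label: unicodedata.category(ch) for label, ch in samples.items()}

for label, cat in categories.items():
    print(label, cat)
```

The database reports separate categories for these three kinds (Cc, Co and Cs), while unassigned code points are reported as Cn.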
One of the characters that is guaranteed to remain unassigned is
U+FFFE (and its mirrors in other planes, e.g. U+1FFFE, ...).
This guarantee is made to support the BOM. Along with U+FFFF,
these are non-characters. #765036 once suggested that Python should
refuse to represent them at all, but that proposal was rejected.
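To make the "mirrors in other planes" concrete, here is a minimal sketch that tests whether a code point is one of the plane-final non-characters (U+FFFE, U+FFFF, U+1FFFE, U+1FFFF, and so on). The helper name `is_plane_end_noncharacter` is hypothetical and not part of Python or of the issue; the snippet assumes Python 3 and deliberately covers only the non-characters discussed above.

```python
import unicodedata

def is_plane_end_noncharacter(cp):
    """Return True if cp is U+nFFFE or U+nFFFF for some plane n (0..16).

    Hypothetical helper for illustration; it only covers the last two
    code points of each plane.
    """
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("not a Unicode code point: %r" % (cp,))
    # The low 16 bits are FFFE or FFFF exactly when masking off bit 0
    # leaves FFFE.
    return (cp & 0xFFFE) == 0xFFFE

# Collect all such code points across the 17 planes.
plane_end = [cp for cp in range(0x110000) if is_plane_end_noncharacter(cp)]

# The Unicode database reports these as unassigned (general category Cn).
fffe_category = unicodedata.category(chr(0xFFFE))
```

Note that U+FEFF, the BOM itself, is an ordinary assigned character and is not matched; it is its byte-swapped form U+FFFE that stays unassigned.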
Python tracker <report at bugs.python.org>