[issue26917] unicodedata.normalize(): bug in Hangul Composition

Armin Rigo report at bugs.python.org
Tue May 3 06:29:22 EDT 2016


Armin Rigo added the comment:

See also https://bitbucket.org/pypy/pypy/issues/2289/incorrect-unicode-normalization .  It seems that you reached the same conclusion than the OP in that issue: the problem would really be that normalizing "\uafb8\u11a7" should not drop the second character.  Both Python and PyPy do that, but Python adds the "is_normalized()" check, so in some cases it returns the correct unmodified result.

----------

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue26917>
_______________________________________


More information about the Python-bugs-list mailing list