[New-bugs-announce] [issue33108] Unicode char 304 in lowercase has len 2
Kiril Dimitrov
report at bugs.python.org
Tue Mar 20 09:22:06 EDT 2018
New submission from Kiril Dimitrov <kiril.dimitroff at gmail.com>:
>>> chr(304)
'İ'
>>> chr(304).lower()
'i̇'
>>> len(chr(304).lower())
2
This breaks Unicode text matching. No other Unicode character has a lowercase form longer than one code point (in Python 3.6.2 and 3.6.4).
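The behaviour reported above can be reproduced and inspected with a short script. This is a minimal sketch, not part of the original report: the names `dotted_i`, `lowered`, `code_points`, `multi`, and the helper `same_caseless` are hypothetical and introduced only for illustration. It shows which code points make up the lowercase form, scans the whole code point range for other characters whose `str.lower()` result is longer than one code point, and compares strings with `str.casefold()` applied to both sides as one possible way to keep matching consistent.

```python
import sys

# U+0130 LATIN CAPITAL LETTER I WITH DOT ABOVE (decimal 304)
dotted_i = chr(0x130)
lowered = dotted_i.lower()

# The lowercase form is 'i' followed by U+0307 COMBINING DOT ABOVE.
code_points = [hex(ord(c)) for c in lowered]
print(code_points)  # ['0x69', '0x307']

# Scan every code point for a lowercase form longer than one code point.
multi = [cp for cp in range(sys.maxunicode + 1) if len(chr(cp).lower()) > 1]
print([hex(cp) for cp in multi])


def same_caseless(a, b):
    """Hypothetical helper: compare two strings after case folding both."""
    return a.casefold() == b.casefold()


print(same_caseless(dotted_i, lowered))  # True
```

Folding both operands keeps the comparison symmetric, so 'İ' matches its own lowercase form. It does not make 'İ' match a plain 'i', because the combining dot is still present after folding.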
----------
components: Unicode
messages: 314142
nosy: Kiril Dimitrov, ezio.melotti, vstinner
priority: normal
severity: normal
status: open
title: Unicode char 304 in lowercase has len 2
type: behavior
versions: Python 3.6
_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue33108>
_______________________________________