[New-bugs-announce] [issue33108] Unicode char 304 in lowercase has len 2

Kiril Dimitrov report at bugs.python.org
Tue Mar 20 09:22:06 EDT 2018


New submission from Kiril Dimitrov <kiril.dimitroff at gmail.com>:

>>> chr(304)
'İ'
>>> chr(304).lower()
'i̇'
>>> len(chr(304).lower())
2

This breaks unicode text matching. There is no other unicode character with the same behaviour (in 3.6.2 and 3.6.4).

----------
components: Unicode
messages: 314142
nosy: Kiril Dimitrov, ezio.melotti, vstinner
priority: normal
severity: normal
status: open
title: Unicode char 304 in lowercase has len 2
type: behavior
versions: Python 3.6

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue33108>
_______________________________________


More information about the New-bugs-announce mailing list