[New-bugs-announce] [issue36671] str.lower() looses character information when working with UTF-8

Kadam Parikh report at bugs.python.org
Sat Apr 20 03:02:43 EDT 2019


New submission from Kadam Parikh <parikhkadam at gmail.com>:

When converting a particular UTF-8 character "İ" to lowercase, it doesn't behave correctly. It returns two lowercase characters instead of one. This is not as desired.

Code:

>>> print("\u0130")
İ
>>> print("\u0130".lower())
i̇
>>>

----------
components: Unicode
messages: 340563
nosy: Kadam Parikh, ezio.melotti, vstinner
priority: normal
severity: normal
status: open
title: str.lower() looses character information when working with UTF-8
type: behavior
versions: Python 3.6

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue36671>
_______________________________________


More information about the New-bugs-announce mailing list