[pypy-issue] Issue #2857: str.casefold is _still_ wrong for some codepoints (Cherokee) (pypy/pypy)

byllyfish issues-reply at bitbucket.org
Wed Jul 11 21:40:57 EDT 2018


New issue 2857: str.casefold is _still_ wrong for some codepoints (Cherokee)
https://bitbucket.org/pypy/pypy/issues/2857/strcasefold-is-_still_-wrong-for-some

byllyfish:

Under CPython, '\u13a0' correctly casefolds to itself ('\u13a0').

Under pypy3 (5.10.1), '\u13a0' casefolds to '\uab70'.

```
Python 3.5.3 (3f6eaa010fce, Jan 11 2018, 05:27:47)
[PyPy 5.10.1 with GCC 4.2.1 Compatible Apple LLVM 9.0.0 (clang-900.0.39.2)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>>> '\u13a0'.casefold()
'\uab70'
```

Since '\uab70' and '\u13a0' casefold to each other under PyPy, neither is stable under case folding: applying casefold() a second time changes the result again.
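
The stability property described above can be checked mechanically. The following is a minimal sketch, not part of the original report: the helper name `unstable_under_casefold` and the choice of code point ranges (Cherokee letters U+13A0..U+13F5 and Cherokee small letters U+AB70..U+ABBF) are my own assumptions for illustration. It lists every character whose casefolded form changes when folded a second time.

```python
def unstable_under_casefold(chars):
    """Return the characters whose casefold() result is not a fixed point.

    Hypothetical helper for illustration: a correct casefold() should be
    idempotent, i.e. folding an already folded string changes nothing.
    """
    return [c for c in chars if c.casefold().casefold() != c.casefold()]


# Assumed ranges: Cherokee letters and Cherokee small letters.
cherokee = [chr(cp) for cp in range(0x13A0, 0x13F6)]
cherokee += [chr(cp) for cp in range(0xAB70, 0xABC0)]

unstable = unstable_under_casefold(cherokee)

# Print the offending code points, if any, in \uXXXX-style notation.
print([hex(ord(c)) for c in unstable])
```

On an interpreter that behaves as CPython is described above, the printed list should be empty; on the affected PyPy build, '\u13a0' would be expected to show up in it.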

Related to fixed issue #2544.



