[New-bugs-announce] [issue7045] utf-8 encoding error

Arc Riley report at bugs.python.org
Sat Oct 3 05:46:53 CEST 2009


New submission from Arc Riley <arcriley at gmail.com>:

The following is a minimal example which does not work under Python
3.1.1 but functions as expected on Pyhton 2.6 and 3.0.

Python 3.1.1 believes the single UTF-8 glyph is two entirely different
(and illegal) unicode characters:

Traceback (most recent call last):
  File "t.py", line 2, in <module>
    print('𐑛')
UnicodeEncodeError: 'utf-8' codec can't encode character '\ud801' in
position 0: surrogates not allowed

Test system is Ubuntu 9.10-beta 32-bit

----------
components: Interpreter Core
files: t.py
messages: 93475
nosy: ArcRiley
severity: normal
status: open
title: utf-8 encoding error
type: behavior
versions: Python 3.1
Added file: http://bugs.python.org/file15028/t.py

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue7045>
_______________________________________


More information about the New-bugs-announce mailing list