Very strange unicode behaviour

Syver Enstad syver at inout.no
Fri Jan 16 11:16:21 EST 2004


Here's the interactive session

Python 2.2.2 (#37, Oct 14 2002, 17:02:34) [MSC 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> ord('\xe5')
229
>>> '\xe5'.find(u'')
-1
>>> 'p\xe5'.find(u'')
UnicodeError: ASCII decoding error: ordinal not in range(128)
>>> 'p\xe4'.find(u'')
-1
>>> 'p\xe5'.find(u'')
UnicodeError: ASCII decoding error: ordinal not in range(128)
>>> print '\xe5'
Õ
>>> print 'p\xe5'
pÕ
>>> 'p\xe5'
'p\xe5'
>>> def func():
...     try:
...         '\xe5'.find(u'')
...     except UnicodeError:
...         pass
...
>>> func()
>>> for each in range(1):
...     func()
...
UnicodeError: ASCII decoding error: ordinal not in range(128)
>>>

It's weird that \xe5 throws and not \xe4 but even weirder that the
exception is not cleared so that the loop reports it.

Is this behaviour the same on Python 2.3?





More information about the Python-list mailing list