Possible Unicode Bug in Pythonwin on version 2.2.2 ?

Chris Lyon chris.lyon at spritenote.co.uk
Thu Feb 13 05:17:16 EST 2003


"vincent wehren" <v.wehren at home.nl> wrote in message news:<b2e5mf$poc$1 at news1.tilbu1.nb.home.nl>...
> "Chris Lyon" <chris.lyon at spritenote.co.uk> schrieb im Newsbeitrag
> news:d232c5e.0302121048.5033368c at posting.google.com...
> > In the interactive window type:-
> > s=u'La Pe\xf1a Colada'
> >
> > print s
> >
> > removes the last 'a' on Colada,
> 
> Actually, I would expect an UnicodeError
> OTOH:
> >>> print s.encode("cp1252")
>  La Peña Colada
> 
> Regards
> Vincent
> 
> 
> Sorry I didn't fully explain I have 

sitecustomize.py 

set as follows

# sitecustomize.py                   
# this file can be anywhere in your Python path,
# but it usually goes in ${pythondir}/lib/site-packages/

import sys

sys.setdefaultencoding('iso-8859-1')

So I am able to print in this fashion.. :)


So to re-iterate.

PythonWin 2.2.2 (#37, Oct 14 2002, 17:02:34) [MSC 32 bit (Intel)] on
win32.
Portions Copyright 1994-2001 Mark Hammond (mhammond at skippinet.com.au)
- see 'Help/About PythonWin' for further copyright information.
>>> s = u'La Pe\xf1a'
>>> print s.encode('latin-1')
La Peña
>>> s = u'La Pe\xf1a Colada'
>>> print s.encode('latin-1')
La Peña Colad
>>> len(s)
14
>>> s[:-1]
u'La Pe\xf1a Colad'
>>> s[-1:]
u'a'
>>> 

The trailing 'a' character is not shown as above however when cut and
paste from the pythonwin interactive window it into the above section
it does appear!.
It's a small point but It's the sort of thing that can have you
running round the houses for many a long while if you are trying to
get your head round a bit of interactive debugging.

For students of Spanish and Alcoholic Beverages, please note I don't
know if this is indeed how this particular liquid distraction is spelt
its just a little bit better that the 'fred' I originally tested with
which demonstrated the same effect turning into 'La Pe\xf1ana Fre'.




More information about the Python-list mailing list