Unicode problem
"Martin v. Löwis"
martin at v.loewis.de
Sat Jul 7 12:55:58 EDT 2007
> I agree, but the problem is much subtle. I have coverted a text from
> iso-8859-1 to utf-8 and the codecs have translated \x27 ( the iso
> apostrophe ) to \xe28099 in utf-8 ( or u'2019' in unicode code point
> notation )
What software did you use to make that so? The Python codec certainly
never would do such a thing.
Are you sure it was latin-1 and \x27, and not windows-1252 and \x92?
Regards,
Martin
More information about the Python-list
mailing list