xml and unicode problems
Martin v. Löwis
martin.vonloewis at hpi.uni-potsdam.de
Wed Mar 26 11:32:45 CET 2003
>   File "menews.py", line 123, in normalize
>     return escape(" ".join(result).strip())
>   File "menews.py", line 69, in <lambda>
>     escape = lambda x: xml.sax.saxutils.escape(x).encode('UTF-8')
> UnicodeDecodeError: 'ascii' codec can't decode byte 0x93 in position 0: ordinal not in range(128)
> Would appreciate any hints as to a solution, code available upon
It would be good if you could report the value of "result" in line 123.
It looks like you have some CP1252 data in there; 0x93 is
LEFT DOUBLE QUOTATION MARK (U+201C) in CP1252.
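Calling .encode('UTF-8') on a byte string makes Python first decode
it implicitly with the ASCII codec, and that is the step that fails.
You should decode the byte strings to Unicode explicitly, and only
then encode the result as UTF-8.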
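
As a rough sketch of the decode-then-encode order, assuming the input
really is CP1252 (only a guess from the 0x93 byte) and written for
Python 3, where byte strings and text are separate types. The helper
name escape_to_utf8 and the sample data are made up for illustration
and are not from menews.py:

```python
from xml.sax.saxutils import escape


def escape_to_utf8(raw, source_encoding="cp1252"):
    """Decode raw bytes to text, XML-escape it, then encode as UTF-8.

    source_encoding defaults to cp1252, which is an assumption about
    the data; pass the real encoding if it is known.
    """
    # Decode explicitly instead of relying on an implicit ASCII decode.
    text = raw.decode(source_encoding) if isinstance(raw, bytes) else raw
    # escape() replaces &, < and > in the decoded text.
    return escape(text).encode("utf-8")


# Hypothetical sample: CP1252 curly quotes (0x93, 0x94) around some text.
sample = b"\x93Tom & Jerry\x94"
encoded = escape_to_utf8(sample)
print(encoded.decode("utf-8"))
```

If the data turns out to be in some other encoding, only the
source_encoding argument needs to change.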