not quite 1252

Anton Vredegoor anton.vredegoor at gmail.com
Wed Apr 26 07:54:58 EDT 2006


I'm trying to import text from an open office document (save as .sxw and 
  read the data from content.xml inside the sxw-archive using 
elementtree and such tools).

The encoding that gives me the least problems seems to be cp1252, 
however it's not completely perfect because there are still characters 
in it like \93 or \94. Has anyone handled this before? I'd rather not 
reinvent the wheel and start translating strings 'by hand'.

Anton



More information about the Python-list mailing list