How to get an encoding a value?
aleaxit at yahoo.com
Fri Oct 22 19:10:54 CEST 2004
Diez B. Roggisch <deets.nospaaam at web.de> wrote:
> A common approach to guessing the encoding of said string is to try
> something like this:
> s = <some string with unknown encoding>
> encodings ['ascii', 'latin1', 'utf-8', ....] # list of encodings you expect
> for e in encodings:
> if s == s.decode(e).encode(e):
> except UnicodeError:
Yeah, but it doesn't work. iso-8859-x would break for any value of x;
can't tell this way if it was latin-1, or any of the others...
More information about the Python-list