OT: What encoding is this?

Neil Hodgson nyamatongwe+thunder at gmail.com
Sun Sep 10 01:55:35 CEST 2006


>     http://www.loppen.dk/side.php?navn=getin
> 
> I'm guessing ISO-8859-15, but the page doesn't indicate and it's none of the
> ones available in Safari.

    It decodes to the same text using ISO-8859-1, ISO-8859-15, or 
Windows-1252. More pages without declarations are produced on Windows so 
I'd guess that its Windows-1252. To tell, look for prices in Euros ("€") 
on the site. If there are \x80 characters in front of prices then it is 
Windows-1252, if \xa4 then it is ISO-8859-15. ISO-8859-1 does not have a 
Euro sign. It isn't Mac-Roman as decoding with Mac-Roman produces 
non-alphabetics in unusual places: "H¯jde" rather than "Højde".

    Neil



More information about the Python-list mailing list