Also, it almost always labels "latin-1" encoded files as "latin-2" or "latin-someOtherNumber", and those encodings don't actually decode the data correctly.
Browsers can translate a seemingly untouchable mush of mixed character encodings into UTF-8 (on my Linux box) without a hiccup. I tried to emulate their behaviour for almost a week before I gave up. To be fair, I was a charset newbie at the time, and I guess I still am, though my scraper works properly.
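For anyone fighting the same problem, here is a minimal sketch of one common fallback approach, not what browsers actually do and not necessarily what my scraper does. It assumes the input is line-oriented and that each line is either UTF-8 or a Western single-byte encoding. The function names `decode_best_effort` and `decode_mixed` are made up for this example.

```python
def decode_best_effort(raw: bytes, candidates=("utf-8", "cp1252")) -> str:
    """Try each candidate encoding strictly, in order; fall back to latin-1."""
    for enc in candidates:
        try:
            return raw.decode(enc)
        except UnicodeDecodeError:
            continue
    # latin-1 maps every byte value to a code point, so this cannot fail.
    return raw.decode("latin-1")


def decode_mixed(raw: bytes) -> str:
    """Decode line by line, so a file mixing encodings per line still works."""
    lines = raw.splitlines(keepends=True)
    return "".join(decode_best_effort(line) for line in lines)


if __name__ == "__main__":
    # One UTF-8 line followed by one latin-1 line (assumed sample data).
    sample = "café\n".encode("utf-8") + "naïve\n".encode("latin-1")
    print(decode_mixed(sample))
```

UTF-8 goes first because text in a single-byte encoding rarely happens to be valid UTF-8, so a strict UTF-8 decode that succeeds is a fairly strong signal. The single-byte fallbacks accept nearly anything, which means they can silently produce the wrong characters for, say, latin-2 input.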
Christopherq