Web page special characters encoding

Rami Chowdhury rami.chowdhury at gmail.com
Sat Jul 10 13:07:33 EDT 2010


On Jul 10, 2010, at 09:24 , mattia wrote:

> Hi all, I'm using py3k and the urllib package to download web pages. Can 
> you suggest me a package that can translate reserved characters in html 
> like "è", "ò", "é" in the corresponding correct 
> encoding?

It won't do the whole job for you but you may find the 'name2codepoint' mapping in the 'html.entities' module useful.

HTH,
Rami


More information about the Python-list mailing list