Character encoding

jiang.haiyun at gmail.com jiang.haiyun at gmail.com
Wed Nov 8 15:32:31 CET 2006


Dennis Lee Bieber wrote:
> On 7 Nov 2006 11:34:32 -0800, "mp" <mailpitches at email.com> declaimed the
> following in comp.lang.python:
>
> > I have html document titles with characters like &gt;, &nbsp;, and
> > &#135. How do I sddecode a string with these values in Python?
> >
>
> 	Wouldn't HTMLParser be suited for such activity?
> --
> 	Wulfraed	Dennis Lee Bieber		KD6MOG
> 	wlfraed at ix.netcom.com		wulfraed at bestiaria.com
> 		HTTP://wlfraed.home.netcom.com/
> 	(Bestiaria Support Staff:		web-asst at bestiaria.com)
> 		HTTP://www.bestiaria.com/

Use htmlentitydefs and SGMLParser to re-generate it .




More information about the Python-list mailing list