Easy way to remove HTML entities from an HTML document?

Michael Scarlett bitshadow at yahoo.com
Sun Jul 25 23:47:04 EDT 2004


"Robert Oschler" <no_replies at fake_email_address.invalid> wrote in message news:<X9UMc.12838$QO.3354 at bignews5.bellsouth.net>...
> Is there a module/function to remove all the HTML entities from an HTML
> document (e.g. - &nbsp, &amp, &apos, etc.)?
> 
> If not I'll just write one myself but I figured I'd save myself some time.
> 
> Thanks,


Check out Mark Pilgrim's site: http://diveintopython.org/html_processing/index.html
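
For anyone reading this on a current Python 3, here is a minimal sketch of the two usual meanings of "remove": deleting each entity outright, or decoding it to the character it stands for. It assumes Python 3.4 or later for html.unescape, which did not exist when this was posted. The names ENTITY_RE, strip_entities and decode_entities are made up for illustration and are not taken from the linked chapter.

```python
import re
from html import unescape

# Matches named (&amp;), decimal (&#160;) and hexadecimal (&#xA0;) references.
# The trailing semicolon is required here, so a bare "&" in running text
# (as in "AT&T") is left alone; this is a deliberate simplification.
ENTITY_RE = re.compile(r"&(?:#[0-9]+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);")


def strip_entities(text):
    """Delete every entity reference from text."""
    return ENTITY_RE.sub("", text)


def decode_entities(text):
    """Replace each entity reference with the character it represents."""
    return unescape(text)


if __name__ == "__main__":
    sample = "Tom &amp; Jerry&nbsp;&copy; 2004"
    print(repr(strip_entities(sample)))
    print(repr(decode_entities(sample)))
```

Decoding is usually what you want if the text will be shown to a person, since deleting &amp; or &lt; silently drops characters that were part of the content.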


