Easy way to remove HTML entities from an HTML document?
Michael Scarlett
bitshadow at yahoo.com
Sun Jul 25 23:47:04 EDT 2004
"Robert Oschler" <no_replies at fake_email_address.invalid> wrote in message news:<X9UMc.12838$QO.3354 at bignews5.bellsouth.net>...
> Is there a module/function to remove all the HTML entities from an HTML
> document (e.g. -  , &, &apos, etc.)?
>
> If not I'll just write one myself but I figured I'd save myself some time.
>
> Thanks,
check out mark pilgrims site: http://diveintopython.org/html_processing/index.html
More information about the Python-list
mailing list