Q: how to extract only text from a html ?

Anders M Eriksson anders.eriksson at morateknikutveckling.se
Thu Nov 2 10:42:30 EST 2000


On Tue, 31 Oct 2000 13:50:54 -0600, "Hwanjo Yu" <hwanjoyu at uiuc.edu>
wrote:

>Could someone please tell me how to get rid of all the tags in a html ?
>It seems that the htmllib.HTMLParser is not helpful to do it.

Have a look at Tibs' html2text.py module!

http://starship.python.net/crew/tibs/python/html2text
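
If you would rather not pull in an extra module, a parser subclass that
keeps only the text nodes may be enough. What follows is a minimal sketch,
not Tibs' code: it assumes a current Python 3 and its standard html.parser
module (not the htmllib mentioned above), and the names TextExtractor and
strip_tags are made up for illustration.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect text nodes, ignoring the contents of <script> and <style>."""

    SKIP = {"script", "style"}  # elements whose contents are not visible text

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into characters
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element
        if not self._skip_depth:
            self._chunks.append(data)

    def text(self):
        # Join the pieces and collapse runs of whitespace to single spaces
        return " ".join("".join(self._chunks).split())


def strip_tags(html):
    """Return the text content of an HTML string with all tags removed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return parser.text()


if __name__ == "__main__":
    print(strip_tags("<p>Hello <b>world</b>!</p>"))
```

This throws away all layout (paragraphs, lists, tables), so it only suits
cases where you want the raw words and nothing else.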

// Anders




