Parsing HTML

Richie Hindle richie at
Thu Sep 23 14:28:43 CEST 2004

> BeautifulSoup is perfect for this job:

Um, BeautifulSoup may be perfect, but my script isn't.  It fails with the
Swedish page because it doesn't cope with "<b></b>" appearing in the HTML.
And I don't know whether you'd consider it correct to extract only the bold
text from the entries that have bold text.  But it gives you a place to start.

Richie Hindle
richie at

More information about the Python-list mailing list