HTML Table-of-Content Extraction Script

Fredrik Lundh fredrik at
Tue Nov 28 17:21:44 CET 2006

robert wrote:

> I'm looking for a function which extracts a table of contents
 > of HTML file(s) from <Hx....><a name=...></a>...</Hx>
 > and possibly auto-creates the ancors.
> Maybe something already exists?

that's the kind of stuff you'll write in approximately two minutes using 
BeautifulSoup (or if you prefer the ElementTree API, ElementSoup).

start here:


More information about the Python-list mailing list