Parsing a HTML file for links?

Zigron zigron at jps.net
Wed May 5 00:59:30 EDT 1999


    I've never used the HTMLParser class(or SGML?), or the formatter thing,
et al, and they confuse me a little.

    What I want to do is go through a HTML file, and spit out a
dictionary based on the links, and title of the file. I want a dictionary,
I guess, of like,
{"text-between-anchor-tags":["Destination1","DestinationN.."]}

The dictionary has a list in it because the same text might have more then
one destination... I can't figure out how to get this to work :) Any one
have any ideas?

--Stephen








More information about the Python-list mailing list