HTML Structure Extraction
fredrik at pythonware.com
Wed Dec 8 20:25:47 CET 2004
<dayzman at hotmail.com> wrote:
> I'm going to write a program that extracts the structure of HTML
> documents. The structure would be in the form of a tree, separating the
> tags and grouping the start and end tags. I think I will use
> htmllib.HTMLParser, is it appropriate for my application? If so, I
> believe I will need to keep track of the depth reached.
you mean like:
and a few dozen others?
More information about the Python-list