Will xml.dom work with HTML docs?
Martin v. Löwis
loewis at informatik.hu-berlin.de
Sun Sep 15 14:10:08 EDT 2002
Henrik Motakef <henrik.motakef at web.de> writes:
> > if not is there a good parser module for HTML docs?
>
> See the htmllib module in the standard library.
Alternatively, see PyXML. It parses an HTML document into a DOM
tree. It does so by knowing the HTML DTD, ie. it silently inserts
missing closing tags whereever needed.
Regards,
Martin
More information about the Python-list
mailing list