Will xml.dom work with HTML docs?

Martin v. Löwis loewis at informatik.hu-berlin.de
Sun Sep 15 14:10:08 EDT 2002


Henrik Motakef <henrik.motakef at web.de> writes:

> > if not is there a good parser module for HTML docs?
> 
> See the htmllib module in the standard library.

Alternatively, see PyXML. It parses an HTML document into a DOM
tree. It does so by knowing the HTML DTD, ie. it silently inserts
missing closing tags whereever needed.

Regards,
Martin




More information about the Python-list mailing list