Ian Bicking ianb at
Tue Jul 1 22:49:57 CEST 2003

On Tue, 2003-07-01 at 03:32, Alessio Pace wrote:
> Hi, I need to get a sort of DOM from an HTML page that is declared as XHTML
> but unfortunately is *not* xhtml valid.. If I try to parse it with
> xml.dom.minidom I get error with expat (as I supposed), so I was told to
> try in this way, with a "forgiving" html parser:

I would recommend putting the page through mxTidy, then parsing it.


More information about the Python-list mailing list