trying to parse non valid html documents with HTMLParser

florent florent.newsgroups at
Wed Aug 3 17:43:09 CEST 2005

> AFAIK not with HTMLParser or htmllib. You might try (if you haven't done
> so yet) htmllib and see which parser is more forgiving.

You were right, htmllib's HTMLParser is more permissive. It just
ignores the bad tags!
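
For anyone reading this on a current Python: htmllib was removed in
Python 3, so the following is only a rough sketch of the same forgiving
behaviour using html.parser.HTMLParser from the standard library, which
does not raise on mismatched or unclosed tags. The names TagCollector and
collect, and the sample markup, are made up for illustration and do not
come from this thread.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Record start tags and non-blank text; bad markup is tolerated."""

    def __init__(self):
        super().__init__()
        self.start_tags = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        # Called for every opening tag, whether or not it is ever closed.
        self.start_tags.append(tag)

    def handle_data(self, data):
        # Keep only text chunks that contain something besides whitespace.
        stripped = data.strip()
        if stripped:
            self.text.append(stripped)


def collect(markup):
    """Hypothetical helper: parse markup and return (start_tags, text)."""
    parser = TagCollector()
    parser.feed(markup)
    parser.close()
    return parser.start_tags, parser.text


# Deliberately invalid: unclosed <p>, <b> closed by </i>, no </html>.
broken = "<html><body><p>first<p>second <b>bold</i></body>"
tags, text = collect(broken)
print(tags)
print(text)
```

The stray </i> is simply reported as an end tag and otherwise has no
effect here, since the sketch does not override handle_endtag. If the
page structure matters (nesting, implied closes), this kind of parser
will not repair it; it only avoids failing on it.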
