trying to parse non valid html documents with HTMLParser

Benji York benji at
Wed Aug 3 15:10:39 CEST 2005

florent wrote:
> True, I just want to extract some data from HTML documents. But the 
> problem is the same. The parser loses its position in the string 
> when it encounters a bad tag.

Are you saying that Beautiful Soup can't parse the HTML?  If so, I'm 
sure the author would like an example so he can "fix" it.
Benji York
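
For reference, here is a minimal sketch of pulling data out of malformed markup with the standard library alone. It assumes a current Python 3, where html.parser.HTMLParser tolerates bad tags instead of raising on them. The LinkExtractor class, the extract_links helper, and the sample input are hypothetical names made up for illustration, not code from this thread.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect href values from <a> tags (hypothetical helper class)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


# Hypothetical sample input: unclosed tags, badly nested tags,
# and an unquoted attribute value.
BROKEN_HTML = """
<html><body>
<p>First paragraph with no closing tag
<a href="http://example.com/one">one</a>
<b><i>badly nested</b></i>
<a href=http://example.com/two>two
</body>
"""


def extract_links(html):
    """Feed the whole document to the parser and return the hrefs found."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


if __name__ == "__main__":
    for link in extract_links(BROKEN_HTML):
        print(link)
```

Whether this is enough depends on how broken the input is. A dedicated library such as Beautiful Soup goes further by trying to rebuild a sensible tree from the tag soup, which a plain event-based parser does not attempt.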
