trying to parse non valid html documents with HTMLParser
benji at benjiyork.com
Wed Aug 3 15:10:39 CEST 2005
> True, I just want to extract some data from html documents. But the
> problem is the same. The parser looses the position he was in the string
> when he encounters a bad tag.
Are you saying that Beautiful Soup can't parse the HTML? If so, I'm
sure the author would like an example so he can "fix" it.
More information about the Python-list