Parsing HTML, extracting text and changing attributes.

Jay Loden jloden at
Mon Jun 18 19:16:57 CEST 2007

Stefan Behnel wrote:
> Jay Loden wrote:
>> Someone else mentioned lxml but as I understand it lxml will only work if
>> it's valid XHTML that they're working with.
> No, it was meant as the OP requested. It even has a very good parser from
> broken HTML.

I stand corrected, I missed that whole part of the LXML documentation :-)

More information about the Python-list mailing list