[XML-SIG] Parsing malformed XHTML
Brian Quinlan
brian at sweetapp.com
Sun May 21 10:23:59 CEST 2006
Lars Kellogg-Stedman wrote:
> I need to parse this document into a DOM, make some changes, and then
> spit back out the modified file as (X?)HTML (ideally well-formed). Am
> I going to be able to do this with PyXML? If not, I'd love to hear
> your suggestions for the appropriate tools.
>
> Thanks!
>
> -- Lars
>
You might want to look into Beautiful Soup. Another approach is to pass
the document through HTML Tidy and then process the output.
Cheers,
Brian
More information about the XML-SIG
mailing list