strip not well formed html tags...
marklists at mceahern.com
Tue Oct 22 14:14:08 CEST 2002
> i've seen many post about how to strip html tags from a string,
> some use sgmllib, others regular expressions... i 'd the following
> trouble i would like to strip html (or even xml) tags but i had
> to work on incomplete string so they could be not well formed - what
> should i use ? regexp ? sgmllib with many exceptions handling ?
1. Try mxTidy.
2. Consider providing an example of the data you're talking about.
More information about the Python-list