strip not well formed html tags...

Mark McEahern marklists at mceahern.com
Tue Oct 22 14:14:08 CEST 2002


> i've seen many post about how to strip html tags from a string, 
> some use sgmllib, others regular expressions... i 'd the following
> trouble i would like to strip html (or even xml) tags but i had 
> to work on incomplete string so they could be not well formed - what
> should i use ? regexp ? sgmllib with many exceptions handling ?

1.  Try mxTidy.

2.  Consider providing an example of the data you're talking about.

Cheers,

// mark
-





More information about the Python-list mailing list