On Fri, 2003-05-02 at 13:14, Harald Massa wrote:
> So... is there any replacement for the HTMLParser from the python.lib
> which even can eat Microsoft Word HTML ?

You might want to try running mxTidy on the HTML first:
http://www.lemburg.com/files/python/mxTidy.html

Ian
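
A rough sketch of the "tidy first, then parse" idea, in case it helps. It is written against the Python 3 standard-library `html.parser` module rather than the 2003-era `HTMLParser` module. The names `TextCollector`, `extract_text` and the `clean` parameter are made up for illustration. The cleanup step is passed in as a plain callable because mxTidy's actual call signature is not shown in this thread, so check its documentation before wiring it in.

```python
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Collects the text content of a document and ignores the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Keep only non-empty text runs, with surrounding whitespace removed.
        text = data.strip()
        if text:
            self.chunks.append(text)


def extract_text(html, clean=None):
    """Optionally run a cleanup callable over the markup, then parse it.

    `clean` is a hypothetical hook: any function that takes an HTML
    string and returns a tidied HTML string (for example a thin wrapper
    around a tidy tool) can be supplied here.
    """
    if clean is not None:
        html = clean(html)
    parser = TextCollector()
    parser.feed(html)
    parser.close()
    return parser.chunks
```

Usage would be something like `extract_text(word_html, clean=my_tidy_wrapper)`, where `my_tidy_wrapper` is whatever function you write around the tidy step. Leaving `clean` out parses the markup as-is.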