clean up html document created by Word

Peter Otten __peter__ at web.de
Fri Mar 30 19:29:19 CEST 2007


jd wrote:

> I am looking for python code (working or sample code) that can take an
> html document created by Microsoft Word and clean it up (if you've
> never had to look at a Word-generated html document, consider yourself
> lucky ;-)  Alternatively, if you know of a non-python solution, I'd
> like to hear about it.

The non-python solution:

http://www.w3.org/People/Raggett/tidy/

Peter



More information about the Python-list mailing list