HTML "sanitizer" in Python

Tres Seaver tseaver at palladion.com
Fri Apr 30 08:59:53 EDT 1999


Scott Stirling wrote:
> > 4) If someone helps me out, I think I should be able to use this info. and the tutorial and the Lutz book to loop the process and make the program run until all *.htm files in a folder have been handled once.
> 
> Well, if I understand correctly, the *only* thing you're trying to do
> is to remove some specific strings from a bunch of files. Now if I
> were you, I wouldn't even bother to use Python on something that
> simple; I would just use sed. With sed, you could do:
> 
>   sed 'g/string_to_be_eliminated//g' my_file.html > output.html
> 
> Presto, that's it.  I think that there is a version for GNU sed for
> Windows somewhere out there; do yourself a favour and get it.

Look for the "user tools" under http://sourceware.cygnus.com/cygwin/

-- 
=========================================================
Tres Seaver         tseaver at palladion.com    713-523-6582
Palladion Software  http://www.palladion.com




More information about the Python-list mailing list