HTML filtering

Stuart D. Gathman stuart at bmsi.com
Wed May 1 15:06:36 EDT 2002


I need to filter HTML to remove certain constructs (e.g. <script ...> ...
</script>).  I am trying to use the batteries.  The htmllib module helps
with the parsing, but it seems like a lot of work to create a formatter
that passes everything (except script) through in HTML syntax - espicially
trying to preserve original syntax.  Am I missing something?  Is there
another module I should be using for filtering HTML?  Perhaps one of
those ad stripping filters written in python would provide a usable
example?

-- 
	      Stuart D. Gathman <stuart at bmsi.com>
Business Management Systems Inc.  Phone: 703 591-0911 Fax: 703 591-6154
"Confutatis maledictis, flamis acribus addictis" - background song for
a Microsoft sponsored "Where do you want to go from here?" commercial.



More information about the Python-list mailing list