[Python-Dev] Getting started with GBayes testing

Anthony Baxter <anthony@interlink.com.au>
Fri, 06 Sep 2002 00:28:25 +1000


>>> "Brad Clements" wrote
> This is one way to do it, but I was planning on experimenting with tokenizer
> methods that strip out HTML tags, leaving only the text.

With the set I'm working with, I found I needed to strip out everything
except the src="" and href="" attributes of tags. There's too much goodness
in them for the system to get its teeth into.
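
For illustration only, here is a minimal sketch of that kind of tokenizer,
written against the standard library's html.parser. It is not the code
actually used in the GBayes tests. It assumes "strip out everything but"
means: keep the visible text, throw away the markup, and emit the values of
src and href as extra tokens. The names KEPT_ATTRS, TagStripper and
tokenize_html, and the "name=value" token shape, are made up for this example.

```python
from html.parser import HTMLParser

# Attributes whose values are kept as tokens (assumption: only these two).
KEPT_ATTRS = ("src", "href")


class TagStripper(HTMLParser):
    """Collect text words plus src=/href= values; drop all other markup."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.tokens = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases attribute names before calling us.
        for name, value in attrs:
            if name in KEPT_ATTRS and value:
                self.tokens.append(f"{name}={value}")

    def handle_data(self, data):
        # Plain whitespace split; a real tokenizer would do more here.
        self.tokens.extend(data.split())


def tokenize_html(html):
    """Return the list of tokens for one HTML fragment."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()
    return parser.tokens
```

Fed something like `<a href="http://example.com/x" class="big">now</a>`, this
keeps the link target and the word "now" and discards the class attribute.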


> Tells me (spammer hat on) that I can send a message with a non-spammish text 
> only part, and a spam html part since most "non-techie" email client users 
> automatically display the html version when available, however Tim's 
> implementation will ignore it.

I've actually got a bunch of spam like that. The text/plain part is
something like

**This is a HTML message** 

and nothing else.
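
One way to spot messages of this shape, sketched with the standard email
package: collect every text/plain and text/html part, and flag the message
when an HTML part exists but the plain part is only a few words long. This is
a rough illustration, not part of Tim's implementation. The helper names
text_parts and plain_is_stub and the min_words threshold of 8 are assumptions
invented for the example.

```python
from email import message_from_string


def text_parts(raw):
    """Map 'text/plain' and 'text/html' to lists of decoded part bodies."""
    msg = message_from_string(raw)
    parts = {}
    for part in msg.walk():
        ctype = part.get_content_type()
        if ctype not in ("text/plain", "text/html"):
            continue
        payload = part.get_payload(decode=True) or b""
        charset = part.get_content_charset() or "ascii"
        try:
            text = payload.decode(charset, errors="replace")
        except LookupError:
            # Unknown or bogus charset label: fall back to latin-1.
            text = payload.decode("latin-1")
        parts.setdefault(ctype, []).append(text)
    return parts


def plain_is_stub(raw, min_words=8):
    """True if there is an HTML part and the plain text is nearly empty."""
    parts = text_parts(raw)
    plain_words = " ".join(parts.get("text/plain", [])).split()
    return "text/html" in parts and len(plain_words) < min_words
```

A tokenizer could then fall back to the HTML part, or score both parts,
whenever plain_is_stub() returns true.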


Anthony
-- 
Anthony Baxter     <anthony@interlink.com.au>   
It's never too late to have a happy childhood.