[Spambayes] FW: [Spambayes-checkins] spambayes tokenizer.py,1.57,1.58

Brad Clements bkc@murkworks.com
Thu Oct 31 17:55:52 2002


On 31 Oct 2002 at 11:22, Guido van Rossum wrote:

> > A new mini-phase of body tokenization scours HTML for common virus clues,
> > variations of
> > 
> >     <script    </script
> >     <iframe    </iframe
> >     src=cid:
> >     height=0   width=0
> 
> This gets us awfully close to SA's "precompiled list of clues to look
> for" approach. :-(
> 

I get valid messages with embedded images that contain cid: clues. Hopefully my ham 
tokens will overpower ;-)

Brad Clements,                bkc@murkworks.com   (315)268-1000
http://www.murkworks.com                          (315)268-9812 Fax
AOL-IM: BKClements