[Spambayes] Spam in Images
Tim Stone
tim at aterraform.com
Thu Aug 3 06:22:41 CEST 2006
I've got a bit of experience with the PIL, if I get a few spare minutes,
maybe I'll mess with something. Can someone send me a typical spam
image? I don't seem to get them, ever.
skip at pobox.com wrote:
> Peter> For my spam and non-spam, a good indicator is that I very seldom
> Peter> receive non-spam messages with a .gif image attached (attachments
> Peter> are usually .jpg or various document types). And if a wanted mail
> Peter> has a .gif attachment it has much more text than the usual
> Peter> gibberish in the spam messages (because it is usually just a
> Peter> company logo or similar, and not essential to the message). So if
> Peter> spambayes can score attachment type and text size it may help.
>
>True, but for those people with correspondents who do send them mail with
>image attachments ("Subject: Cute pictures of my new granddaughter"), the
>presence or absence of images may fall around the middle and thus either not
>be used at all, or only provide a negligible bump in one direction or the
>other.
>
>Scoring images can run the entire gamut, from running OCR software to (try
>to) extract the text it contains to ignoring them altogether. Right now we
>note the presence of images by their content-type. My image size patch adds
>another measurement. We probably can develop other measures. My feeble
>attempts to use the open source OCR tool ocrad yielded no useful results.
>Do we want to require PIL and start digging into the images that way?
>
>Skip
>
>_______________________________________________
>SpamBayes at python.org
>http://mail.python.org/mailman/listinfo/spambayes
>Check the FAQ before asking: http://spambayes.sf.net/faq.html
>
>
>
>
More information about the SpamBayes
mailing list