<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML dir=ltr><HEAD>
<META http-equiv=Content-Type content="text/html; charset=us-ascii">
<META content="MSHTML 6.00.3790.2491" name=GENERATOR></HEAD>
<BODY>
<DIV dir=ltr align=left>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=651413618-02102005>Back in April, Tony Meyer posted that he was receiving
a lot of image-based spam.</SPAN></FONT></DIV>
<DIV><FONT face=Arial color=#0000ff size=2><SPAN
class=651413618-02102005></SPAN></FONT> </DIV>
<DIV><FONT face=Arial color=#0000ff size=2><SPAN class=651413618-02102005>I too
am having nothing but trouble with embedded images:</SPAN></FONT></DIV>
<DIV><FONT face=Arial color=#0000ff size=2><SPAN class=651413618-02102005>-
Daily ads for fake Rolex watches</SPAN></FONT></DIV>
<DIV><FONT face=Arial color=#0000ff size=2><SPAN class=651413618-02102005>-
Daily stock tips</SPAN></FONT></DIV>
<DIV><SPAN class=651413618-02102005></SPAN><FONT face=Arial color=#0000ff
size=2><SPAN class=651413618-02102005>- TONS of drugs for
sale.</SPAN></FONT></DIV>
<DIV><FONT face=Arial color=#0000ff size=2><SPAN
class=651413618-02102005></SPAN></FONT> </DIV>
<DIV><SPAN class=651413618-02102005><FONT face=Arial><FONT color=#0000ff><FONT
size=2>This style of Spam contains an image at the top, followed by a bunch
of totally unrelated text that has been copied from some kind of random
composition. I have very large Spam &amp; Ham folders that I've
successfully trained SpamBayes with. It's only these image-based adverts
that sneak by EVERY DAY. <BR><BR><SPAN
class=721213019-02102005><STRONG>My SpamBayes catches nearly ALL of these when
any get this far...</STRONG></SPAN></FONT></FONT></FONT></SPAN></DIV>
<DIV><SPAN class=651413618-02102005><FONT face=Arial><FONT color=#0000ff><FONT
size=2><SPAN
class=721213019-02102005></SPAN></FONT></FONT></FONT></SPAN> </DIV>
<DIV><SPAN class=651413618-02102005><FONT size=+0><FONT size=+0><SPAN
class=721213019-02102005></SPAN><FONT face=Arial><FONT color=#0000ff><FONT
size=2> Something really needs to be done about this type of Spam within
SpamBayes. Are any other Spam engines able to handle this stuff, by
scanning the image for text, or something?<BR><BR><SPAN
class=721213019-02102005><STRONG>Sure, there are others (as well as SpamBayes, if
you just keep training on EVERY ONE of them), but most of the others are either
commercial (i.e., cost money) OR they run on the Server (SpamAssassin,
greylistd, and other filters.)<BR><BR>There has been talk about filters which
would explicitly do OCR or some other type of image content detection but I
don't (personally) know of any that are working/available/effective right
now.<BR><BR>Such would also likely be "resource (CPU)
intensive".<BR><BR>FWIW, greylisting on the server knocks down practically all
of this junk and SpamAssassin catches the rest.<BR><BR>The VERY occasional item
that slips through our server is caught by SpamBayes. (Defense in depth is
our key to ZERO spam -- with practically everything REJECTED, not bounced, at
the server during SMTP connect
time.)</STRONG></SPAN></FONT></FONT></FONT></FONT></FONT></SPAN></DIV>
<DIV><SPAN class=651413618-02102005><FONT size=+0><FONT size=+0><FONT
face=Arial><FONT color=#0000ff><FONT size=2><SPAN
class=721213019-02102005><STRONG></STRONG></SPAN></FONT></FONT></FONT></FONT></FONT></SPAN> </DIV>
<DIV><SPAN class=651413618-02102005><FONT size=+0><FONT size=+0><FONT
face=Arial><FONT color=#0000ff><FONT size=2><SPAN
class=721213019-02102005><STRONG>And some of us DO WISH to get graphical email
-- pictures of my grandkid(s) frequently arrive this
way.</STRONG></SPAN></FONT></FONT></FONT></FONT></FONT></SPAN></DIV></DIV><!-- Converted from text/plain format -->
<P><FONT size=2><SPAN class=721213019-02102005>--<BR></SPAN></FONT><FONT
size=2>Herb Martin</FONT></P>
<P><FONT size=2></FONT> </P><FONT size=2></FONT><FONT size=2></FONT><BR>
<BLOCKQUOTE dir=ltr
style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #0000ff 2px solid; MARGIN-RIGHT: 0px">
<DIV class=OutlookMessageHeader lang=en-us dir=ltr align=left>
<HR tabIndex=-1>
<FONT face=Tahoma size=2><B>From:</B> spambayes-bounces@python.org
[mailto:spambayes-bounces@python.org] <B>On Behalf Of
</B>FreeMJ@hotpop.com<BR><B>Sent:</B> Sunday, October 02, 2005 1:53
PM<BR><B>To:</B> spambayes@python.org<BR><B>Subject:</B> [Spambayes] SpamBayes
to Handle Embedded Images<BR></FONT><BR></DIV>
<DIV></DIV>
<DIV><FONT face=Arial size=2><SPAN
class=651413618-02102005></SPAN></FONT> </DIV></BLOCKQUOTE></BODY></HTML>