[Spambayes] Can we give recovered non-spam more priority ?
Arnold, Paul
parnold at CuraGen.com
Fri Jun 27 14:09:59 EDT 2003
Tim,
As you receive more legitimate email, that score of 67% will drop. By
default, 67% does not get moved to the Spam folder.
I have my threshold lowered to 80% and I will say, that I have received
spam that gets 78%. However, I mark it as spam and the next time a
similar message arrives, it is rated greater than 78%.
Your experience will likely improve as your (email) sample grows.
Paul
-----Original Message-----
From: Murnaghan, Tim [mailto:murnt1 at bp.com]
Sent: Friday, June 27, 2003 1:00 PM
To: SpamBayes at python.org
Subject: [Spambayes] Can we give recovered non-spam more priority ?
My setup involves my work e-mail address having leaked out to spam. I
get around 10 spams a day. The population of emails I get is around 95%
internal and a few external. As the internal stuff comes directly on
exchange that really makes a difference in headers.
When I trained SpamBayes on my recent spam and my (100% squeaky clean)
Inbox it decided that everything external containing a URL is spam. That
doesn't work for the 5% of external mails and even after I recover them
it's still rating them around 67% spam. (example attached - ironically
the email is from SpamCop which is reasonably respectable).
Alternatively can we get it to be less agressive on the fact that it's
got external headers ? The scoring from my name and having a return path
seems ridiculously high.
Regards,
Tim.
Spam Score: 0.567625
word spamprob #ham #spam
'*H*' 0.0372864 - -
'*S*' 0.172536 - -
'subject:' 0.0897017 687 2
'support' 0.0942205 355 1
'2003' 0.122869 920 4
'skip:( 10' 0.138899 417 2
'to:' 0.151907 718 4
'links' 0.165308 28 0
'produced' 0.198531 21 0
'url:id' 0.210626 19 0
'finish' 0.22429 17 0
'from:' 0.237996 704 7
'"murnaghan,' 0.239849 15 0
'tim"' 0.239849 15 0
'triggered' 0.257729 13 0
'microsoft' 0.271698 94 1
'(bst)' 0.278489 11 0
'jun' 0.278489 11 0
'url:shtml' 0.278489 11 0
'from:addr:devnull.spamcop.net' 0.302886 9 0
'from:addr:nobody' 0.302886 9 0
'message-id:@msgid.spamcop.net' 0.302886 9 0
'service!' 0.302886 9 0
'spamcop' 0.302886 9 0
'subject:SpamCop' 0.302886 9 0
'subject:has' 0.302886 9 0
'url:k3zpfy7psqjxbo7q' 0.302886 9 0
'url:upgradeaccount' 0.302886 9 0
'"spamcop' 0.316761 8 0
'(e-mail)"' 0.316761 8 0
'from:name:spamcop autoresponder' 0.316761 8 0
'headers:' 0.316761 8 0
'it:' 0.316761 8 0
'reporting:' 0.316761 8 0
'subject:accepted' 0.316761 8 0
'subject:email' 0.316761 8 0
'subject:processing' 0.316761 8 0
'url:sc' 0.316761 8 0
'use' 0.336996 309 5
'exchange' 0.343614 65 1
'fri,' 0.34871 6 0
'skip:a 30' 0.34871 6 0
'skip:t 10' 0.352676 175 3
'skip:u 20' 0.367229 5 0
'v6.0.6375.0' 0.367229 5 0
'x-mimeole:' 0.367229 5 0
'which' 0.375765 413 8
'the' 0.383649 1436 29
'x-mailer:none' 0.384612 1676 34
'help' 0.609138 175 9
'skip:e 10' 0.611739 95 5
'date:' 0.621629 16 1
'skip:1 10' 0.621629 16 1
'now' 0.62478 311 17
'proto:http' 0.637694 224 13
'like' 0.653221 372 23
'your' 0.660231 566 36
'however,' 0.667701 73 5
'+0100' 0.67757 11 1
'to:2**0' 0.700516 561 43
'invoked' 0.702871 9 1
'skip:[ 10' 0.702871 9 1
'smtp' 0.702871 9 1
'e-mail' 0.704715 99 8
'(interscan' 0.716243 8 1
'(qmail' 0.716243 8 1
'-0000' 0.716243 8 1
'message-id:' 0.716243 8 1
'network);' 0.716243 8 1
'nt);' 0.716243 8 1
'smtp;' 0.716243 8 1
'text/plain;' 0.716243 8 1
'viruswall' 0.716243 8 1
'ready' 0.726363 42 4
'service' 0.75078 128 13
'email' 0.776099 199 23
'pay' 0.78333 29 4
'url:spamcop' 0.811199 9 2
'received:' 0.820938 8 2
'skip:x 10' 0.820938 8 2
'url:net' 0.828304 20 4
'skip:x 20' 0.851611 5 2
'spam' 0.86127 14 4
'spam.' 0.861642 9 3
'to:addr:murnt1' 0.881224 110 28
'header:From:1' 0.896985 188 55
'header:Date:1' 0.898631 188 56
'free.' 0.913915 14 7
'to:addr:bp.com' 0.913916 124 45
'header:Message-ID:1' 0.915621 124 46
'header:MIME-Version:1' 0.915722 121 45
'header:Return-Path:1' 0.91782 68 27
Message Stream:
X-MS-Mail-Gibberish: Microsoft Mail Internet Headers Version 2.0
Received: from BP1GHOEX003.bp1.ad.bp.com ([149.179.248.18]) by
bp1gheex003.bp1.ad.bp.com with Microsoft SMTPSVC(5.0.2195.5329);
Fri, 27 Jun 2003 10:43:43 +0100
Received: from amhouav001.bp.com ([149.179.131.241]) by amhoux3.bp.com
with
SMTP (Microsoft Exchange Internet Mail Service Version
5.5.2653.13) id NKGR7YRB; Fri, 27 Jun 2003 04:31:43 -0500
Received: from 65.198.138.126 by amhouav001.bp.com (InterScan E-Mail
VirusWall
NT); Fri, 27 Jun 2003 04:29:07 -0500
Received: from shagrat.julianhaight.com (shagrat.julianhaight.com
[216.127.43.86]) by amhousmtp01.bp.com
(Switch-3.0.4/Switch-3.0.0) with SMTP id h5R9a6Ho000480 for
<murnt1 at bp.com>; Fri, 27 Jun 2003 04:36:07 -0500 (CDT)
Received: (qmail 31410 invoked from network); 27 Jun 2003 08:38:29 -0000
Received: from saruman.julianhaight.com (HELO spamcop.net)
(216.127.43.87) by
shagrat.julianhaight.com with SMTP; 27 Jun 2003 08:38:29 -0000
content-class: urn:content-classes:message
MIME-Version: 1.0
Subject: SpamCop has accepted 1 email for processing
Content-Type: multipart/mixed;
boundary="----_=_NextPart_001_01C33C8E.EB776980"
Date: Fri, 27 Jun 2003 03:38:54 -0500
Message-ID: <spamid122422379 at msgid.spamcop.net>
X-MS-Has-Attach:
X-MS-TNEF-Correlator: <spamid122422379 at msgid.spamcop.net>
X-MimeOLE: Produced By Microsoft Exchange V6.0.6375.0
Thread-Topic: SpamCop has accepted 1 email for processing
Thread-Index: AcM8juuNBPpXZah+Ede8rgBQi9Y6Rw==
From: "SpamCop AutoResponder" <nobody at devnull.spamcop.net>
To: <murnt1 at bp.com>
Return-Path: nobody at devnull.spamcop.net
X-OriginalArrivalTime: 27 Jun 2003 09:43:43.0555 (UTC)
FILETIME=[98F36130:01C33C90]
PLEASE HELP SUPPORT THIS SERVICE!
SpamCop is free. However, if you like the service please pay for it:
http://spamcop.net/upgradeaccount.shtml?K3zPFY7PSqjxbO7q
SpamCop is now ready to process your spam.
Use links to finish spam reporting:
http://spamcop.net/sc?id=z122422379z3a05545ef058638f062ab78444977786z
The email which triggered this auto-response had the following headers:
Received: (qmail 32546 invoked from network); 27 Jun 2003 08:38:23
-0000
Received: from euhemsmtp01.bp.com (62.189.94.209)
by saruman.julianhaight.com with SMTP; 27 Jun 2003 08:38:23 -0000
Received: from BP1HEMAV001.bp1.ad.bp.com (inetgate21.bp.com
[62.189.94.193])
by euhemsmtp01.bp.com (Switch-3.0.4/Switch-3.0.0) with SMTP id
h5R8hhZ3029230
for <submit.K3zPFY7PSqjxbO7q at spam.spamcop.net>; Fri, 27 Jun 2003
09:43:43 +0100 (BST)
Received: from 149.182.114.119 by BP1HEMAV001.bp1.ad.bp.com (InterScan
E-Mail VirusWall NT); Fri, 27 Jun 2003 09:37:58 +0100
content-class: urn:content-classes:message
Subject:
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Date: Fri, 27 Jun 2003 09:37:02 +0100
Message-ID: <2FE5DE0B8790D411832700508BAF485906F17AF5 at eumorx5.bp.com>
X-MimeOLE: Produced By Microsoft Exchange V6.0.6375.0
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Index: AcM8h2m8K3AaOuxZQ0WTX659Xyx/vg==
From: "Murnaghan, Tim" <murnt1 at bp.com>
To: "SpamCop (E-mail)" <submit.K3zPFY7PSqjxbO7q at spam.spamcop.net>
Message Tokens:
135 unique tokens
'"murnaghan,'
'"spamcop'
'(bst)'
'(e-mail)"'
'(interscan'
'(qmail'
'+0100'
'-0000'
'08:38:23'
'09:37:02'
'09:37:58'
'09:43:43'
'1.0'
'2003'
'32546'
'cc:none'
'content-type:text/plain'
'date:'
'e-mail'
'email'
'exchange'
'finish'
'following'
'for'
'free.'
'fri,'
'from'
'from:'
'from:addr:devnull.spamcop.net'
'from:addr:nobody'
'from:name:spamcop autoresponder'
'had'
'header:Date:1'
'header:From:1'
'header:MIME-Version:1'
'header:Message-ID:1'
'header:Received:6'
'header:Return-Path:1'
'header:Subject:1'
'header:To:1'
'headers:'
'help'
'however,'
'invoked'
'it:'
'jun'
'like'
'links'
'message-id:'
'message-id:@msgid.spamcop.net'
'microsoft'
'network);'
'noheader:abuse-reports-to'
'noheader:errors-to'
'noheader:importance'
'noheader:in-reply-to'
'noheader:organization'
'noheader:reply-to'
'noheader:user-agent'
'noheader:x-abuse-info'
'noheader:x-complaints-to'
'noheader:x-face'
'now'
'nt);'
'pay'
'please'
'process'
'produced'
'proto:http'
'ready'
'received:'
'reply-to:none'
'reporting:'
'sender:none'
'service'
'service!'
'skip:( 10'
'skip:( 20'
'skip:1 10'
'skip:[ 10'
'skip:a 10'
'skip:a 30'
'skip:b 20'
'skip:c 10'
'skip:c 20'
'skip:e 10'
'skip:h 10'
'skip:m 10'
'skip:q 10'
'skip:s 20'
'skip:t 10'
'skip:u 20'
'skip:x 10'
'skip:x 20'
'smtp'
'smtp;'
'spam'
'spam.'
'spamcop'
'subject:'
'subject: '
'subject:SpamCop'
'subject:accepted'
'subject:email'
'subject:for'
'subject:has'
'subject:processing'
'support'
'text/plain;'
'the'
'this'
'tim"'
'to:'
'to:2**0'
'to:addr:bp.com'
'to:addr:murnt1'
'to:no real name:2**0'
'triggered'
'url:id'
'url:k3zpfy7psqjxbo7q'
'url:net'
'url:sc'
'url:shtml'
'url:spamcop'
'url:upgradeaccount' 'url:z122422379z3a05545ef058638f062ab78444977786z'
'use'
'v6.0.6375.0'
'viruswall'
'which'
'with'
'x-mailer:none'
'x-mimeole:'
'you'
'your'
_______________________________________________
Spambayes mailing list
Spambayes at python.org http://mail.python.org/mailman/listinfo/spambayes
LEGAL NOTICE:
Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. Access to this e-mail by anyone else is unauthorized. If you are not an addressee, any disclosure or copying of the contents or any action taken (or not taken) in reliance on it is unauthorized and may be unlawful. If you are not an addressee, please inform the sender immediately.
More information about the Spambayes
mailing list