[Spambayes] Can we give recovered non-spam more priority ?
Murnaghan, Tim
murnt1 at bp.com
Fri Jun 27 18:59:44 EDT 2003
My setup involves my work e-mail address having leaked out to spam.
I get around 10 spams a day. The population of emails I get is around 95% internal and a few external. As the internal stuff comes directly on exchange that really makes a difference in headers.
When I trained SpamBayes on my recent spam and my (100% squeaky clean) Inbox it decided that everything external containing a URL is spam. That doesn't work for the 5% of external mails and even after I recover them it's still rating them around 67% spam. (example attached - ironically the email is from SpamCop which is reasonably respectable).
Alternatively can we get it to be less agressive on the fact that it's got external headers ?
The scoring from my name and having a return path seems ridiculously high.
Regards,
Tim.
Spam Score: 0.567625
word spamprob #ham #spam
'*H*' 0.0372864 - -
'*S*' 0.172536 - -
'subject:' 0.0897017 687 2
'support' 0.0942205 355 1
'2003' 0.122869 920 4
'skip:( 10' 0.138899 417 2
'to:' 0.151907 718 4
'links' 0.165308 28 0
'produced' 0.198531 21 0
'url:id' 0.210626 19 0
'finish' 0.22429 17 0
'from:' 0.237996 704 7
'"murnaghan,' 0.239849 15 0
'tim"' 0.239849 15 0
'triggered' 0.257729 13 0
'microsoft' 0.271698 94 1
'(bst)' 0.278489 11 0
'jun' 0.278489 11 0
'url:shtml' 0.278489 11 0
'from:addr:devnull.spamcop.net' 0.302886 9 0
'from:addr:nobody' 0.302886 9 0
'message-id:@msgid.spamcop.net' 0.302886 9 0
'service!' 0.302886 9 0
'spamcop' 0.302886 9 0
'subject:SpamCop' 0.302886 9 0
'subject:has' 0.302886 9 0
'url:k3zpfy7psqjxbo7q' 0.302886 9 0
'url:upgradeaccount' 0.302886 9 0
'"spamcop' 0.316761 8 0
'(e-mail)"' 0.316761 8 0
'from:name:spamcop autoresponder' 0.316761 8 0
'headers:' 0.316761 8 0
'it:' 0.316761 8 0
'reporting:' 0.316761 8 0
'subject:accepted' 0.316761 8 0
'subject:email' 0.316761 8 0
'subject:processing' 0.316761 8 0
'url:sc' 0.316761 8 0
'use' 0.336996 309 5
'exchange' 0.343614 65 1
'fri,' 0.34871 6 0
'skip:a 30' 0.34871 6 0
'skip:t 10' 0.352676 175 3
'skip:u 20' 0.367229 5 0
'v6.0.6375.0' 0.367229 5 0
'x-mimeole:' 0.367229 5 0
'which' 0.375765 413 8
'the' 0.383649 1436 29
'x-mailer:none' 0.384612 1676 34
'help' 0.609138 175 9
'skip:e 10' 0.611739 95 5
'date:' 0.621629 16 1
'skip:1 10' 0.621629 16 1
'now' 0.62478 311 17
'proto:http' 0.637694 224 13
'like' 0.653221 372 23
'your' 0.660231 566 36
'however,' 0.667701 73 5
'+0100' 0.67757 11 1
'to:2**0' 0.700516 561 43
'invoked' 0.702871 9 1
'skip:[ 10' 0.702871 9 1
'smtp' 0.702871 9 1
'e-mail' 0.704715 99 8
'(interscan' 0.716243 8 1
'(qmail' 0.716243 8 1
'-0000' 0.716243 8 1
'message-id:' 0.716243 8 1
'network);' 0.716243 8 1
'nt);' 0.716243 8 1
'smtp;' 0.716243 8 1
'text/plain;' 0.716243 8 1
'viruswall' 0.716243 8 1
'ready' 0.726363 42 4
'service' 0.75078 128 13
'email' 0.776099 199 23
'pay' 0.78333 29 4
'url:spamcop' 0.811199 9 2
'received:' 0.820938 8 2
'skip:x 10' 0.820938 8 2
'url:net' 0.828304 20 4
'skip:x 20' 0.851611 5 2
'spam' 0.86127 14 4
'spam.' 0.861642 9 3
'to:addr:murnt1' 0.881224 110 28
'header:From:1' 0.896985 188 55
'header:Date:1' 0.898631 188 56
'free.' 0.913915 14 7
'to:addr:bp.com' 0.913916 124 45
'header:Message-ID:1' 0.915621 124 46
'header:MIME-Version:1' 0.915722 121 45
'header:Return-Path:1' 0.91782 68 27
Message Stream:
X-MS-Mail-Gibberish: Microsoft Mail Internet Headers Version 2.0
Received: from BP1GHOEX003.bp1.ad.bp.com ([149.179.248.18]) by
bp1gheex003.bp1.ad.bp.com with Microsoft SMTPSVC(5.0.2195.5329);
Fri, 27 Jun 2003 10:43:43 +0100
Received: from amhouav001.bp.com ([149.179.131.241]) by amhoux3.bp.com with
SMTP (Microsoft Exchange Internet Mail Service Version
5.5.2653.13) id NKGR7YRB; Fri, 27 Jun 2003 04:31:43 -0500
Received: from 65.198.138.126 by amhouav001.bp.com (InterScan E-Mail VirusWall
NT); Fri, 27 Jun 2003 04:29:07 -0500
Received: from shagrat.julianhaight.com (shagrat.julianhaight.com
[216.127.43.86]) by amhousmtp01.bp.com
(Switch-3.0.4/Switch-3.0.0) with SMTP id h5R9a6Ho000480 for
<murnt1 at bp.com>; Fri, 27 Jun 2003 04:36:07 -0500 (CDT)
Received: (qmail 31410 invoked from network); 27 Jun 2003 08:38:29 -0000
Received: from saruman.julianhaight.com (HELO spamcop.net) (216.127.43.87) by
shagrat.julianhaight.com with SMTP; 27 Jun 2003 08:38:29 -0000
content-class: urn:content-classes:message
MIME-Version: 1.0
Subject: SpamCop has accepted 1 email for processing
Content-Type: multipart/mixed; boundary="----_=_NextPart_001_01C33C8E.EB776980"
Date: Fri, 27 Jun 2003 03:38:54 -0500
Message-ID: <spamid122422379 at msgid.spamcop.net>
X-MS-Has-Attach:
X-MS-TNEF-Correlator: <spamid122422379 at msgid.spamcop.net>
X-MimeOLE: Produced By Microsoft Exchange V6.0.6375.0
Thread-Topic: SpamCop has accepted 1 email for processing
Thread-Index: AcM8juuNBPpXZah+Ede8rgBQi9Y6Rw==
From: "SpamCop AutoResponder" <nobody at devnull.spamcop.net>
To: <murnt1 at bp.com>
Return-Path: nobody at devnull.spamcop.net
X-OriginalArrivalTime: 27 Jun 2003 09:43:43.0555 (UTC)
FILETIME=[98F36130:01C33C90]
PLEASE HELP SUPPORT THIS SERVICE!
SpamCop is free. However, if you like the service please pay for it:
http://spamcop.net/upgradeaccount.shtml?K3zPFY7PSqjxbO7q
SpamCop is now ready to process your spam.
Use links to finish spam reporting:
http://spamcop.net/sc?id=z122422379z3a05545ef058638f062ab78444977786z
The email which triggered this auto-response had the following headers:
Received: (qmail 32546 invoked from network); 27 Jun 2003 08:38:23 -0000
Received: from euhemsmtp01.bp.com (62.189.94.209)
by saruman.julianhaight.com with SMTP; 27 Jun 2003 08:38:23 -0000
Received: from BP1HEMAV001.bp1.ad.bp.com (inetgate21.bp.com [62.189.94.193])
by euhemsmtp01.bp.com (Switch-3.0.4/Switch-3.0.0) with SMTP id h5R8hhZ3029230
for <submit.K3zPFY7PSqjxbO7q at spam.spamcop.net>; Fri, 27 Jun 2003 09:43:43 +0100 (BST)
Received: from 149.182.114.119 by BP1HEMAV001.bp1.ad.bp.com (InterScan E-Mail VirusWall NT); Fri, 27 Jun 2003 09:37:58 +0100
content-class: urn:content-classes:message
Subject:
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Date: Fri, 27 Jun 2003 09:37:02 +0100
Message-ID: <2FE5DE0B8790D411832700508BAF485906F17AF5 at eumorx5.bp.com>
X-MimeOLE: Produced By Microsoft Exchange V6.0.6375.0
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Index: AcM8h2m8K3AaOuxZQ0WTX659Xyx/vg==
From: "Murnaghan, Tim" <murnt1 at bp.com>
To: "SpamCop (E-mail)" <submit.K3zPFY7PSqjxbO7q at spam.spamcop.net>
Message Tokens:
135 unique tokens
'"murnaghan,'
'"spamcop'
'(bst)'
'(e-mail)"'
'(interscan'
'(qmail'
'+0100'
'-0000'
'08:38:23'
'09:37:02'
'09:37:58'
'09:43:43'
'1.0'
'2003'
'32546'
'cc:none'
'content-type:text/plain'
'date:'
'e-mail'
'email'
'exchange'
'finish'
'following'
'for'
'free.'
'fri,'
'from'
'from:'
'from:addr:devnull.spamcop.net'
'from:addr:nobody'
'from:name:spamcop autoresponder'
'had'
'header:Date:1'
'header:From:1'
'header:MIME-Version:1'
'header:Message-ID:1'
'header:Received:6'
'header:Return-Path:1'
'header:Subject:1'
'header:To:1'
'headers:'
'help'
'however,'
'invoked'
'it:'
'jun'
'like'
'links'
'message-id:'
'message-id:@msgid.spamcop.net'
'microsoft'
'network);'
'noheader:abuse-reports-to'
'noheader:errors-to'
'noheader:importance'
'noheader:in-reply-to'
'noheader:organization'
'noheader:reply-to'
'noheader:user-agent'
'noheader:x-abuse-info'
'noheader:x-complaints-to'
'noheader:x-face'
'now'
'nt);'
'pay'
'please'
'process'
'produced'
'proto:http'
'ready'
'received:'
'reply-to:none'
'reporting:'
'sender:none'
'service'
'service!'
'skip:( 10'
'skip:( 20'
'skip:1 10'
'skip:[ 10'
'skip:a 10'
'skip:a 30'
'skip:b 20'
'skip:c 10'
'skip:c 20'
'skip:e 10'
'skip:h 10'
'skip:m 10'
'skip:q 10'
'skip:s 20'
'skip:t 10'
'skip:u 20'
'skip:x 10'
'skip:x 20'
'smtp'
'smtp;'
'spam'
'spam.'
'spamcop'
'subject:'
'subject: '
'subject:SpamCop'
'subject:accepted'
'subject:email'
'subject:for'
'subject:has'
'subject:processing'
'support'
'text/plain;'
'the'
'this'
'tim"'
'to:'
'to:2**0'
'to:addr:bp.com'
'to:addr:murnt1'
'to:no real name:2**0'
'triggered'
'url:id'
'url:k3zpfy7psqjxbo7q'
'url:net'
'url:sc'
'url:shtml'
'url:spamcop'
'url:upgradeaccount'
'url:z122422379z3a05545ef058638f062ab78444977786z'
'use'
'v6.0.6375.0'
'viruswall'
'which'
'with'
'x-mailer:none'
'x-mimeole:'
'you'
'your'
More information about the Spambayes
mailing list