[Spambayes] Can we give recovered non-spam more priority ?

Arnold, Paul parnold at CuraGen.com
Fri Jun 27 14:09:59 EDT 2003


Tim,
As you receive more legitimate email, that score of 67% will drop. By
default, 67% does not get moved to the Spam folder.

I have my threshold lowered to 80% and I will say, that I have received
spam that gets 78%. However, I mark it as spam and the next time a
similar message arrives, it is rated greater than 78%.

Your experience will likely improve as your (email) sample grows.

Paul

-----Original Message-----
From: Murnaghan, Tim [mailto:murnt1 at bp.com] 
Sent: Friday, June 27, 2003 1:00 PM
To: SpamBayes at python.org
Subject: [Spambayes] Can we give recovered non-spam more priority ?


My setup involves my work e-mail address having leaked out to spam. I
get around 10 spams a day. The population of emails I get is around 95%
internal and a few external. As the internal stuff comes directly on
exchange that really makes a difference in headers.

When I trained SpamBayes on my recent spam and my (100% squeaky clean)
Inbox it decided that everything external containing a URL is spam. That
doesn't work for the 5% of external mails and even after I recover them
it's still rating them around 67% spam. (example attached - ironically
the email is from SpamCop which is reasonably respectable).

Alternatively can we get it to be less agressive on the fact that it's
got external headers ? The scoring from my name and having a return path
seems ridiculously high.

Regards,

Tim.

Spam Score: 0.567625
word                                spamprob         #ham  #spam
'*H*'                               0.0372864           -      -
'*S*'                               0.172536            -      -
'subject:'                          0.0897017         687      2
'support'                           0.0942205         355      1
'2003'                              0.122869          920      4
'skip:( 10'                         0.138899          417      2
'to:'                               0.151907          718      4
'links'                             0.165308           28      0
'produced'                          0.198531           21      0
'url:id'                            0.210626           19      0
'finish'                            0.22429            17      0
'from:'                             0.237996          704      7
'"murnaghan,'                       0.239849           15      0
'tim"'                              0.239849           15      0
'triggered'                         0.257729           13      0
'microsoft'                         0.271698           94      1
'(bst)'                             0.278489           11      0
'jun'                               0.278489           11      0
'url:shtml'                         0.278489           11      0
'from:addr:devnull.spamcop.net'     0.302886            9      0
'from:addr:nobody'                  0.302886            9      0
'message-id:@msgid.spamcop.net'     0.302886            9      0
'service!'                          0.302886            9      0
'spamcop'                           0.302886            9      0
'subject:SpamCop'                   0.302886            9      0
'subject:has'                       0.302886            9      0
'url:k3zpfy7psqjxbo7q'              0.302886            9      0
'url:upgradeaccount'                0.302886            9      0
'"spamcop'                          0.316761            8      0
'(e-mail)"'                         0.316761            8      0
'from:name:spamcop autoresponder'   0.316761            8      0
'headers:'                          0.316761            8      0
'it:'                               0.316761            8      0
'reporting:'                        0.316761            8      0
'subject:accepted'                  0.316761            8      0
'subject:email'                     0.316761            8      0
'subject:processing'                0.316761            8      0
'url:sc'                            0.316761            8      0
'use'                               0.336996          309      5
'exchange'                          0.343614           65      1
'fri,'                              0.34871             6      0
'skip:a 30'                         0.34871             6      0
'skip:t 10'                         0.352676          175      3
'skip:u 20'                         0.367229            5      0
'v6.0.6375.0'                       0.367229            5      0
'x-mimeole:'                        0.367229            5      0
'which'                             0.375765          413      8
'the'                               0.383649         1436     29
'x-mailer:none'                     0.384612         1676     34
'help'                              0.609138          175      9
'skip:e 10'                         0.611739           95      5
'date:'                             0.621629           16      1
'skip:1 10'                         0.621629           16      1
'now'                               0.62478           311     17
'proto:http'                        0.637694          224     13
'like'                              0.653221          372     23
'your'                              0.660231          566     36
'however,'                          0.667701           73      5
'+0100'                             0.67757            11      1
'to:2**0'                           0.700516          561     43
'invoked'                           0.702871            9      1
'skip:[ 10'                         0.702871            9      1
'smtp'                              0.702871            9      1
'e-mail'                            0.704715           99      8
'(interscan'                        0.716243            8      1
'(qmail'                            0.716243            8      1
'-0000'                             0.716243            8      1
'message-id:'                       0.716243            8      1
'network);'                         0.716243            8      1
'nt);'                              0.716243            8      1
'smtp;'                             0.716243            8      1
'text/plain;'                       0.716243            8      1
'viruswall'                         0.716243            8      1
'ready'                             0.726363           42      4
'service'                           0.75078           128     13
'email'                             0.776099          199     23
'pay'                               0.78333            29      4
'url:spamcop'                       0.811199            9      2
'received:'                         0.820938            8      2
'skip:x 10'                         0.820938            8      2
'url:net'                           0.828304           20      4
'skip:x 20'                         0.851611            5      2
'spam'                              0.86127            14      4
'spam.'                             0.861642            9      3
'to:addr:murnt1'                    0.881224          110     28
'header:From:1'                     0.896985          188     55
'header:Date:1'                     0.898631          188     56
'free.'                             0.913915           14      7
'to:addr:bp.com'                    0.913916          124     45
'header:Message-ID:1'               0.915621          124     46
'header:MIME-Version:1'             0.915722          121     45
'header:Return-Path:1'              0.91782            68     27
Message Stream:
X-MS-Mail-Gibberish: Microsoft Mail Internet Headers Version 2.0
Received: from BP1GHOEX003.bp1.ad.bp.com ([149.179.248.18]) by
	bp1gheex003.bp1.ad.bp.com with Microsoft SMTPSVC(5.0.2195.5329);

	Fri, 27 Jun 2003 10:43:43 +0100
Received: from amhouav001.bp.com ([149.179.131.241]) by amhoux3.bp.com
with
	SMTP (Microsoft Exchange Internet Mail Service Version
	5.5.2653.13) id NKGR7YRB; Fri, 27 Jun 2003 04:31:43 -0500
Received: from 65.198.138.126 by amhouav001.bp.com (InterScan E-Mail
VirusWall
	NT); Fri, 27 Jun 2003 04:29:07 -0500
Received: from shagrat.julianhaight.com (shagrat.julianhaight.com
	[216.127.43.86]) by amhousmtp01.bp.com
	(Switch-3.0.4/Switch-3.0.0) with SMTP id h5R9a6Ho000480 for
	<murnt1 at bp.com>; Fri, 27 Jun 2003 04:36:07 -0500 (CDT)
Received: (qmail 31410 invoked from network); 27 Jun 2003 08:38:29 -0000
Received: from saruman.julianhaight.com (HELO spamcop.net)
(216.127.43.87) by
	shagrat.julianhaight.com with SMTP; 27 Jun 2003 08:38:29 -0000
content-class: urn:content-classes:message
MIME-Version: 1.0
Subject: SpamCop has accepted 1 email for processing
Content-Type: multipart/mixed;
boundary="----_=_NextPart_001_01C33C8E.EB776980"
Date: Fri, 27 Jun 2003 03:38:54 -0500
Message-ID: <spamid122422379 at msgid.spamcop.net>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: <spamid122422379 at msgid.spamcop.net>
X-MimeOLE: Produced By Microsoft Exchange V6.0.6375.0
Thread-Topic: SpamCop has accepted 1 email for processing
Thread-Index: AcM8juuNBPpXZah+Ede8rgBQi9Y6Rw==
From: "SpamCop AutoResponder" <nobody at devnull.spamcop.net>
To: <murnt1 at bp.com>
Return-Path: nobody at devnull.spamcop.net
X-OriginalArrivalTime: 27 Jun 2003 09:43:43.0555 (UTC)
	FILETIME=[98F36130:01C33C90]


PLEASE HELP SUPPORT THIS SERVICE!
SpamCop is free.  However, if you like the service please pay for it:
http://spamcop.net/upgradeaccount.shtml?K3zPFY7PSqjxbO7q

SpamCop is now ready to process your spam.

Use links to finish spam reporting:
http://spamcop.net/sc?id=z122422379z3a05545ef058638f062ab78444977786z


The email which triggered this auto-response had the following headers:
 Received: (qmail 32546 invoked from network); 27 Jun 2003 08:38:23
-0000
Received: from euhemsmtp01.bp.com (62.189.94.209)
  by saruman.julianhaight.com with SMTP; 27 Jun 2003 08:38:23 -0000
Received: from BP1HEMAV001.bp1.ad.bp.com (inetgate21.bp.com
[62.189.94.193])
	by euhemsmtp01.bp.com (Switch-3.0.4/Switch-3.0.0) with SMTP id
h5R8hhZ3029230
	for <submit.K3zPFY7PSqjxbO7q at spam.spamcop.net>; Fri, 27 Jun 2003
09:43:43 +0100 (BST)
Received: from 149.182.114.119 by BP1HEMAV001.bp1.ad.bp.com (InterScan
E-Mail VirusWall NT); Fri, 27 Jun 2003 09:37:58 +0100
content-class: urn:content-classes:message
Subject: 
MIME-Version: 1.0
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Date: Fri, 27 Jun 2003 09:37:02 +0100
Message-ID: <2FE5DE0B8790D411832700508BAF485906F17AF5 at eumorx5.bp.com>
X-MimeOLE: Produced By Microsoft Exchange V6.0.6375.0
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Index: AcM8h2m8K3AaOuxZQ0WTX659Xyx/vg==
From: "Murnaghan, Tim" <murnt1 at bp.com>
To: "SpamCop (E-mail)" <submit.K3zPFY7PSqjxbO7q at spam.spamcop.net>
Message Tokens:

135 unique tokens

'"murnaghan,'
'"spamcop'
'(bst)'
'(e-mail)"'
'(interscan'
'(qmail'
'+0100'
'-0000'
'08:38:23'
'09:37:02'
'09:37:58'
'09:43:43'
'1.0'
'2003'
'32546'
'cc:none'
'content-type:text/plain'
'date:'
'e-mail'
'email'
'exchange'
'finish'
'following'
'for'
'free.'
'fri,'
'from'
'from:'
'from:addr:devnull.spamcop.net'
'from:addr:nobody'
'from:name:spamcop autoresponder'
'had'
'header:Date:1'
'header:From:1'
'header:MIME-Version:1'
'header:Message-ID:1'
'header:Received:6'
'header:Return-Path:1'
'header:Subject:1'
'header:To:1'
'headers:'
'help'
'however,'
'invoked'
'it:'
'jun'
'like'
'links'
'message-id:'
'message-id:@msgid.spamcop.net'
'microsoft'
'network);'
'noheader:abuse-reports-to'
'noheader:errors-to'
'noheader:importance'
'noheader:in-reply-to'
'noheader:organization'
'noheader:reply-to'
'noheader:user-agent'
'noheader:x-abuse-info'
'noheader:x-complaints-to'
'noheader:x-face'
'now'
'nt);'
'pay'
'please'
'process'
'produced'
'proto:http'
'ready'
'received:'
'reply-to:none'
'reporting:'
'sender:none'
'service'
'service!'
'skip:( 10'
'skip:( 20'
'skip:1 10'
'skip:[ 10'
'skip:a 10'
'skip:a 30'
'skip:b 20'
'skip:c 10'
'skip:c 20'
'skip:e 10'
'skip:h 10'
'skip:m 10'
'skip:q 10'
'skip:s 20'
'skip:t 10'
'skip:u 20'
'skip:x 10'
'skip:x 20'
'smtp'
'smtp;'
'spam'
'spam.'
'spamcop'
'subject:'
'subject: '
'subject:SpamCop'
'subject:accepted'
'subject:email'
'subject:for'
'subject:has'
'subject:processing'
'support'
'text/plain;'
'the'
'this'
'tim"'
'to:'
'to:2**0'
'to:addr:bp.com'
'to:addr:murnt1'
'to:no real name:2**0'
'triggered'
'url:id'
'url:k3zpfy7psqjxbo7q'
'url:net'
'url:sc'
'url:shtml'
'url:spamcop'
'url:upgradeaccount' 'url:z122422379z3a05545ef058638f062ab78444977786z'
'use'
'v6.0.6375.0'
'viruswall'
'which'
'with'
'x-mailer:none'
'x-mimeole:'
'you'
'your'



_______________________________________________
Spambayes mailing list
Spambayes at python.org http://mail.python.org/mailman/listinfo/spambayes


LEGAL NOTICE:
Unless expressly stated otherwise, this message is confidential and may be privileged. It is intended for the addressee(s) only. Access to this e-mail by anyone else is unauthorized. If you are not an addressee, any disclosure or copying of the contents or any action taken (or not taken) in reliance on it is unauthorized and may be unlawful. If you are not an addressee, please inform the sender immediately.



More information about the Spambayes mailing list