[spambayes-bugs] [ spambayes-Bugs-972359 ] some 100% SPAM sent to unsure

SourceForge.net noreply at sourceforge.net
Fri Jan 7 18:07:33 CET 2005


Bugs item #972359, was opened at 2004-06-14 02:12
Message generated for change (Comment added) made by traal42
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=972359&group_id=61702

Category: Outlook
Group: Source code - CVS
Status: Open
Resolution: None
Priority: 5
Submitted By: Dan Zenchelsky (dzenc)
Assigned to: Mark Hammond (mhammond)
Summary: some 100% SPAM sent to unsure

Initial Comment:
(This is a continuation of Bugs item #923921.  I am 
opening a new bug report so that I can upload related 
files.)

About 5-10% of all 100%-spam gets shifted to the
"Possible Spam" folder rather than the "Spam" folder. 

-

I received a message which was sent to the "unsure" 
folder.  I then ran "show spam clues" on it, which 
showed that it was 100% SPAM (see attached file).  

I then manually ran "filter messages..." on the unsure 
folder and it correctly detected the message as SPAM 
and moved it to the correct folder.

*No* training was done between the time the message 
arrived and the time that I performed these steps.


 


----------------------------------------------------------------------

Comment By: Derek Hofmann (traal42)
Date: 2005-01-07 17:07

Message:
Logged In: YES 
user_id=662400

I'm also getting 0% spam mails going to Unsure.

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2004-11-03 04:05

Message:
Logged In: YES 
user_id=552329

Noting dupe in:

[ 923921 ] 100% Spam moved only to
<https://sourceforge.net/tracker/index.php?func=detail&aid=923921&group_id=61702&atid=498103>

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2004-11-03 03:01

Message:
Logged In: YES 
user_id=552329

Hmm.  I wonder whether my one was a wierd fluke (I still
have the clues message in my drafts folder...) since it
doesn't have the content-type stuff, and these others do.

Noting dupe in:
[ 981750 ] 100% SPAM in possible SPAM folder
https://sourceforge.net/tracker/?group_id=61702&atid=498103&func=detail&aid=981750

(That tracker includes another example msg).

----------------------------------------------------------------------

Comment By: Tony Meyer (anadelonbrin)
Date: 2004-06-29 05:01

Message:
Logged In: YES 
user_id=552329

I have never seen this before, but did today (as in
spambayes-dev message).  Attached is the message that it
happened with.  Let me know if you need anything else (I
saved the spam clues message so have that, but the log (ie.
contents of the trace window in Pythonwin) are gone now, but
I assure you it had nothing odd).

----------------------------------------------------------------------

Comment By: Mark Hammond (mhammond)
Date: 2004-06-28 07:21

Message:
Logged In: YES 
user_id=14198

This will be an error in our 'content-type' magic.  If I
look at the headers for this message, as shown by Outlook's
View->Options, I see:

"""
Microsoft Mail Internet Headers Version 2.0
Received: from mx56.dadoihag.us ([64.136.98.56]) by
proxy.danz.com with Microsoft SMTPSVC(5.0.2195.6713);
	 Sun, 13 Jun 2004 18:39:09 -0700
From: "United Coalition" <SarahAllen at mx56.dadoihag.us>
To: " " <danz at danz.com>
Subject:  1 Minute Mortgage Application Here
Date: Sun, 13 Jun 2004 20:39:07 -0600
Message-ID: <lyadopvyxqhkvziyyttf at mx56.dadoihag.us>
MIME-Version: 1.0
Content-Type: multipart/alternative;
	boundary="----=_NextPart_000_0D33_D859C01A.71B0FC9A"
Return-Path: SarahAllen at mx56.dadoihag.us
X-OriginalArrivalTime: 14 Jun 2004 01:39:10.0757 (UTC)
FILETIME=[64083550:01C451B0]

------=_NextPart_000_0D33_D859C01A.71B0FC9A
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: 7bit

------=_NextPart_000_0D33_D859C01A.71B0FC9A
Content-Type: text/html; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable


------=_NextPart_000_0D33_D859C01A.71B0FC9A--
"""
That doesn't look right (to me) for the headers, and I
suspect is what is causing the tokenizer to not see all the
tokens when filtered, but it *does* see them all for 'spam
clues'

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=972359&group_id=61702


More information about the Spambayes-bugs mailing list