[Spambayes] Junk suspects

Niles Parr nparr at mortonwelding.com
Mon Aug 16 19:29:21 CEST 2004


Here are the results from my session so far today.  This was a rather large
email download since it covered the entire weekend.  My database has about
2600 ham and 2600 spam msgs in it.  I continually have to mark some unsure
msgs as ham even though they are from the same person, and some of the time
with the same subject.  It's like it can't get it through its head that
they're good.  And it seems to constantly mark out-of-office replies as spam.

Processed 2024 msgs
1201 Good
660 Spam
163 Unsure

88 msgs were manually classified as good, with 0 being false positives.
78 msgs were manually classified as spam, with 3 being false.
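
For reference, the breakdown above can be turned into percentages with a
short Python sketch.  The counts are the ones reported in this message; the
names (counts, share) are only illustrative.

```python
# Counts reported for this session (2024 messages processed).
processed = 2024
counts = {"good": 1201, "spam": 660, "unsure": 163}

# Sanity check: the three buckets account for every processed message.
assert sum(counts.values()) == processed


def share(n, total):
    """Return n as a percentage of total."""
    return 100.0 * n / total


for label, n in counts.items():
    print(f"{label:>6}: {n:4d} ({share(n, processed):.1f}%)")
```

This puts the unsure bucket at roughly 8% of the session's mail.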

Niles  

-----Original Message-----
From: Tony Meyer [mailto:tameyer at ihug.co.nz] 
Sent: Thursday, August 12, 2004 6:30 PM
To: 'Niles Parr'; spambayes at python.org
Subject: RE: [Spambayes] Junk suspects

> How long should I expect it to take before the number of junk suspect 
> msgs starts to decrease?

It really depends completely on the email mix and training that you do.

> I receive around 100-150 msgs an hour.  What it's detecting as spam 
> is almost dead on, but I'm still getting email from repeat senders and 
> subjects detected as suspects.  The vast majority of suspects 
> are legit.  I've only been working on this for about a day so far and 
> I imagine it takes a good week or so with that kind of volume?

I get good results after training on far fewer than 100 ham & 100 spam, so in
theory it could take you less than an hour.  You don't say anything about
the training that you're doing/have done, or the number of messages that
you have trained on - these make all the difference.

To see why a message scores the way it does, select it (before training)
and choose "Show spam clues for this message" from the SpamBayes menu.  This
will bring up a message with the list of clues - you will probably be able to
see for yourself why it scores what it does (which should hint at the
solution), but if you can't, you can send a copy to the list (with an
explanation) and we can try to explain it to you.
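
As a rough illustration of how a list of clues can add up to an "unsure"
score, here is a minimal Python sketch of chi-squared combining of per-token
spam probabilities.  It assumes the chi-squared combining scheme usually
associated with SpamBayes; it is a simplification, not the project's actual
code, and the function names (chi2q, combine) and the example clue values
are made up for this example.

```python
import math


def chi2q(x2, df):
    """Probability that a chi-squared variable with an even number of
    degrees of freedom df is at least x2."""
    assert df % 2 == 0
    m = x2 / 2.0
    total = term = math.exp(-m)
    for i in range(1, df // 2):
        term *= m / i
        total += term
    # Guard against rounding pushing the sum slightly above 1.
    return min(total, 1.0)


def combine(clue_probs):
    """Combine per-clue spam probabilities (each strictly between 0 and 1)
    into a single score between 0 (ham) and 1 (spam)."""
    n = len(clue_probs)
    if n == 0:
        return 0.5  # no evidence either way
    ln_ham = sum(math.log(1.0 - p) for p in clue_probs)
    ln_spam = sum(math.log(p) for p in clue_probs)
    s = 1.0 - chi2q(-2.0 * ln_ham, 2 * n)   # strength of the spam evidence
    h = 1.0 - chi2q(-2.0 * ln_spam, 2 * n)  # strength of the ham evidence
    return (s - h + 1.0) / 2.0


# Hypothetical clues: strongly spammy, strongly hammy, and an even mix.
print(combine([0.99, 0.98, 0.97, 0.99, 0.95]))
print(combine([0.01, 0.02, 0.03, 0.01, 0.05]))
print(combine([0.05, 0.10, 0.95, 0.90]))
```

Under this scheme, a message whose strong ham clues are balanced by equally
strong spam clues lands near the middle, which is one way mail from a known
sender can keep showing up as a suspect.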

=Tony Meyer

---
Please always include the list (spambayes at python.org) in your replies
(reply-all), and please don't send me personal mail about SpamBayes. This
way, you get everyone's help, and avoid a lack of replies when I'm busy.
