[Spambayes] Important: new feature!
tameyer at ihug.co.nz
Fri Jul 9 05:04:40 CEST 2004
> I understand what you're saying, and I've re-trained
> spambayes on equal numbers of HAM and SPAM, 300 each. Yet, it
> still has a tendency to classify some HAM as spam. A
> significant percentage is like maybe 25% of my regular
> announcement mail gets spamflagged...
Something is definitely not working here - a 25% false positive rate is
extraordinarily high. You shouldn't even get as high as 2.5%. Perhaps you
could send the spam clues report for one of these messages (before any
training on it) to the list? We could take a look at it and see if there's
an obvious reason that it's failing to be classified correctly.
> Perhaps I have a better
> solution: I have a lot of mailtraffic from the local
> university network and a lot of correspondence, announcements
> etc arrive from addresses with the "phys.uu.nl" suffix.
> Surely it must be possible to exclude messages whose
> returnaddress contain that string (or other certified
> addresses) from further processing by spambayes?
This is called "whitelisting", and SpamBayes does not do it. For a lengthy
explanation, please see FAQ 6.6:
You ought to be able to implement some sort of whitelisting with Outlook's
rule system, however.
Please always include the list (spambayes at python.org) in your replies
(reply-all), and please don't send me personal mail about SpamBayes. This
way, you get everyone's help, and avoid a lack of replies when I'm busy.
More information about the Spambayes