[Spambayes] What performance is good?
tameyer at ihug.co.nz
Thu May 13 22:43:11 EDT 2004
> How much do you expect to see in Junk Suspects?
Good results would be 2-5% of total incoming mail, IMO.
> I get about 15% ham. All the ham goes in Inbox (I had one
> ham in Junk Suspects last week), along with 1 or 2 spam,
> which is beautiful. 25% to 30% of incoming mail goes in Junk
> Suspects. Is that reasonable?
Are you displaying the scores for these messages? If so, do they all tend
to score over a certain value? You might find, for example, that you can
simply reduce the spam threshold (say to 80%) and the problem goes away.
What do you have the thresholds set for now? (SpamBayes->SpamBayes
> My database is unbalanced (36 ham/84 spam), so I'm only
> training on ham in Junk Suspects (i.e., hardly ever). Should
> I be seeking ham to train on?
IMO, that's not imbalanced enough to worry about.
Please always include the list (spambayes at python.org) in your replies
(reply-all), and please don't send me personal mail about SpamBayes. This
way, you get everyone's help, and avoid a lack of replies when I'm busy.
More information about the Spambayes