[Spambayes] Re: Training Spambayes

Mathew Hendry TJLWBECGSGWU at spammotel.com
Sat Dec 18 14:17:16 CET 2004


On Fri, 17 Dec 2004 12:21:18 -0500, "Kenny Pitt" <kennypitt at hotmail.com>
wrote:

>The "train-on-mistakes-and-unsures" strategy implemented in the Outlook
>addin is believed to be the most effective strategy for most general users.

Is that how the automated training is implemented in the latest CVS
versions? Or are you talking about manual training, starting with an empty
database and correcting any mistakes as new messages arrive?

I was thinking that the "train on mistakes" approach could be taken a step
further, down to the individual token level: all encountered tokens are
stored in the database, but only "activated" for filtering when found to be
required to filter correctly; that is, when a mistake is found, tokens are
activated in order of decreasing significance until classification is
correct. Has anyone tried anything like this?

-- Mat.




More information about the Spambayes mailing list