[Spambayes] Love Spambayes, but I wonder if
Jesse Pelton
jsp at PKC.com
Wed Nov 9 14:21:07 CET 2005
Interesting! I wonder if there's an appropriate place for this in the
wiki?
> -----Original Message-----
> From: spambayes-bounces at python.org
> [mailto:spambayes-bounces at python.org] On Behalf Of Russ Foster
> Sent: Tuesday, November 08, 2005 4:44 PM
> To: SpamBayes List
> Subject: Re: [Spambayes] Love Spambayes, but I wonder if
>
>
> I've started experimenting with Spambayes a bit more on my home Linux
> machine.
>
> I have two directories: TrainSpam and TrainHam
>
> I put false positives/negatives and unsures in the
> appropriate directory.
> Every 5 minutes a cron job trains on those directories.
>
> Once a day, another cron job:
>
> - purges anything in these two directories that is older than 7 days.
>
> - moves my existing 'hammiedb' file
>
> - creates a new 'hammiedb' file
>
> - forces a re-training on the TrainSpam and TrainHam directories
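
[A minimal Python sketch of the daily job described above, not Russ's
actual script. The function names, the timestamped backup name, and the
`retrain` callable are assumptions made for illustration. The real training
command is deliberately left to the caller, since the post does not say
which SpamBayes script or options were used.]

```python
import os
import shutil
import time

SECONDS_PER_DAY = 24 * 60 * 60


def purge_old_files(directory, max_age_days=7, now=None):
    """Delete regular files in `directory` last modified over max_age_days ago."""
    now = time.time() if now is None else now
    cutoff = now - max_age_days * SECONDS_PER_DAY
    removed = []
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        if os.path.isfile(path) and os.path.getmtime(path) < cutoff:
            os.remove(path)
            removed.append(name)
    return removed


def rotate_database(db_path, now=None):
    """Move the existing database aside so training starts from an empty one.

    Returns the backup path, or None if there was no database to move.
    The timestamp suffix is an illustrative choice, not from the post.
    """
    if not os.path.exists(db_path):
        return None
    stamp = time.strftime("%Y%m%d-%H%M%S", time.localtime(now))
    backup_path = f"{db_path}.{stamp}"
    shutil.move(db_path, backup_path)
    return backup_path


def daily_job(train_dirs, db_path, retrain, max_age_days=7):
    """Purge old training mail, set the old database aside, then retrain.

    `retrain` is a hypothetical caller-supplied callable that creates a
    fresh database and trains it on the remaining messages.
    """
    for directory in train_dirs:
        purge_old_files(directory, max_age_days)
    rotate_database(db_path)
    retrain()
```

[With something like this, a daily cron entry would call
daily_job(["TrainSpam", "TrainHam"], "hammiedb", retrain), where retrain
wraps whatever training command is already used by the 5-minute job.]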
>
> While I don't have anything quantitative, my false negatives and false
> positives seem to be drastically reduced.
>
> This script has the effect of only keeping words in the database that have
> been seen in the past 7 days, which accounts, somewhat, for the change in
> the character of spam (and ham).
>
> Maybe once a day is overkill...but right now my system has cycles to
> spare.
>
> -r
>
>
>
> On Tue, 8 Nov 2005, Jesse Pelton wrote:
>
> > See
> > http://spambayes.sourceforge.net/faq.html#can-i-share-move-my-training-data-from-one-computer-to-another
> > for how to do this.
> >
> > But there's a price for that answer: I'm going to give you my opinion as
> > well. I wouldn't bother copying the training data. SpamBayes learns very
> > quickly, and the character of the spam I receive changes over time, so
> > rather than hanging on to training data, I delete it and retrain from
> > scratch periodically. Within a day or two I find I'm getting better
> > results.
> >
> >
>
> _______________________________________________
> SpamBayes at python.org
> http://mail.python.org/mailman/listinfo/spambayes
> Check the FAQ before asking: http://spambayes.sf.net/faq.html
>