[spambayes-dev] Another incremental training idea...
Toby Dickenson
tdickenson at geminidataloggers.com
Thu Jan 15 09:52:00 EST 2004
On Thursday 15 January 2004 13:50, Skip Montanaro wrote:
> Toby> If Im reading this right, my 7:1 imbalance doesnt hurt me.
>
> Toby> filename: unbal bal1 bal2 bal3
> Toby> ham:spam: 14560:1992 1992:1992
> Toby> 1992:1992 1992:1992
> Toby> fp total: 0 0 1 0
> Toby> fp %: 0.00 0.00 0.05 0.00
> Toby> fn total: 12 6 8 6
> Toby> fn %: 0.60 0.30 0.40 0.30
> Toby> unsure t: 102 21 23 29
> Toby> unsure %: 0.62 0.53 0.58 0.73
> It doesn't seem to have a negative effect on false positives, but it looks
> like you will get roughly double the number of false negatives and 4-5x as
> many unsures.
4x as many unsures, out of a total population that is 4x larger. so no overall
percentage change. Am I reading that right?
--
Toby Dickenson
More information about the spambayes-dev
mailing list