[Spambayes] What is spam?
Neale Pickett
neale@woozle.org
16 Sep 2002 22:42:32 -0700
So then, Guido van Rossum <guido@python.org> is all like:
> Paul Graham defines spam as *automated* unwanted email.
No doubt. Here's a run against a properly sorted set of ham and spam,
then. I don't imagine these variances are even worth looking at, since
each run tests only 200 hams and 200 spams, so 0.5% is a single message.
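A quick sketch of the arithmetic behind that remark, for anyone checking the numbers. The helper names are hypothetical and not part of the tester; the only input taken from the output is the test-set size of 200 per class.

```python
# Hypothetical helpers for illustration; not part of the tester.
TEST_SET_SIZE = 200  # hams (or spams) tested per run, per the <stat> lines


def count_to_pct(count, n=TEST_SET_SIZE):
    """Express a number of misclassified messages as a percentage of n."""
    return 100.0 * count / n


def pct_to_count(pct, n=TEST_SET_SIZE):
    """Convert a reported error percentage back to a whole message count."""
    return round(pct * n / 100)


# The smallest possible step in any reported rate is one message.
print(count_to_pct(1))

# A move from 1.5% to 2.0% is therefore a difference of one message.
print(pct_to_count(1.5), pct_to_count(2.0))
```

With test sets this small, every rate in the output is a multiple of that one-message step.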
run1s -> run2s
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
false positive percentages
    1.500  2.000  lost   +33.33%
    2.500  1.000  won    -60.00%
    1.000  0.500  won    -50.00%
    0.500  0.500  tied
    0.000  0.000  tied

won   2 times
tied  2 times
lost  1 times

total unique fp went from 11 to 8  won   -27.27%
mean fp % went from 1.1 to 0.8     won   -27.27%

false negative percentages
    2.000  2.000  tied
    1.500  1.500  tied
    0.000  0.500  lost   +(was 0)
    1.000  1.000  tied
    1.500  1.500  tied

won   0 times
tied  4 times
lost  1 times

total unique fn went from 12 to 13  lost   +8.33%
mean fn % went from 1.2 to 1.3      lost   +8.33%
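
The summary lines can be cross-checked from the per-run percentages. This is a minimal sketch, not the comparison script itself: the `summarize` function is hypothetical, and it assumes that no message is misclassified in more than one run, so that summing per-run counts gives the "total unique" figure.

```python
N = 200  # messages of each class tested per run, per the <stat> lines

# Per-run error percentages copied from the output: (run1s, run2s).
fp = [(1.5, 2.0), (2.5, 1.0), (1.0, 0.5), (0.5, 0.5), (0.0, 0.0)]
fn = [(2.0, 2.0), (1.5, 1.5), (0.0, 0.5), (1.0, 1.0), (1.5, 1.5)]


def summarize(pairs, n=N):
    """Return (total before, total after, relative change in percent).

    Assumes each error is a distinct message, so per-run counts can be
    summed. Raises ZeroDivisionError if the 'before' total is zero.
    """
    before = sum(round(a * n / 100) for a, _ in pairs)
    after = sum(round(b * n / 100) for _, b in pairs)
    change = 100.0 * (after - before) / before
    return before, after, change


for name, pairs in (("fp", fp), ("fn", fn)):
    before, after, change = summarize(pairs)
    print(f"total {name} went from {before} to {after} ({change:+.2f}%)")
```

Under that assumption the totals come out to 11 -> 8 for false positives and 12 -> 13 for false negatives, which agrees with the reported lines: a net change of three messages one way and one message the other.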