[Spambayes] What is spam?

Neale Pickett neale@woozle.org
16 Sep 2002 22:42:32 -0700


So then, Guido van Rossum <guido@python.org> is all like:

> Paul Graham defines spam as *automated* unwanted email.

No doubt.  Here's a run against a properly-sorted set of ham and spam,
then.  I don't imagine these variances are even worth looking at, since
0.5% is a single message.

run1s -> run2s
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams
-> <stat> tested 200 hams & 200 spams against 800 hams & 800 spams

false positive percentages
    1.500  2.000  lost   +33.33%
    2.500  1.000  won    -60.00%
    1.000  0.500  won    -50.00%
    0.500  0.500  tied
    0.000  0.000  tied

won   2 times
tied  2 times
lost  1 times

total unique fp went from 11 to 8 won    -27.27%
mean fp % went from 1.1 to 0.8 won    -27.27%

false negative percentages
    2.000  2.000  tied
    1.500  1.500  tied
    0.000  0.500  lost  +(was 0)
    1.000  1.000  tied
    1.500  1.500  tied

won   0 times
tied  4 times
lost  1 times

total unique fn went from 12 to 13 lost    +8.33%
mean fn % went from 1.2 to 1.3 lost    +8.33%