[Spambayes] Improved comparison of classifier changes?
piersh at friskit.com
Fri Mar 7 09:51:49 EST 2003
(This came to me in a dream. No, really...)
When comparing two different classifier/tokenizer strategies, instead of
just comparing the numbers of false negatives and positives, how about
comparing some function (product, sum, average,
some-more-appropriate-statistical-function?) of the spam probability of
all messages in each classification (spam, ham, false-positive,
false-negative)? This might give a slightly better indication of not
just the numbers of messages that were classified correctly/incorrectly,
but of how sure the classifier was when it made those decisions.
.. or was I just dreaming...?
More information about the Spambayes