[spambayes-dev] Tricky false positive: US states
Neil Schemenauer
nas-spambayes at python.ca
Mon Oct 6 12:20:36 EDT 2003
On Sat, Oct 04, 2003 at 06:11:45PM -0400, Tim Peters wrote:
> That's what happened here. The rub is that getting the same judgment from
> 100 consultants isn't *really* more reliable than getting it from one
> consultant unless the consultants are independent -- if they are
> independent, very high confidence is fully justified. In this case, the
> consultants are all related, biased in the same direction for a reason.
It occurs to me that if Spambayes could understand some degree of
dependence between tokens, multi-user databases might work much
better. For example, one person might like Amazon pseudospam while
another may consider it outright spam.
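A rough sketch of the quoted point about independence, assuming per-token spam probabilities are combined with a Fisher-style chi-squared test. The function names and the 0.8 figure are made up for illustration; this is not the actual Spambayes code. It shows how 100 copies of the same mildly spammy clue get treated as 100 independent witnesses:

```python
import math

def chi2q(x2, dof):
    """P(chi-squared with `dof` degrees of freedom >= x2); dof must be even."""
    assert dof % 2 == 0
    m = x2 / 2.0
    term = total = math.exp(-m)
    for i in range(1, dof // 2):
        term *= m / i
        total += term
    # Guard against rounding pushing the sum slightly above 1.
    return min(total, 1.0)

def combine(probs):
    """Combine per-token spam probabilities, assuming they are independent."""
    n = len(probs)
    spam = 1.0 - chi2q(-2.0 * sum(math.log(1.0 - p) for p in probs), 2 * n)
    ham = 1.0 - chi2q(-2.0 * sum(math.log(p) for p in probs), 2 * n)
    return (spam - ham + 1.0) / 2.0

# One mildly spammy token (hypothetical probability 0.8).
one = combine([0.8])

# The "same" evidence repeated 100 times, e.g. 100 strongly related tokens.
hundred = combine([0.8] * 100)

print(one, hundred)
```

If the 100 tokens really were independent, the near-certain second score would be justified. If they are all consequences of one underlying fact, the honest answer is closer to the first score.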
Neil