[Spambayes] Re: [spambayes-dev] a useful pre-filter for
auto-training bayesian systems?
jm at jmason.org
Thu Feb 19 20:21:44 EST 2004
-----BEGIN PGP SIGNED MESSAGE-----
Ryan Malayter writes:
> I found this interesting. Using the social network as a first step, a
> bunch of "definite ham" and "definite spam" messages are listed. These
> can be used to train a Bayesian filter which then filters the rest of
> the unsures automatically. Very little user intervention would therefore
> be required for training, and it cuts in half the number of messages
> that must be filtered by the much-more-expensive statistical filter.
> One could even imagine users securely posting their email addressee's
> "white lists" by posting SHA-1 hashes instead of actually email
> addresses to some public forum. (This would have to be salted, of
> course). This could create a meta-social-network.
> They don't seem to address the issue of a spam that has a forged address
> from your own social network, though, which might trip up this whole
> social network process.
Yeah -- spam with forged From of your address. That has historically
been how spammers get around address-book-based whitelisting, because
everyone usually has 1 or more of their own addrs in the address book.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3 (GNU/Linux)
Comment: Exmh CVS
-----END PGP SIGNATURE-----
More information about the Spambayes