[spambayes-dev] NEWTRICKS
David Abrahams
dave at boost-consulting.com
Fri Dec 26 13:16:55 EST 2003
I keep getting quite a few spams which fit the descriptions below
(from NEWTRICKS.txt):
- Punctuation sometimes gets inserted in otherwise spammy words or phrases,
  e.g.: "Ch-eck ou=t ou-r sel)ection _of grea)t R_X -emgffj". It might be
  helpful to try stripping punctuation. (Idea from Paul Sorenson)
- Similarly, some letters get replaced by numbers, e.g.: "V1agra" instead of
  "Viagra". Mapping numbers to suitable letters might help in some
  situations.
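
To make the two ideas concrete, here is a rough sketch of the kind of
normalization I have in mind. It is not taken from the spambayes tokenizer;
the function names, the "deobfuscated:" token prefix, and the digit-to-letter
table are all hypothetical, and which substitutions count as "suitable" is
only a guess. It emits an extra token only when normalizing actually changes
a word, so the original token would still be available as a clue.

```python
import string

# Assumed digit-to-letter substitutions; the right table is an open question.
DIGIT_MAP = str.maketrans({"0": "o", "1": "i", "3": "e", "4": "a", "5": "s", "7": "t"})

# Translation table that deletes every ASCII punctuation character.
STRIP_PUNCT = str.maketrans("", "", string.punctuation)


def strip_punctuation(word):
    """Remove ASCII punctuation, e.g. 'sel)ection' -> 'selection'."""
    return word.translate(STRIP_PUNCT)


def map_digits(word):
    """Map digits to letters, but only in words that mix letters and digits,
    so that plain numbers such as '2003' are left alone."""
    has_alpha = any(c.isalpha() for c in word)
    has_digit = any(c.isdigit() for c in word)
    if has_alpha and has_digit:
        return word.translate(DIGIT_MAP)
    return word


def extra_tokens(text):
    """Yield a hypothetical 'deobfuscated:' token for each whitespace-separated
    word whose normalized form differs from its lowercased original."""
    for raw in text.split():
        word = raw.lower()
        normalized = map_digits(strip_punctuation(word))
        if normalized and normalized != word:
            yield "deobfuscated:" + normalized
```

For instance, list(extra_tokens("V1agra")) gives ['deobfuscated:viagra'],
while an ordinary word like "hello" produces no extra token. Whether such
tokens help or hurt is exactly what would need testing.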
Since "this file is for ideas that have or have not yet been tried",
I'd love to know what constitutes "trying". Is there some official
testing procedure or corpus we can test against? I'd like to know
whether any change I make is worth proposing. Of course I can try it
on my own databases of Ham and Spam first...
--
Dave Abrahams
Boost Consulting
www.boost-consulting.com