[Spambayes] Tricks of Spammer

Tony Meyer tameyer at ihug.co.nz
Wed Oct 8 19:37:13 EDT 2003


> 1: Spam words
> 2: Ham words
> 3: Real words --> Spelling Dictionary from Word (German, English, ..)?

Note that spambayes doesn't know what a spam word or ham word is - it's just
a word with a couple of numbers (times seem in ham and spam).  Any word
might be considered ham, spam, or somewhere in between, depending on the
training you've done.

This is basically the same as:
 [ spambayes-Feature Requests-817813 ] Consider badspelling a sign of spam
<https://sourceforge.net/tracker/?func=detail&atid=498106&aid=817813&group_i
d=61702>

It's come up a few times; feel free to add comments to that tracker.  I
imagine someone will play around with testing the idea at some point (to
close the tracker, if for no other reason), but it'll have to wait until
someone finds the time and inclination to do that.

FWIW, those 'junk' words in your messages shouldn't have affected the score
at all.  If they hadn't been seen before, they would have scored 0.5, and
tokens that score between 0.4 and 0.6 aren't included in calculating the
message's score (unless you've changed the defaults).  If you look at the
clues for the message, it should become apparent why it scored what it did -
note that even 72% isn't that bad - training on a single message like this
may very well push all future ones into certain spam territory.

=Tony Meyer




More information about the Spambayes mailing list