A Moronicity of Guido van Rossum
Tony Meyer
t-meyer at ihug.co.nz
Thu Sep 29 23:51:01 EDT 2005
> I know nobody wants to do add "white/black-listing", so we can do it
> probabilistically. In case it is not obvious, mailings with the words
> "jargon" or "moron" and their derrivatives should be flagged as 99.9%
> probability for Moronicity Xha Lee, Jargonizer, spam. If spam bayes
> can't figure this out, then either it is not properly implemented or
> Bayes himself was out to lunch.
I knew I'd regret my response <wink>.
The problem here isn't getting an appropriately spammy score for
particular tokens, like Xah's name. The problem is that the
classifier has to take into account the entire message, and the
hammy clues outweigh the spammy ones (not unexpected, really,
considering that other than all the trolling, the messages are
reasonably on-topic).
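To make that concrete, here is a minimal sketch of chi-squared
combining of per-token spam probabilities, roughly in the spirit of
what SpamBayes does. It is an illustration under assumptions, not the
SpamBayes code: the function names are hypothetical and the token
probabilities are made up.

```python
import math

def chi2q(x2, df):
    """P(chi-squared >= x2) for an even number of degrees of freedom."""
    assert df > 0 and df % 2 == 0
    m = x2 / 2.0
    term = total = math.exp(-m)
    for i in range(1, df // 2):
        term *= m / i
        total += term
    return min(total, 1.0)

def combine(probs):
    """Combine per-token spam probabilities (each strictly between 0 and 1)
    into one score: near 0.0 is hammy, near 1.0 is spammy."""
    n = len(probs)
    if n == 0:
        return 0.5  # no evidence either way
    # Large when many tokens look spammy (1 - p is small).
    spam_stat = -2.0 * sum(math.log(1.0 - p) for p in probs)
    # Large when many tokens look hammy (p is small).
    ham_stat = -2.0 * sum(math.log(p) for p in probs)
    spam_evidence = 1.0 - chi2q(spam_stat, 2 * n)
    ham_evidence = 1.0 - chi2q(ham_stat, 2 * n)
    return (spam_evidence - ham_evidence + 1.0) / 2.0

# Hypothetical numbers: one token scored at 0.999, on its own and then
# alongside ten on-topic tokens scored at 0.05 each.
lone = combine([0.999])
mixed = combine([0.999] + [0.05] * 10)
print("spammy token alone: %.3f" % lone)
print("with ten hammy tokens: %.3f" % mixed)
```

With these assumed numbers the lone token scores as spam, but the same
token surrounded by ten hammy clues ends up well on the ham side.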
This is a feature, not a bug. It's the same feature that means that
messages talking about spam on the spambayes mailing list, or the
legitimate mail I get about viagra <wink>, get through to me.
=Tony.Meyer