[Spambayes] Unique text = ham?
tim at aterraform.com
Mon Mar 27 04:09:57 CEST 2006
The pop3proxy uses the same filtering as all of the other spambayes
interfaces. There is no difference there.
There is no function at this point to always flag a unique sequence of
characters as non-spam. Presuming that the unique sequence of
characters occurs only in ham for you, and you train enough of them,
then spambayes will eventually get it right. I might suggest, however,
that you implement a rule in your mailer that looks for that sequence
and simply files it off to a folder before the rule that moves flagged
spam gets ahold of it. That way, whether or not spambayes has flagged
it as spam is of no consequence.
Generally speaking, spambayes flags mail as likely spam or not, but it's
up to your mailer to do something with that information. If you're
using outlook, then the spambayes plugin is a bit more proactive than
that, but that's the only exception.
Thomas Hruska wrote:
>Amedee Van Gasse wrote:
>>Thomas Hruska wrote:
>>>I need a way to tell SpamBayes to ignore classifying e-mails that
>>>contain a specific sequence of characters even if it would have
>>>classified the e-mail as spam. How do I go about accomplishing this
>>>task with the POP3 Proxy SpamBayes server? I can't seem to locate an
>>>option to do this - perhaps I missed it? This feature is a "stupid"
>>>feature, but will keep the SpamBayes database lean-and-mean and a fair
>>>balance between ham and spam for training. A 20:1 ham to spam ratio
>>>instead of 1000:1. I can't have it classifying these critical e-mails
>>>at all. I want them all even if they ARE spam.
>>I think, if you know the mathematical theory behind spambayes, if it is
>>a unique sequence of characters, and that unique sequence of characters
>>will never appear in spam, it's just the law of large numbers that applies.
>>So, if you keep on telling that this specific kind of email is ham,
>>after just a few mails sb will learn. Stupid large numbers always beat
>>smart and stupid features.
>>But I don't know how the pop3 proxy implementation works.
>>At work I use the Oulook plugin, and at home I use the procmail filter.
>>(I mainly use Linux and try to avoid Windows if I can)
>Except I want ANY message that uses the sequence of characters,
>_INCLUDING_ spam. The unique sequence of characters _WILL_ appear in
>spam, but I still want those messages anyway. I can't have even the
>_slightest_ possibility that SpamBayes will flag those messages as spam.
>I realize that wanting spam with an anti-spam tool seems
>counter-intuitive, but I have my reasons for needing this.
>Safe C++ Design Principles (First Edition)
>Learn how to write memory leak-free, secure,
>portable, and user-friendly software.
>Learn more and view a sample chapter:
>SpamBayes at python.org
>Check the FAQ before asking: http://spambayes.sf.net/faq.html
More information about the SpamBayes