dave at boost-consulting.com
Sat Dec 27 07:08:32 EST 2003
"Tim Peters" <tim.one at comcast.net> writes:
> [David Abrahams]
>> I keep getting quite a few spams which fit the descriptions below
>> (from NEWTRICS.txt):
> I'm sure everyone gets them, the interesting question is whether they're
> evading your spambayes filter.
They are showing up as Unsure; I wouldn't see them otherwise.
> They don't seem to give mine particular trouble (of course I train
> on those that score Unsure;
> I'm not sure I've ever seen one score as Ham).
>> ... [descriptions of attempted obfuscation via insertion of
>> punctuation, and replacing letters by digits] ...
>> Since "this file is for ideas that have or have not yet been tried",
>> I'd love to know what constitutes "trying". Is there some official
>> testing procedure or corpus we can test against? I'd like to know
>> whether any change I make is worth proposing. Of course I can try it
>> on my own databases of Ham and Spam first...
> There's no official corpus, else we'd be teaching the system to recognize
> that corpus. Alex gave the right pointers to docs for the testing
Thanks. We'll see if my Christmas downtime lasts long enough for me
to be able to try that ;-)
More information about the spambayes-dev