[Spambayes] lots of unsures, heavily biased towards spam
David Abrahams
dave at boost-consulting.com
Mon Feb 5 17:35:33 CET 2007
skip at pobox.com writes:
> Dave> I know that; I meant the technical implications. In particular, I
> Dave> asked:
>
> >> > I know spambayes keeps a database; when I delete already-trained
> >> > emails from my xxx-training folders does it forget everything about
> >> > those messages and rebuild the database using the other messages as
> >> > though from scratch, or is some of the information about those
> >> > deleted messages retained?
>
> Sorry. Yes, when a message is "untrained" the database forgets about it
> completely (message count is decremented, all token counts adjusted
> down).
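If I read that right, the bookkeeping amounts to something like the toy sketch below. This is my own illustration, not SpamBayes's actual classifier code: the class ToyTokenDB and its method names are made up, and I am assuming each distinct token is counted once per message.

```python
from collections import Counter


class ToyTokenDB:
    """Hypothetical token-count store; NOT the real SpamBayes API."""

    def __init__(self):
        self.nham = 0                  # number of ham messages trained
        self.nspam = 0                 # number of spam messages trained
        self.ham_counts = Counter()    # token -> ham messages containing it
        self.spam_counts = Counter()   # token -> spam messages containing it

    def train(self, tokens, is_spam):
        counts = self.spam_counts if is_spam else self.ham_counts
        for tok in set(tokens):        # assumption: one count per message
            counts[tok] += 1
        if is_spam:
            self.nspam += 1
        else:
            self.nham += 1

    def untrain(self, tokens, is_spam):
        """Exactly undo an earlier train() call for the same message."""
        counts = self.spam_counts if is_spam else self.ham_counts
        for tok in set(tokens):
            counts[tok] -= 1           # adjust the token count down
            if counts[tok] <= 0:
                del counts[tok]        # drop tokens no message uses any more
        if is_spam:
            self.nspam -= 1            # decrement the message count
        else:
            self.nham -= 1
```

Under those assumptions, training on two messages and then untraining one leaves the same state as training on the other message alone, which is what "forgets about it completely" would mean in practice.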
And deleting a message from the training set will cause it to be
"untrained" at the next training?
>
> >> I know this won't help you with the imap filter, however...
>
> Dave> Why not?
>
> I don't use the imap filter. I presume it has its own way of managing your
> hams and spams.
It doesn't sound different to me. There's a ham training mailbox and
a spam training mailbox.
> >> I use the train-to-exhaustion script in the contrib directory which
> >> helps keep my ham:spam ratio tractable....
>
> Dave> Can that procedure be applied to my IMAP folders?
>
> In theory I suppose it can. The ham and spam mailboxes are opened using
> mboxutils.getmbox() which presumes to be fairly format agnostic but I don't
> know if it will work with IMAP folders. I see no mention of IMAP at all in
> mboxutils.py. I suppose this might be a case of "patches cheerfully
> accepted". :-)
:/
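For anyone tempted to write that patch, a possible starting point is to read the IMAP training folders with the standard library's imaplib and hand the resulting message objects to whatever does the training. The sketch below is only that: iter_imap_messages is a hypothetical helper of mine, not part of mboxutils.py, and the host, credentials, and folder name in the usage comment are placeholders.

```python
import email
import imaplib


def iter_imap_messages(conn, folder):
    """Yield each message in an IMAP folder as an email.message.Message.

    `conn` is an already logged-in imaplib.IMAP4 or IMAP4_SSL object.
    Hypothetical helper for illustration; not part of SpamBayes.
    """
    # Open read-only so that fetching does not mark messages as seen.
    status, _ = conn.select(folder, readonly=True)
    if status != "OK":
        raise RuntimeError(f"cannot select folder {folder!r}")

    status, data = conn.search(None, "ALL")
    if status != "OK":
        raise RuntimeError(f"search failed in folder {folder!r}")

    # data[0] is a space-separated list of message sequence numbers.
    for num in data[0].split():
        status, parts = conn.fetch(num, "(RFC822)")
        if status != "OK":
            continue  # skip messages that cannot be fetched
        # parts[0] is an (envelope, raw_message_bytes) tuple.
        yield email.message_from_bytes(parts[0][1])


# Usage (placeholder host, credentials, and folder name):
#   conn = imaplib.IMAP4_SSL("imap.example.org")
#   conn.login("user", "password")
#   for msg in iter_imap_messages(conn, "ham-training"):
#       print(msg["Subject"])
#   conn.logout()
```

Whether the train-to-exhaustion script could consume such an iterator in place of what mboxutils.getmbox() returns is something I have not checked.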
--
Dave Abrahams
Boost Consulting
www.boost-consulting.com