[Spambayes] Adding a message database

Tim Stone - Four Stones Expressions tim at fourstonesExpressions.com
Wed Feb 26 08:10:43 EST 2003


2/26/2003 7:58:41 AM, Skip Montanaro <skip at pobox.com> wrote:

>    Mark> I would like spambayes to assist in managing a database of
>    Mark> message_ids, mapped to how they were previously trained.  
>    ... much stuff elided ...
>
>I understand what you want to do, but not why.  Can you provide some
>motivation?

Let me speak from recent experience.  I just wrote a Lotus Notes integration, 
and in the process discovered that Notes provides me with almost no facility 
to remember ANYTHING about a message.  It only provides me with a unique 
message id.  I was thus forced to implement a message database, so I could 
remember what had/had not happened to that message.  If I hadn't done that, I 
couldn't have properly retrained a message, and would have had to classify 
every message in the inbox every time I executed the filter.
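To make that concrete, here is a minimal sketch of the kind of message database I mean. It is not the actual Notes integration code: the table layout and the names open_message_db, record_classification, and unclassified are made up for illustration, and I am assuming SQLite as the store purely for convenience.

```python
import sqlite3

def open_message_db(path=":memory:"):
    """Open (or create) a tiny store keyed by the mail system's message id."""
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS messages ("
        "msg_id TEXT PRIMARY KEY, classified_as TEXT)"
    )
    return db

def record_classification(db, msg_id, verdict):
    """Remember that msg_id has been through the filter, and the verdict."""
    db.execute(
        "INSERT OR REPLACE INTO messages (msg_id, classified_as) VALUES (?, ?)",
        (msg_id, verdict),
    )
    db.commit()

def unclassified(db, inbox_ids):
    """Return only the inbox ids that have no record yet, in inbox order."""
    seen = {row[0] for row in db.execute("SELECT msg_id FROM messages")}
    return [msg_id for msg_id in inbox_ids if msg_id not in seen]
```

With something like this, each filter run only needs to score the ids returned by unclassified(), instead of the whole inbox.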

This pattern is somewhat similar to what happens in the pop3proxy.  Messages 
are given an id, and are managed by that id.  Fortunately in that case, 
information can be embedded in headers.  ***BUT*** headers may not be a good 
place in which to store that information.  In particular, how the message was 
trained is currently remembered by which subdirectory (Corpus) the message 
lives in.  This idea works for pop3proxy, but not for Notes or Outlook.

In Outlook, how a message was trained is currently remembered by what Outlook 
folder the message lives in.  But if I read the continual posts correctly, 
this is an ongoing source of aches, pains, and nausea for Mark, who has to 
handle all of the combinations of user interactions with trained mail to 
correctly untrain/retrain.

Then there's hammiefilter... etc. etc.

All of this adds up (in my mind) to a ton of code rolling around in the system 
simply to manage untrain/retrain, which could be nicely abstracted into the 
learn method, provided that information about a message could be persisted.
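As a rough sketch of that abstraction (not existing Spambayes code): a wrapper keeps a persisted map from message id to how the message was last trained, and its learn method decides whether to skip, train, or untrain and then retrain. TrainingTracker and CountingClassifier are hypothetical names, and I am assuming the underlying classifier offers learn(tokens, is_spam) and unlearn(tokens, is_spam). A plain dict stands in for the persistent store.

```python
from collections import Counter

class CountingClassifier:
    """Stand-in classifier for the demo; it just counts tokens per class."""
    def __init__(self):
        self.counts = {True: Counter(), False: Counter()}

    def learn(self, tokens, is_spam):
        self.counts[is_spam].update(tokens)

    def unlearn(self, tokens, is_spam):
        self.counts[is_spam].subtract(tokens)

class TrainingTracker:
    """Remembers how each message id was trained and retrains when needed."""
    def __init__(self, classifier, state=None):
        self.classifier = classifier
        # msg_id -> True (trained as spam) / False (trained as ham)
        self.state = {} if state is None else state

    def learn(self, msg_id, tokens, is_spam):
        tokens = list(tokens)  # may be iterated twice (unlearn, then learn)
        previous = self.state.get(msg_id)
        if previous == is_spam:
            return False       # already trained this way; nothing to do
        if previous is not None:
            # Trained the other way before: undo that first.
            self.classifier.unlearn(tokens, previous)
        self.classifier.learn(tokens, is_spam)
        self.state[msg_id] = is_spam
        return True
```

The caller (Notes, Outlook, pop3proxy, hammiefilter) would then only ever say "this id is spam" or "this id is ham", and the untrain/retrain bookkeeping lives in one place.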

We will undoubtedly encounter more mail systems that could benefit from 
Spambayes.  My Notes work is my most recent example.  I feel that we should 
provide as much facility as we can to make these integrations as easy as 
possible, and this is one such facility.

Thus endeth my apologetic <wink> - TimS


c'est moi - TimS
http://www.fourstonesExpressions.com
http://wecanstopspam.org




