[Spambayes] [ spambayes-Patches-639122 ] hammie: ignore emails older than n days

SourceForge.net noreply at sourceforge.net
Fri Jan 24 14:29:25 EST 2003


Patches item #639122, was opened at 2002-11-15 15:47
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=498105&aid=639122&group_id=61702

Category: None
Group: None
Status: Open
Resolution: Later
Priority: 5
Submitted By: Jason Hildebrand (jdhildeb)
Assigned to: Neale Pickett (npickett)
Summary: hammie: ignore emails older than n days

Initial Comment:
Since your documentation stresses the importance of
training using only relatively recent emails, I thought
a good way to do this would be to have hammie do it for me.

So I added a new configuration option:

[Hammie]
# when training, hammie will ignore messages older than
this number of days.
# i.e. set to 365 to ignore messages older than one year.
# Set to 0 to disable any filtering by date.
ignore_old_messages: 0

The patch also modifies Hammie to output the number of
messages it read/ignored for each mail file it processes.

This option might also prove useful for doing
incremental training (i.e. set up cron to train once a
week, and set ignore_old_messages to 7).


----------------------------------------------------------------------

>Comment By: Jason Hildebrand (jdhildeb)
Date: 2003-01-24 16:29

Message:
Logged In: YES 
user_id=173690

Unfortunately, I haven't had time to update to a more recent
spambayes; I'm still using a version from last november. 
Since this version is working well for me, I'm not terribly
interested in messing with it -- since I know things have
changed considerably in CVS since then.   So I'm in a poor
position to judge whether the functionality mboxtrain.py
offers is "good enough" -- I'll have to leave it up to
others to comment on.

----------------------------------------------------------------------

Comment By: T. Alexander Popiel (popiel)
Date: 2003-01-23 12:42

Message:
Logged In: YES 
user_id=632302

Parsing the topmost received header for the date is a very
valuable tool for maintaining limited database size.  It's a
key feature of my bulkgraph.py script (over and above
dealing with my non-standard everything vs. spam folders). 
Count this as another vote to include such filtering... even
though my peculiar folder setup precludes me from using
mboxtrain.

----------------------------------------------------------------------

Comment By: Neale Pickett (npickett)
Date: 2003-01-22 23:01

Message:
Logged In: YES 
user_id=619391

Jason, does the current mboxtrain.py script do enough of
this functionality for you, or would you still like to see
us work by the Recieved header?  I suspect it might be good
enough...

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=498105&aid=639122&group_id=61702



More information about the Spambayes mailing list