[Spambayes] Need help getting started

Guido van Rossum guido@python.org
Wed, 18 Sep 2002 23:21:55 -0400


I'd like to run experiments for Tim.  My ham corpus is over 80,000
messages, spread over hundreds of MH folders, one message per file
with numeric names.  (Don't laugh.  I've been a packrat.  And that's
only since May 2000.)  As a spam archive I've downloaded Bruce
Guenter's archives from 2000, 2001, 2002, also one message per file,
with a .txt or .lorien extension.  (Does anyone know what the .lorien
means?)

But how to turn this into Tim's standard data setup?  Do I have to
write programs to do this?  Or does something exist that I simply
didn't see in the README.txt file?  How did Tim create his ham
archive?

--Guido van Rossum (home page: http://www.python.org/~guido/)