[Python-Dev] RE: [Python-checkins] python/nondist/sandbox/spambayes GBayes.py,1.7,1.8
Tim Peters
tim.one@comcast.net
Tue, 20 Aug 2002 17:51:02 -0400
[Skip Montanaro]
> Anybody up for pooling corpi (corpora?)?
Barry is collecting clean data from mailing-list archives for lists hosted
at python.org. It's unclear whether this will be useful for anything other
than mailing lists hosted at python.org (which I expect have a lot of topic
commonality).
There's a lovely spam archive here:
http://www.em.ca/~bruceg/spam/