20 Aug
2002
20 Aug
'02
2:51 p.m.
[Skip Montanaro]
Anybody up for pooling corpi (corpora?)?
Barry is collecting clean data from mailing-list archives for lists hosted at python.org. It's unclear that this will be useful for anything other than mailing lists hosted at python.org (which I expect have a lot of topic commonality). There's a lovely spam archive here: http://www.em.ca/~bruceg/spam/