[Spambayes] Central limit

Charles Cazabon python-spambayes@discworld.dyndns.org
Mon, 30 Sep 2002 12:49:46 -0600


Tim Peters <tim.one@comcast.net> wrote:

> It's not always clear, and in my set of 20,000 hams I'm still keeping a
> message that added a one-line comment to a quote of an entire Nigerian-scam
> spam -- that's one of the 2 false positives remaining in my corpus.

What did the quoting look like?  If it wasn't a top-posting quote, then the
presence of lines beginning with "> " could be noticed by the tokenizer.  Such
standard quote markers are present in roughly half my ham, and basically none
of my spam.
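
A rough sketch of what I mean, not anything the Spambayes tokenizer does
today as far as I know.  The function name and the "quote:" token strings
are made up for illustration; it treats any line starting with ">" as quoted
(which also catches nested ">>" quoting) and reports the quoted share of
non-blank lines in quarters.

```python
def quote_marker_tokens(body):
    """Return synthetic tokens describing ">"-quoted lines in a message body.

    Hypothetical helper: the token names are illustrative assumptions,
    not tokens the real tokenizer is known to emit.
    """
    # Ignore blank lines so padding does not dilute the quoted share.
    lines = [line for line in body.splitlines() if line.strip()]
    if not lines:
        return []

    # Count lines that begin with a standard quote marker.
    quoted = sum(1 for line in lines if line.startswith(">"))
    if quoted == 0:
        return ["quote:none"]

    # Bucket the quoted share into quarters: 0 (under 25%) up to 4 (all quoted).
    bucket = (4 * quoted) // len(lines)
    return ["quote:present", "quote:quarters:%d" % bucket]


if __name__ == "__main__":
    sample = "> first quoted line\n> second quoted line\nmy one-line comment\n"
    print(quote_marker_tokens(sample))
```

The idea is that a ham which quotes an entire spam would still carry a
"quote:present" token that the classifier could learn from, alongside the
spammy words in the quoted text.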

Charles
-- 
-----------------------------------------------------------------------
Charles Cazabon                 <python-spambayes@discworld.dyndns.org>
GPL'ed software available at:     http://www.qcc.ca/~charlesc/software/
-----------------------------------------------------------------------