[Spambayes] Central limit
Charles Cazabon
python-spambayes@discworld.dyndns.org
Mon, 30 Sep 2002 12:49:46 -0600
Tim Peters <tim.one@comcast.net> wrote:
> It's not always clear, and in my set of 20,000 hams I'm still keeping a
> message that added a one-line comment to a quote of an entire Nigerian-scam
> spam -- that's one of the 2 false positives remaining in my corpus.
What did the quoting look like? If it wasn't a top-posting quote, then the
presence of lines beginning with "> " could be noticed by the tokenizer. Such
standard quote markers are present in roughly ham my ham, and basically none
of my spam.
Charles
--
-----------------------------------------------------------------------
Charles Cazabon <python-spambayes@discworld.dyndns.org>
GPL'ed software available at: http://www.qcc.ca/~charlesc/software/
-----------------------------------------------------------------------