[spambayes-dev] Re: spambayes-dev Digest, Vol 12, Issue 17

Thomas Juntunen juntunen at well.com
Sun Apr 18 15:23:33 EDT 2004


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 04/18/04, Seth Goodman imposed order on a stream of electrons to say:

>I can think of several issues applying PC analysis to a text message instead
>of a signal stream.  Since a text message can be parsed in different ways to
>create a signal to do the Eigendecomposition on, results will depend on
>whether you treat it as a bit stream, a character stream (with what
>character length?) or a token stream (tokenized how?).  It would also be
>possible to treat the SpamAssassin results as tokens and use only those to
>create a token stream.

So far as I understand it, Dr. Sullivan didn't analyze the message text or headers themselves, he looked at which SpamAssassin rules were triggered over time. So the triggered rules are the vectors in this case.


Thomas Juntunen

-----BEGIN PGP SIGNATURE-----
Version: PGP SDK 3.0

iQA/AwUBQILHo9Foei/9T3YdEQKgNwCg6cT33IzOO5zXawXu8Bsdh14HJ2QAn3dW
xAl1gEdAFiWxQP8z9dVgVdZ/
=q7r9
-----END PGP SIGNATURE-----



More information about the spambayes-dev mailing list