[spambayes-dev] Understanding Classifier Code

Neil Schemenauer nas-spambayes at python.ca
Wed Feb 18 12:52:51 EST 2004


On Tue, Feb 17, 2004 at 07:35:29PM -0800, Daniel Eloff wrote:
> I've been looking at ways of increasing the speed at which the
> classifier runs.

Don't forget about Amdahl's law.  Have you profiled Spambayes and
found where most of the time is being spent?  My guess is that
tokenization is expensive.  However, I wouldn't start any optimation
effort myself until profiling since guesses are often wrong.

  Neil



More information about the spambayes-dev mailing list