[Spambayes] Client/server model
Guido van Rossum
guido@python.org
Thu Oct 17 20:19:59 2002
> > > I'd want the server to do tokenization for consistency reasons.
> > > Particularly if you are also spam filtering news articles and not
> > > just e-mail messages.
> >
> > I don't understand this.
>
> So that everybody tokenizes the incoming messages in the same way,
> particularly the same way as that used earlier during training.
The hammie-client approach has a separate client program that's
invoked each time, and that takes care of the uniform parsing.
> Also, I'd have the server keep track of spam from other sources,
> such as UseNet news. Is there anywere else where spam messages
> show up that might need to be included, or is it just mail and
> news?
Not that I know of.
--Guido van Rossum (home page: http://www.python.org/~guido/)