[Spambayes] Outlook weirdness
Sat Nov 23 21:49:10 2002
[Sean True, on using a database]
> Slower *training* would be an issue, however.
For bulk training, but one-at-a-time training would be much faster (no need
for update_probabilities() at the end, which computes a new value for every
word in the database). Bulk training could be taught to use a new
classifier based on an in-memory dict. When that's done, the in-memory
dict's ham and spam counts would be added into the persistent DB (rewriting
only those WordInfo records corresponding to words that appeared in the bulk
training data), and then the in-memory dict could be thrown away.
More information about the Spambayes