[spambayes-dev] Speedup for full retrain when using DB dict

Skip Montanaro skip at pobox.com
Fri Sep 5 10:53:48 EDT 2003


    >> indicated that I had a patch which might speed up full retrains when
    >> using a shelve database.  I'm happy to say it works well for me.  The
    >> test I ran essentially executed

    Tim> Wouldn't it be simpler to do the full retrain using a
    Tim> PickledClassifier instance, then populate a DBDictClassifier from
    Tim> the result?  That would also skip the extra layers of code (and
    Tim> time) to maintain the changed_words dict during the retrain.

Perhaps.  Are you suggesting I detect the zero-length shelve object before
instantiating a classifier or instantiating a PickledClassifier from within
the DBDictClassifier code?

Skip




More information about the spambayes-dev mailing list