[Spambayes] Outlook addin using bsddb
m2 at plusseven.com
Fri Feb 7 11:55:44 EST 2003
On Fri, Feb 07, 2003 at 04:46:30PM +1100, Mark Hammond wrote:
> [Resending - appears to have hit the bit-bucket!]
> I have just checked in a change to the Outlook plugin that will use a bsddb
> style database if a reasonable implementation can be found. Currently, a
> "reasonable" implementation means:
> * A bsddb3 module can be imported.
Maybe you won't need bsddb3. My tests indicate that dumbdbm is faster
than bsddb3. And dumbdbm is all Python and included in the standard
library. I was quite surprised by this and put a mail out here to see
if anyone could corroborate it, but it didn't seem to spark any interest.
Here's what I wrote.
alex> I moved from spamcan to spambayes today and wasted a couple
alex> hours profiling hammie.py
alex> <snip> profile.run("spambayes.hammiebulk.main()",
alex> '/tmp/stats') <snip>
alex> I ran this on approximately 2000 messages and aggregated the
alex> stats. The entire run was 496 CPU seconds.
alex> When looking at the profiling information I realized that I
alex> was using dumbdbm, which is supposed to be very slow. I
alex> installed bsddb3, rebuilt my db and reran the profiling.
alex> The entire run was now 520 CPU seconds, a 4.8% increase.
alex> So it seems like "stupid beats smart" goes for speed
alex> optimizations too.
alex> Can anyone corroborate this?
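
For anyone who wants to try corroborating this, here is a minimal sketch of the kind of measurement described above. It is not the hammiebulk run itself. It assumes Python 3, where dumbdbm lives in the standard library as dbm.dumb, and it uses cProfile with time.process_time as the timer so that the total is CPU seconds. The names workload and profile_workload, the record count, and the key format are all made up for illustration. To compare against another backend, the same workload would have to be rerun with that backend's open function swapped in.

```python
import cProfile
import dbm.dumb
import os
import pstats
import tempfile
import time


def workload(path, n=2000):
    """Write n small records through dbm.dumb, then look each one up.

    Hypothetical stand-in for a real training/scoring run.
    """
    db = dbm.dumb.open(path, "c")
    try:
        for i in range(n):
            db[f"word{i}".encode()] = str(i).encode()
        hits = sum(1 for i in range(n) if f"word{i}".encode() in db)
    finally:
        db.close()
    return hits


def profile_workload(n=2000):
    """Return (records found, total profiled CPU seconds)."""
    with tempfile.TemporaryDirectory() as tmp:
        # process_time counts CPU time rather than wall-clock time.
        profiler = cProfile.Profile(time.process_time)
        hits = profiler.runcall(workload, os.path.join(tmp, "testdb"), n)
        stats = pstats.Stats(profiler)
    return hits, stats.total_tt


if __name__ == "__main__":
    found, seconds = profile_workload()
    print(f"{found} records found, {seconds:.3f} CPU seconds profiled")
```

A single run like this is noisy, so a difference of a few percent between backends would need several repetitions before it means much.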