[Spambayes] Ongoing Database Corruption Problems

jacob-spambayes-list at statisticalanomaly.com jacob-spambayes-list at statisticalanomaly.com
Wed Oct 15 14:01:07 EDT 2003


Hello everyone,

I'm having a lot of trouble with what I think is database corruption.
I've included the output I get from the program before, but from what I've
read, an assertion error usually means the database is dead.

As the FAQ suggests, I've tried both Bsddb[3] and Pickle formats, but
after a few trainings, I always get this error.  If I delete my databases
and start over, then I'm fine for a few additional trainings, but the same
thing happens.

I'm getting a little frusturated with this.  Is there something I can do
to keep this from happening?

Thanks,
Jacob

--------------------------------------------------------------------
suslik% ./sb_imapfilter.py -v -t -c
SpamBayes IMAP Filter Beta1, version 0.1 (September 2003),
using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02
and engine SpamBayes Beta2, version 0.2 (July 2003).

Loading state from hammie.db database
hammie.db is an existing database, with 344 spam and 315 ham
Loading database hammie.db... Done.
Training
   Training ham folder INBOX.-Wanted
........................................................................................................................................................................................................................................................................................................
      0 trained.
   Training ham folder INBOX
.*........**.       3 trained.
   Training spam folder INBOX.-Spam
*.......................................................................................................................................................................................................................................................................................................................................................************************************************
      49 trained.
Persisting hammie.db state in database
Training took 81.9990470409 seconds, 52 messages were trained
Classifying
................*.Traceback (most recent call last):
  File "./sb_imapfilter.py", line 824, in ?
    run()
  File "./sb_imapfilter.py", line 814, in run
    imap_filter.Filter()
  File "./sb_imapfilter.py", line 675, in Filter
    self.unsure_folder)
  File "./sb_imapfilter.py", line 594, in Filter
    evidence=True)
  File "/u/jpfarmer/lib/python2.3/site-packages/spambayes/classifier.py",
line 158, in chi2_spamprob
    clues = self._getclues(wordstream)
  File "/u/jpfarmer/lib/python2.3/site-packages/spambayes/classifier.py",
line 395, in _getclues
    prob = self.probability(record)
  File "/u/jpfarmer/lib/python2.3/site-packages/spambayes/classifier.py",
line 245, in probability
    assert spamcount <= nspam
AssertionError






More information about the Spambayes mailing list