[Spambayes] Ongoing Database Corruption Problems
jacob-spambayes-list at statisticalanomaly.com
jacob-spambayes-list at statisticalanomaly.com
Wed Oct 15 14:01:07 EDT 2003
Hello everyone,
I'm having a lot of trouble with what I think is database corruption.
I've included the output I get from the program before, but from what I've
read, an assertion error usually means the database is dead.
As the FAQ suggests, I've tried both Bsddb[3] and Pickle formats, but
after a few trainings, I always get this error. If I delete my databases
and start over, then I'm fine for a few additional trainings, but the same
thing happens.
I'm getting a little frusturated with this. Is there something I can do
to keep this from happening?
Thanks,
Jacob
--------------------------------------------------------------------
suslik% ./sb_imapfilter.py -v -t -c
SpamBayes IMAP Filter Beta1, version 0.1 (September 2003),
using SpamBayes IMAP Filter Web Interface Alpha2, version 0.02
and engine SpamBayes Beta2, version 0.2 (July 2003).
Loading state from hammie.db database
hammie.db is an existing database, with 344 spam and 315 ham
Loading database hammie.db... Done.
Training
Training ham folder INBOX.-Wanted
........................................................................................................................................................................................................................................................................................................
0 trained.
Training ham folder INBOX
.*........**. 3 trained.
Training spam folder INBOX.-Spam
*.......................................................................................................................................................................................................................................................................................................................................................************************************************
49 trained.
Persisting hammie.db state in database
Training took 81.9990470409 seconds, 52 messages were trained
Classifying
................*.Traceback (most recent call last):
File "./sb_imapfilter.py", line 824, in ?
run()
File "./sb_imapfilter.py", line 814, in run
imap_filter.Filter()
File "./sb_imapfilter.py", line 675, in Filter
self.unsure_folder)
File "./sb_imapfilter.py", line 594, in Filter
evidence=True)
File "/u/jpfarmer/lib/python2.3/site-packages/spambayes/classifier.py",
line 158, in chi2_spamprob
clues = self._getclues(wordstream)
File "/u/jpfarmer/lib/python2.3/site-packages/spambayes/classifier.py",
line 395, in _getclues
prob = self.probability(record)
File "/u/jpfarmer/lib/python2.3/site-packages/spambayes/classifier.py",
line 245, in probability
assert spamcount <= nspam
AssertionError
More information about the Spambayes
mailing list