[spambayes-bugs] [ spambayes-Bugs-884545 ] off-by-one ?
SourceForge.net
noreply at sourceforge.net
Sun Jan 25 23:17:18 EST 2004
Bugs item #884545, was opened at 2004-01-26 04:17
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=884545&group_id=61702
Category: imapfilter
Group: Source code 1.0a7
Status: Open
Resolution: None
Priority: 5
Submitted By: Bip (bippo312)
Assigned to: Tony Meyer (anadelonbrin)
Summary: off-by-one ?
Initial Comment:
WinXP, python 2.3.3
Whenever I train spambayes, I always (later, when
classifying) get the error that I have more spam than
total mails. I normally just do sb_dbexport -e -D
hammie.db -f out.txt
In out.txt, the top line will say that there are 202 ham
and 3342 spam... searching through the file I find the
line
header%3ATo%3A1`170`3343`
(this is the only line in the db where the # spams is
larger than the total # of spams)
I correct this line, and re-import the database. It now
works fine. Every time I train spambayes this 'header-
to' line will always be one higher than the total # of
spams.
(If I train twice, the number of spams for header-to will
be 2 larger than the max # of spams... etc)
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=884545&group_id=61702
More information about the Spambayes-bugs
mailing list