[Spambayes] Interesting behaviour from the Outlook client
Mark Hammond
mhammond at skippinet.com.au
Fri Dec 6 07:17:26 2002
> Over the past few days, I've been seeing an increase in FNs and
> Unsures. I initially trained on my inbox and spam folders (386
> ham, 999 spam), and since then I've trained on errors only. I'm
> now at 391 ham and 1011 spam. Initially, I was getting no errors,
> and 1 or 2 unsures per day. Now, I'm starting to get at least 1
> FN per day, and a slight increase in the unsure rate.
I think something is broken, though I'm not sure what :(
I am seeing bizarre behaviour that I can't explain, and don't even know how
to describe properly :( E.g., I recently saw a clear spam scored at 3%. The
spam clues showed:
word            spamprob    #ham    #spam
...
'card-swipe'    0.123921    2       0
'cash-only'     0.123921    2       0
There were still lots of obvious spam clues in the list (i.e., not everything
was screwed up). However, I was certain these two words don't appear in any
of my ham, so I did a full re-train. After that, they were correctly
identified as appearing only in spam (i.e., not in ham), and the spam got a
solid 100%. Interestingly, I had done a full retrain very recently before
this.
I suspect incremental retraining is broken, but I haven't looked very far.
I'm just throwing this out as speculation that there may be a subtle bug in
the training code, rather than in the algorithm or in the options that
control it.
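To illustrate the kind of bug this speculation points at, here is a toy
sketch in Python. It is not the Spambayes code: ToyCounts, move_to_spam and
the skip_retrain flag are all made up for illustration, and the assumed
failure (a message that was once counted as ham is moved to the spam folder
but never untrained and retrained) is only one way the counts could go stale.

```python
from collections import Counter


class ToyCounts:
    """Hypothetical per-token ham/spam message counts (not the real database)."""

    def __init__(self):
        self.ham = Counter()
        self.spam = Counter()

    def learn(self, tokens, is_spam):
        # Count each distinct token once per message.
        target = self.spam if is_spam else self.ham
        for tok in set(tokens):
            target[tok] += 1

    def unlearn(self, tokens, is_spam):
        # Reverse a previous learn() for the same message.
        target = self.spam if is_spam else self.ham
        for tok in set(tokens):
            target[tok] -= 1
            if target[tok] <= 0:
                del target[tok]

    def clue(self, tok):
        # Returns (#ham, #spam) for a token, as in the clue listing.
        return (self.ham[tok], self.spam[tok])


def move_to_spam(db, tokens, skip_retrain):
    """Incremental correction for a message wrongly counted as ham.

    skip_retrain=True models the assumed bug: the move is never
    reflected in the counts.
    """
    if skip_retrain:
        return
    db.unlearn(tokens, is_spam=False)
    db.learn(tokens, is_spam=True)


def full_retrain(ham_msgs, spam_msgs):
    # Throw the counts away and rebuild them from the folders.
    db = ToyCounts()
    for msg in ham_msgs:
        db.learn(msg, is_spam=False)
    for msg in spam_msgs:
        db.learn(msg, is_spam=True)
    return db


# Two made-up spam messages that were (by assumption) first counted as ham.
spam_msgs = [
    ["card-swipe", "cash-only", "offer"],
    ["card-swipe", "cash-only", "free"],
]


def run_incremental(skip_retrain):
    db = ToyCounts()
    for msg in spam_msgs:
        db.learn(msg, is_spam=False)
    for msg in spam_msgs:
        move_to_spam(db, msg, skip_retrain)
    return db


stale = run_incremental(skip_retrain=True)
fixed = run_incremental(skip_retrain=False)
rebuilt = full_retrain([], spam_msgs)

print(stale.clue("card-swipe"))    # (2, 0): stale ham counts survive
print(fixed.clue("card-swipe"))    # (0, 2)
print(rebuilt.clue("card-swipe"))  # (0, 2): full retrain hides the bug
```

In this toy model the broken incremental path leaves 'card-swipe' at 2 ham
and 0 spam, while a full retrain from the folders reports it as spam-only,
which is the same pattern as in the clue listing. Whether the real cause is
anything like this is still a guess.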
Mark.