[Spambayes] FYI: Java implementation
richard at jowsey.com
Mon Jan 20 06:38:26 EST 2003
> Upgrade to Python and you would have finished a couple months ago
Yeah, that thought had occurred to me too... <grin>
> [chi-combining] This gives it some nice
> properties for automated decision making (the cutoff points for
> gary-combining were too touchy, across test sets, and across
> time). But if you like a mode where you simply sort msgs by
> score, you can stop with gary-combining and be happy.
I have a very large training corpus, so I'm seeing well-
separated distributions of good versus spam probs, with a
sprinkling of "unsures" scattered through the middle. An
uncertain cutoff at 3 sigma from the means should work, but this
notion needs some testing. That chi2 test is definitely on the
drawing boards, even if only for comparison purposes...
Death To Spam!
More information about the Spambayes