[Spambayes] FYI: Java implementation

Richard Jowsey richard at jowsey.com
Wed Jan 15 12:20:43 EST 2003


Hi all,

I've been building a Java implementation of Paul Graham's 
"Bayesian" classification logic over the past couple months, 
intended as a plug-in filter for the Apache JAMES mail server. 

However, after considerable testing, tweaking and tuning via a 
proxy setup (similar to POPFile), plus some recent lurking on 
the Spambayes list, I'm now modifying this project to 
incorporate the excellent notions contributed by Gary Robinson, 
et al, as implemented in your Python code.

Early results are *very* promising!!! This death2spam stuff is 
definitely heading in the right direction! I haven't quite 
finished the chi2 comparison logic, but even using just "gary-
combining", the kinds of messages ending up in my "uncertain" 
category make much more sense. Plus I'm now seeing far less 
weirdness caused by Graham's "2 * nGood + nSpam >= 5" trick, 
etc. Will keep the list posted as to further progress.

I'd sure love to attend the upcoming spam-fest at MIT, but we 
moved downunder (Seattle -> Sydney) last year, and it's one 
helluva long way to go just for a day...

Many thanks for all your fine coding, testing efforts, and 
thoughtful conversations! It's been very helpful, not to mention 
highly entertaining at times.  ;-)

Cheers,
Richard




More information about the Spambayes mailing list