[Spambayes] FYI: Java implementation
richard at jowsey.com
Wed Jan 15 12:20:43 EST 2003
I've been building a Java implementation of Paul Graham's
"Bayesian" classification logic over the past couple months,
intended as a plug-in filter for the Apache JAMES mail server.
However, after considerable testing, tweaking and tuning via a
proxy setup (similar to POPFile), plus some recent lurking on
the Spambayes list, I'm now modifying this project to
incorporate the excellent notions contributed by Gary Robinson,
et al, as implemented in your Python code.
Early results are *very* promising!!! This death2spam stuff is
definitely heading in the right direction! I haven't quite
finished the chi2 comparison logic, but even using just "gary-
combining", the kinds of messages ending up in my "uncertain"
category make much more sense. Plus I'm now seeing far less
weirdness caused by Graham's "2 * nGood + nSpam >= 5" trick,
etc. Will keep the list posted as to further progress.
I'd sure love to attend the upcoming spam-fest at MIT, but we
moved downunder (Seattle -> Sydney) last year, and it's one
helluva long way to go just for a day...
Many thanks for all your fine coding, testing efforts, and
thoughtful conversations! It's been very helpful, not to mention
highly entertaining at times. ;-)
More information about the Spambayes