[Spambayes] Two Scheme Enter, One Scheme Leave.

Anthony Baxter anthony@ekit-inc.com
Thu, 26 Sep 2002 14:49:36 +1000


---------------------- multipart/mixed attachment
My normal email's off the air at the moment, but here's some more data, 
plus a dinky little hack.

I wrote a quickie script to trawl through the test*.txt full output and
try and find the best results overall. It's attached. For me, when run
against all the robinson tests, I get:

best results
got   9fp +  21fn =  30 tot, at a:0.100 x:0.500 min: 0.080 cutoff: 60.000
got   9fp +  21fn =  30 tot, at a:0.100 x:0.500 min: 0.090 cutoff: 60.000
got   9fp +  21fn =  30 tot, at a:0.100 x:0.500 min: 0.100 cutoff: 60.000
got   8fp +  23fn =  31 tot, at a:0.100 x:0.500 min: 0.050 cutoff: 60.000
got   9fp +  22fn =  31 tot, at a:0.200 x:0.500 min: 0.100 cutoff: 60.000
got   9fp +  22fn =  31 tot, at a:0.250 x:0.500 min: 0.100 cutoff: 60.000
got  10fp +  21fn =  31 tot, at a:0.075 x:0.500 min: 0.100 cutoff: 60.000
got  10fp +  21fn =  31 tot, at a:0.125 x:0.500 min: 0.100 cutoff: 60.000
got  10fp +  21fn =  31 tot, at a:0.150 x:0.500 min: 0.100 cutoff: 60.000
got  10fp +  21fn =  31 tot, at a:0.350 x:0.500 min: 0.100 cutoff: 60.000
got  12fp +  19fn =  31 tot, at a:0.100 x:0.600 min: 0.100 cutoff: 60.000
got  10fp +  22fn =  32 tot, at a:0.450 x:0.500 min: 0.100 cutoff: 60.000
got  11fp +  21fn =  32 tot, at a:0.500 x:0.500 min: 0.100 cutoff: 60.000
got  12fp +  20fn =  32 tot, at a:0.100 x:0.500 min: 0.110 cutoff: 60.000
got  12fp +  20fn =  32 tot, at a:0.100 x:0.500 min: 0.120 cutoff: 60.000
got  13fp +  19fn =  32 tot, at a:0.100 x:0.500 min: 0.150 cutoff: 60.000
got   8fp +  25fn =  33 tot, at a:0.100 x:0.400 min: 0.100 cutoff: 60.000
got  12fp +  24fn =  36 tot, at a:0.025 x:0.500 min: 0.100 cutoff: 60.000
got  20fp +  16fn =  36 tot, at a:0.100 x:0.400 min: 0.100 cutoff: 57.500
got  22fp +  14fn =  36 tot, at a:0.010 x:0.500 min: 0.100 cutoff: 57.500
got  23fp +  13fn =  36 tot, at a:0.025 x:0.500 min: 0.100 cutoff: 57.500
got   9fp +  28fn =  37 tot, at a:0.100 x:0.500 min: 0.150 cutoff: 62.500

Anyway, the script's attached. It could do with being generalised (in
some way... not sure what's best), but hopefully it's useful.

You could easily enough pull the data out of the Storage class at the
end and plot it with your favourite graphing tool that's capable of 
plotting 3 data sets on a 4-dimensional graph...

Anthony


---------------------- multipart/mixed attachment
A non-text attachment was scrubbed...
Name: not available
Type: text/x-python
Size: 3613 bytes
Desc: spamnalysis.py
Url : http://mail.python.org/pipermail-21/spambayes/attachments/20020926/570601f0/attachment.py

---------------------- multipart/mixed attachment--