[Spambayes] Two Scheme Enter, One Scheme Leave.
Anthony Baxter
anthony@ekit-inc.com
Thu, 26 Sep 2002 14:49:36 +1000
---------------------- multipart/mixed attachment
My normal email's off the air at the moment, but here's some more data,
plus a dinky little hack.
I wrote a quickie script to trawl through the full test*.txt output and
try to find the best results overall. It's attached. For me, when run
against all the Robinson tests, I get:
best results
got 9fp + 21fn = 30 tot, at a:0.100 x:0.500 min: 0.080 cutoff: 60.000
got 9fp + 21fn = 30 tot, at a:0.100 x:0.500 min: 0.090 cutoff: 60.000
got 9fp + 21fn = 30 tot, at a:0.100 x:0.500 min: 0.100 cutoff: 60.000
got 8fp + 23fn = 31 tot, at a:0.100 x:0.500 min: 0.050 cutoff: 60.000
got 9fp + 22fn = 31 tot, at a:0.200 x:0.500 min: 0.100 cutoff: 60.000
got 9fp + 22fn = 31 tot, at a:0.250 x:0.500 min: 0.100 cutoff: 60.000
got 10fp + 21fn = 31 tot, at a:0.075 x:0.500 min: 0.100 cutoff: 60.000
got 10fp + 21fn = 31 tot, at a:0.125 x:0.500 min: 0.100 cutoff: 60.000
got 10fp + 21fn = 31 tot, at a:0.150 x:0.500 min: 0.100 cutoff: 60.000
got 10fp + 21fn = 31 tot, at a:0.350 x:0.500 min: 0.100 cutoff: 60.000
got 12fp + 19fn = 31 tot, at a:0.100 x:0.600 min: 0.100 cutoff: 60.000
got 10fp + 22fn = 32 tot, at a:0.450 x:0.500 min: 0.100 cutoff: 60.000
got 11fp + 21fn = 32 tot, at a:0.500 x:0.500 min: 0.100 cutoff: 60.000
got 12fp + 20fn = 32 tot, at a:0.100 x:0.500 min: 0.110 cutoff: 60.000
got 12fp + 20fn = 32 tot, at a:0.100 x:0.500 min: 0.120 cutoff: 60.000
got 13fp + 19fn = 32 tot, at a:0.100 x:0.500 min: 0.150 cutoff: 60.000
got 8fp + 25fn = 33 tot, at a:0.100 x:0.400 min: 0.100 cutoff: 60.000
got 12fp + 24fn = 36 tot, at a:0.025 x:0.500 min: 0.100 cutoff: 60.000
got 20fp + 16fn = 36 tot, at a:0.100 x:0.400 min: 0.100 cutoff: 57.500
got 22fp + 14fn = 36 tot, at a:0.010 x:0.500 min: 0.100 cutoff: 57.500
got 23fp + 13fn = 36 tot, at a:0.025 x:0.500 min: 0.100 cutoff: 57.500
got 9fp + 28fn = 37 tot, at a:0.100 x:0.500 min: 0.150 cutoff: 62.500
Anyway, the script's attached. It could do with being generalised (in
some way... not sure what's best), but hopefully it's useful.
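
For anyone reading without the attachment, here is a minimal sketch of the
ranking step. It is not the attached spamnalysis.py: it assumes only the
summary-line format shown in the listing above, and the names LINE_RE,
parse_result and rank_results are made up for illustration. Sorting by
total errors and then by false positives is an assumption that happens to
be consistent with the order of that listing.

```python
import re

# Matches summary lines of the form shown in the listing, e.g.
#   got 9fp + 21fn = 30 tot, at a:0.100 x:0.500 min: 0.080 cutoff: 60.000
LINE_RE = re.compile(
    r"got\s+(\d+)fp\s*\+\s*(\d+)fn\s*=\s*(\d+)\s*tot,\s*at\s+"
    r"a:\s*([\d.]+)\s+x:\s*([\d.]+)\s+min:\s*([\d.]+)\s+cutoff:\s*([\d.]+)"
)


def parse_result(line):
    """Turn one summary line into a dict, or return None if it doesn't match."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    groups = m.groups()
    fp, fn, tot = (int(g) for g in groups[:3])
    a, x, min_dev, cutoff = (float(g) for g in groups[3:])
    return {"fp": fp, "fn": fn, "tot": tot,
            "a": a, "x": x, "min": min_dev, "cutoff": cutoff}


def rank_results(lines):
    """Parse all matching lines and sort by total errors, then by fp count."""
    results = [r for r in map(parse_result, lines) if r is not None]
    return sorted(results, key=lambda r: (r["tot"], r["fp"]))
```

Feeding it the lines of a saved listing, rank_results(open(path)) returns
the parsed records with the lowest-error parameter settings first.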
You could easily enough pull the data out of the Storage class at the
end and plot it with your favourite graphing tool that's capable of
plotting 3 data sets on a 4-dimensional graph...
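
One way to get something plottable in fewer dimensions is to vary a single
parameter and hold the others fixed. The sketch below assumes the results
are available as plain dicts rather than through the Storage class in the
attachment; series_for is a made-up helper name, and the sample records are
copied from the listing above.

```python
import math


def series_for(results, vary, fixed):
    """Return sorted (value, fp, fn) triples for the parameter named by
    `vary`, keeping only records whose other parameters match `fixed`."""
    points = []
    for r in results:
        # isclose avoids surprises from comparing floats with ==
        if all(math.isclose(r[k], v) for k, v in fixed.items()):
            points.append((r[vary], r["fp"], r["fn"]))
    return sorted(points)


# A few records taken from the listing above.
results = [
    {"fp": 9, "fn": 21, "a": 0.100, "x": 0.5, "min": 0.100, "cutoff": 60.0},
    {"fp": 10, "fn": 21, "a": 0.075, "x": 0.5, "min": 0.100, "cutoff": 60.0},
    {"fp": 9, "fn": 22, "a": 0.200, "x": 0.5, "min": 0.100, "cutoff": 60.0},
    {"fp": 12, "fn": 19, "a": 0.100, "x": 0.6, "min": 0.100, "cutoff": 60.0},
]

# fp and fn as a function of a, with x, min and cutoff held fixed.
a_series = series_for(results, "a", {"x": 0.5, "min": 0.1, "cutoff": 60.0})
```

Each triple can then go to whatever plotting tool you like as two curves
(fp and fn) against the varied parameter.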
Anthony
---------------------- multipart/mixed attachment
A non-text attachment was scrubbed...
Name: not available
Type: text/x-python
Size: 3613 bytes
Desc: spamnalysis.py
Url : http://mail.python.org/pipermail-21/spambayes/attachments/20020926/570601f0/attachment.py
---------------------- multipart/mixed attachment--