[Spambayes] There Can Be Only One
Tim Peters
tim.one@comcast.net
Wed, 25 Sep 2002 19:45:54 -0400
Inspired by Neil and Anthony, I tried
robinson_probability_a: 0.225
in my test case. It made no difference to results, but separated the ham
and spam means at the cost of increasing their variances (a=0.5 vs a=0.225
here):
ham mean ham sdev
33.01 32.00 -3.06% 6.26 6.39 +2.08%
32.19 31.14 -3.26% 5.38 5.51 +2.42%
32.99 31.96 -3.12% 5.60 5.71 +1.96%
33.46 32.46 -2.99% 5.77 5.90 +2.25%
33.16 32.20 -2.90% 5.56 5.74 +3.24%
32.81 31.85 -2.93% 5.72 5.81 +1.57%
33.38 32.36 -3.06% 5.76 5.89 +2.26%
32.55 31.51 -3.20% 5.70 5.91 +3.68%
33.11 32.14 -2.93% 5.52 5.63 +1.99%
34.21 33.22 -2.89% 5.84 5.95 +1.88%
ham mean and sdev for all runs
33.09 32.09 -3.02% 5.73 5.87 +2.44%
spam mean spam sdev
82.95 83.89 +1.13% 6.82 7.16 +4.99%
82.17 83.11 +1.14% 6.34 6.85 +8.04%
82.06 83.08 +1.24% 6.14 6.58 +7.17%
82.39 83.20 +0.98% 5.93 6.31 +6.41%
82.53 83.31 +0.95% 7.00 7.33 +4.71%
82.76 83.76 +1.21% 6.56 7.01 +6.86%
82.06 82.97 +1.11% 5.73 6.02 +5.06%
82.26 83.09 +1.01% 5.97 6.17 +3.35%
82.65 83.55 +1.09% 6.71 7.07 +5.37%
83.43 84.37 +1.13% 6.37 6.71 +5.34%
spam mean and sdev for all runs
82.53 83.43 +1.09% 6.37 6.75 +5.97%
ham/spam mean difference: 49.44 51.34 +1.90
I can live with that.
In contrast, Skip reported a ham mean of 22.67 (much lower) with sdev 7.57
(higher), and a spam mean of 69.27 (much lower) with sdev 12.38 (much
higher). His spam and his ham are hammier, but less certain of themselves.
I expect talk therapy would do them some good <wink>.