[Spambayes] Mining the headers

Skip Montanaro skip@pobox.com
Mon Oct 28 00:08:35 2002


    Tim> Skip, I think there's a bug in the extract_dow code.  

Thanks for catching it.  for: ... else: isn't a construct I use often, so
it's not entirely surprising that I muffed it.  How did you generate the
table of tokens in your note?

    Tim>               #ham  #spam        spamprob
    Tim> 'dow:0'          2      7  0.890542594688
    Tim> 'dow:1'          3      7  0.854937008074
    Tim> 'dow:2'        725     71  0.220827483069
    Tim> 'dow:3'       1038    261  0.420993872704
    Tim> 'dow:4'        845    234  0.444677806501
    Tim> 'dow:5'        126    196  0.81766035841
    Tim> 'dow:6'          0    137  0.998363041106
    Tim> 'dow:invalid' 2741    946  0.499472081328

The only tokens I've ever seen are in the summaries.

Skip