[Spambayes-checkins] spambayes Options.py,1.62,1.63 tokenizer.py,1.55,1.56

Tim Peters tim_one@users.sourceforge.net
Mon Oct 28 20:19:52 2002


Update of /cvsroot/spambayes/spambayes
In directory usw-pr-cvs1:/tmp/cvs-serv14627

Modified Files:
	Options.py tokenizer.py 
Log Message:
Repaired comments about what generate_time_buckets does.  Purged
reference to now-gone cvcost.py.


Index: Options.py
===================================================================
RCS file: /cvsroot/spambayes/spambayes/Options.py,v
retrieving revision 1.62
retrieving revision 1.63
diff -C2 -d -r1.62 -r1.63
*** Options.py	28 Oct 2002 07:04:12 -0000	1.62
--- Options.py	28 Oct 2002 20:19:46 -0000	1.63
***************
*** 97,102 ****
  skip_max_word_size: 12
  
! # Generate tokens which resemble the posting time in 6-minute buckets:
! # int((h*60+m)/10).
  generate_time_buckets: False
  
--- 97,102 ----
  skip_max_word_size: 12
  
! # Generate tokens which resemble the posting time in 10-minute buckets:
! #     'time:'  hour  ':'  minute//10
  generate_time_buckets: False
  
***************
*** 158,163 ****
  # something scores >= spamc, it's called spam; and everything else is
  # called 'I am not sure' -- the middle ground.
- #
- # Note that cvcost.py does a similar analysis.
  #
  # Note:  You may wish to increase nbuckets, to give this scheme more cutoff
--- 158,161 ----

Index: tokenizer.py
===================================================================
RCS file: /cvsroot/spambayes/spambayes/tokenizer.py,v
retrieving revision 1.55
retrieving revision 1.56
diff -C2 -d -r1.55 -r1.56
*** tokenizer.py	28 Oct 2002 17:07:28 -0000	1.55
--- tokenizer.py	28 Oct 2002 20:19:47 -0000	1.56
***************
*** 1066,1070 ****
                  mat = self.date_hms_re.search(header)
                  # return the time in Date: headers arranged in
!                 # six-minute buckets
                  if mat is not None:
                      h = int(mat.group('hour'))
--- 1066,1070 ----
                  mat = self.date_hms_re.search(header)
                  # return the time in Date: headers arranged in
!                 # 10-minute buckets
                  if mat is not None:
                      h = int(mat.group('hour'))