[Spambayes-checkins] spambayes Options.py,1.62,1.63 tokenizer.py,1.55,1.56
Tim Peters
tim_one@users.sourceforge.net
Mon Oct 28 20:19:52 2002
Update of /cvsroot/spambayes/spambayes
In directory usw-pr-cvs1:/tmp/cvs-serv14627
Modified Files:
Options.py tokenizer.py
Log Message:
Repaired comments about what generate_time_buckets does. Purged
reference to now-gone cvcost.py.
Index: Options.py
===================================================================
RCS file: /cvsroot/spambayes/spambayes/Options.py,v
retrieving revision 1.62
retrieving revision 1.63
diff -C2 -d -r1.62 -r1.63
*** Options.py 28 Oct 2002 07:04:12 -0000 1.62
--- Options.py 28 Oct 2002 20:19:46 -0000 1.63
***************
*** 97,102 ****
skip_max_word_size: 12
! # Generate tokens which resemble the posting time in 6-minute buckets:
! # int((h*60+m)/10).
generate_time_buckets: False
--- 97,102 ----
skip_max_word_size: 12
! # Generate tokens which resemble the posting time in 10-minute buckets:
! # 'time:' hour ':' minute//10
generate_time_buckets: False
***************
*** 158,163 ****
# something scores >= spamc, it's called spam; and everything else is
# called 'I am not sure' -- the middle ground.
- #
- # Note that cvcost.py does a similar analysis.
#
# Note: You may wish to increase nbuckets, to give this scheme more cutoff
--- 158,161 ----
Index: tokenizer.py
===================================================================
RCS file: /cvsroot/spambayes/spambayes/tokenizer.py,v
retrieving revision 1.55
retrieving revision 1.56
diff -C2 -d -r1.55 -r1.56
*** tokenizer.py 28 Oct 2002 17:07:28 -0000 1.55
--- tokenizer.py 28 Oct 2002 20:19:47 -0000 1.56
***************
*** 1066,1070 ****
mat = self.date_hms_re.search(header)
# return the time in Date: headers arranged in
! # six-minute buckets
if mat is not None:
h = int(mat.group('hour'))
--- 1066,1070 ----
mat = self.date_hms_re.search(header)
# return the time in Date: headers arranged in
! # 10-minute buckets
if mat is not None:
h = int(mat.group('hour'))
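
For illustration only, the bucketing described by the corrected comments ('time:' hour ':' minute//10, i.e. six 10-minute buckets per hour) might be sketched as follows. This is not the tokenizer.py code itself: the regex DATE_HMS_RE, the helper name time_bucket_token, and the exact token formatting (zero-padded hour) are assumptions made for the example; only the group names 'hour' and 'minute' and the minute//10 arithmetic are taken from the diff above.

```python
import re

# Hypothetical pattern (an assumption, not the project's date_hms_re):
# matches HH:MM with an optional :SS, as found in a Date: header.
DATE_HMS_RE = re.compile(r"(?P<hour>\d{2}):(?P<minute>\d{2})(?::\d{2})?")


def time_bucket_token(header):
    """Return a 'time:HH:B' token, where B is the 10-minute bucket (0-5).

    Returns None when no HH:MM time is found in the header text.
    """
    mat = DATE_HMS_RE.search(header)
    if mat is None:
        return None
    h = int(mat.group("hour"))
    # Integer division groups minutes 00-09 into bucket 0, 10-19 into 1, etc.
    bucket = int(mat.group("minute")) // 10
    # Zero-padding the hour is an assumed formatting choice.
    return "time:%02d:%d" % (h, bucket)


if __name__ == "__main__":
    print(time_bucket_token("Mon, 28 Oct 2002 20:19:52 -0000"))
```

With this scheme a message posted at 20:19 and one posted at 20:11 would produce the same token, while one posted at 20:20 would fall into the next bucket.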