[Spambayes-checkins] spambayes/spambayes tokenizer.py,1.31,1.32
Tony Meyer
anadelonbrin at users.sourceforge.net
Thu Aug 5 02:56:56 CEST 2004
Update of /cvsroot/spambayes/spambayes/spambayes
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv17978/spambayes
Modified Files:
tokenizer.py
Log Message:
Goodbye to support code for two deprecated options: [Tokenizer] x-extract_dow
and [Tokenizer] x-generate_time_buckets. No-one objected on the list (and some
agreed), and they've been deprecated for a while.
Index: tokenizer.py
===================================================================
RCS file: /cvsroot/spambayes/spambayes/spambayes/tokenizer.py,v
retrieving revision 1.31
retrieving revision 1.32
diff -C2 -d -r1.31 -r1.32
*** tokenizer.py 12 Feb 2004 22:07:55 -0000 1.31
--- tokenizer.py 5 Aug 2004 00:56:53 -0000 1.32
***************
*** 1469,1498 ****
yield 'received:' + tok
- # Date:
- if options["Tokenizer", "x-generate_time_buckets"]:
- for header in msg.get_all("date", ()):
- mat = self.date_hms_re.search(header)
- # return the time in Date: headers arranged in
- # 10-minute buckets
- if mat is not None:
- h = int(mat.group('hour'))
- bucket = int(mat.group('minute')) // 10
- yield 'time:%02d:%d' % (h, bucket)
-
- if options["Tokenizer", "x-extract_dow"]:
- for header in msg.get_all("date", ()):
- # extract the day of the week
- for fmt in self.date_formats:
- try:
- timetuple = time.strptime(header, fmt)
- except ValueError:
- pass
- else:
- yield 'dow:%d' % timetuple[6]
- break
- else:
- # if nothing matches, declare the Date: header invalid
- yield 'dow:invalid'
-
# Message-Id: This seems to be a small win and should not
# adversely affect a mixed source corpus so it's always enabled.
--- 1469,1472 ----
More information about the Spambayes-checkins
mailing list