[spambayes-dev] Very small change for composite word tokenizing.

Meyer, Tony T.A.Meyer at massey.ac.nz
Tue Aug 5 11:40:24 EDT 2003


> -> longword_re = re.compile(r"[a-zA-Z1-9$]+")

Note that Sean admitted (privately) that leaving out the '0' was
unintentional ;)  I tried it with [a-zA-Z0-9$] and got the same results,
though.

=Tony Meyer



More information about the spambayes-dev mailing list