[Spambayes] Watch out for this

Skip Montanaro skip at pobox.com
Wed Sep 10 11:49:52 EDT 2003


    Skip> Good suggestion.  I'm not sure if the tokenizer does this already,
    Skip> but a quick grep for '&#[0-9];' through my current training
    Skip> database (about 3 million lines) suggests this is still fairly
    Skip> infrequently used.

Before anyone chides me for the incorrect regex, I actually grepped for
'&#[0-9]+;'.  The results were correct.  I just mistyped the regex in my
message.

S



More information about the Spambayes mailing list