FW: [spambayes-dev] Results for DNS lookup in tokenizer
skip at pobox.com
Sun Apr 11 19:09:49 EDT 2004
>> I'll restate my question. What does Matt's proposal do that
>> mine_received_headers doesn't do already?
Phillip> It looks at URLs embedded in the message *body*. As a simple
Phillip> contrast, if I link here to:
Phillip> That will produce a very *different* set of IP tokens than the
Phillip> Received: headers of this message.
Ah, okay. I missed that in Matt's post. If the tokenizer's
x-pick_apart_urls option is True, it picks apart URLs embedded in the body
of the message. It's not as ip-centered as Matt's code.
More information about the spambayes-dev