[spambayes-dev] Results for DNS lookup in tokenizer

Tony Meyer tameyer at ihug.co.nz
Wed Apr 14 19:46:23 EDT 2004

> Just as a side issue... they only need a subdomain per
> message, not a full domain. I.e. aaa.spamisevil.com is just 
> as unique as aaaspamisevil.com

I was really talking about the x-slurp_urls option, rather than the DNS
lookup.  With that option's x-only_slurp_base, the URL that is retrieved is
the simplest form of the URL, i.e. "aaaspamisevil.com" or "massey.ac.nz".
Doing a simple HTTP request for a webpage like that does not (AFAICT) include
any information at all about who is doing the request.  This means that you
*do* need a domain per message.  It also means that if I have a spammy page
at "spam.massey.ac.nz", but "massey.ac.nz" is ham, the clues generated will
make things worse, not better.  Of course, if the root domain is
legitimately hammy and they have spammy subdomains/pages, there's a
reasonable chance that you can get the spammy people kicked off.
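
To make the base-URL point concrete, here is a rough sketch of that kind of
reduction.  It is not the SpamBayes code: base_domain and
SECOND_LEVEL_LABELS are hypothetical names, the label set is deliberately
incomplete, and it is written for current Python (urllib.parse) rather than
the Python of the time.  It only shows why "spam.massey.ac.nz" and
"massey.ac.nz" end up fetching the same page.

```python
from urllib.parse import urlsplit

# Hypothetical, incomplete set of second-level labels that behave like a TLD
# under a two-letter country code (e.g. the "ac" in "massey.ac.nz").
SECOND_LEVEL_LABELS = {"ac", "co", "com", "edu", "gov", "net", "org"}


def base_domain(url):
    """Reduce a URL to its simplest host form (illustrative heuristic only)."""
    # urlsplit only fills in the host when the string contains "//".
    host = urlsplit(url if "//" in url else "//" + url).hostname or ""
    labels = host.split(".")
    # Keep three labels for names like massey.ac.nz, otherwise keep two.
    if (len(labels) >= 3 and len(labels[-1]) == 2
            and labels[-2] in SECOND_LEVEL_LABELS):
        keep = 3
    else:
        keep = 2
    return ".".join(labels[-keep:])


if __name__ == "__main__":
    for url in ("http://spam.massey.ac.nz/page",
                "http://aaa.spamisevil.com/x",
                "aaaspamisevil.com"):
        print(url, "->", base_domain(url))
```

With a reduction like this, a per-message subdomain is thrown away before
the request is made, and a spammy subdomain is judged by the content of its
(possibly hammy) root domain.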

=Tony Meyer

