[spambayes-dev] trimming email addresses

T. Alexander Popiel popiel at wolfskeep.com
Tue Jul 29 11:34:52 EDT 2003


In message:  <16166.43297.843579.921132 at montanaro.dyndns.org>
             Skip Montanaro <skip at pobox.com> writes:
>
>It occurred to me yesterday that it might be worthwhile trimming email
>addresses which contain "+" signs.  Most MTAs understand this notation and
>deliver such messages to the email address on the left of the "+".  For
>example, Mailman 2.1 uses this.  The email address in the Sender field of my
>spambayes mail is
>
>    spambayes-bounces+skip=pobox.com at python.org
>
>A message to that address would go to spambayes-bounces at python.org where the
>"skip=python.org" part is extracted and treated as a parameter by the
>recipient (usually a program).
>
>It seems to me that the Spambayes tokenizer should only consider the
>"spambayes-bounces" part of the address.  I don't think this will improve
>the tokenizer in the general case, but it does seem like the correct way to
>handle such addresses.

Actually, I use such + notation in addresses to help distinguish
certain sources of ham; e.g. popiel+<mybank>@wolfskeep.com is a much
stronger ham clue than popiel at wolfskeep.com, because <mybank> is
the only one who uses that form of address (generally for account
policy notices, etc.).  So I think that stripping off the +<foo>
in the address would be a significant lose for me.

>Any objection to me implementing this and checking it in?

Go ahead and implement it, as an option.  That'll make it easier
to actually test. :-)

- Alex



More information about the spambayes-dev mailing list