[Mailman-Users] possible arch bug?

Mark Sapiro mark at msapiro.net
Fri May 28 01:35:28 CEST 2010


Robert Khachikyan wrote:

>I've been importing thousands of emails from majordomo to mailman and I 
>might have found a bug...
>
>The original email has this:
>
> From - Fri Jan 1 00:00:01 2010
>Delivered-To: bla at bla.com
>Message-ID: <7CB5F45D9E56D511836F0002A537DD6992CA9A at bla.com>
>From: bla.bla at bla.com
>To: bla.bla at bla.com
>Subject: here is the subject line
>Date: Thu, 28 Jul 2005 16:38:27 -0700
>Content-Type: text/plain; charset="iso-8859-1"
>Content-Transfer-Encoding: 7bit
>Sender: owner-bla
>Precedence: bulk
>
> From today, everything should be back to 'normal' again. This applies
>to all stations mentioned above, except for DOJO. This station is
>presently in a 'transition' mode. Due to 're-configuration' activities,
>....
>
>~mailman/bin/arch interprets this as 2 emails simply because it detected 
>the ^From again.
>Shouldn't it be seeking for "^From(\s)-(\s)(<date reg exp here>)$" ??


bin/arch uses the Python mailbox.PortableUnixMailbox class which is
very loose about what constitutes a From_ separator.

That's why we provide a script, bin/cleanarch for checking archive mbox
files and escaping lines beginning with From_ which aren't message
separators.

-- 
Mark Sapiro <mark at msapiro.net>        The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan



More information about the Mailman-Users mailing list