On 4/24/20 6:32 PM, Thomas Coradeschi via Mailman-Developers wrote:
I have a test list (called ’test’) and can see that /var/lib/mailman/archives/private/test/2020-April.txt and /var/lib/mailman/archives/private/test.mbox/test.mbox have both been created and have messages being written to them, but the html archive pages are not being built.
manually running '/var/lib/mailman/bin/arch test’ gives me
Note, do not run /var/lib/mailman/bin/arch more than once on a list without the --wipe option.
#00000 1F9F4DC9-AF84-4D49-B1EE-ABDC0A612692@skylands.ibmwr.org figuring article archives 2020-April Pickling archive state into /var/lib/mailman/archives/private/test/pipermail.pck Traceback (most recent call last): File "bin/arch", line 201, in <module> main() File "bin/arch", line 189, in main archiver.processUnixMailbox(fp, start, end) File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 596, in processUnixMailbox self.add_article(a) File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 640, in add_article author = fixAuthor(article.decoded['author']) File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 63, in fixAuthor while i>0 and (L[i-1] in lowercase or UnicodeDecodeError: 'ascii' codec can't decode byte 0xaa in position 26: ordinal not in range(128)
This looks like a manifestation of an issue we've seen before. There are multiple threads on this issue in the archive of the email@example.com list The bulk of it is at https://mail.python.org/pipermail/mailman-users/2019-March/thread.html in threads with
Subject: [Mailman-Users] Uncaught runner exception
The bottom line is in https://mail.python.org/pipermail/mailman-users/2019-March/084280.html. We could never figure out where it was coming from, but the import
from string import lowercase
in /var/lib/mailman/Mailman/Archiver/pipermail.py was returning a string that contained many accented characters in addition to the 26 letters a-z, namely the iso-8859-1 encoding of
This is some packaging quirk in (I think) Debian's Python.
If you follow all through the threads, you'll see several suggested patches for diagnosis and avoidance, and there is also a thread at https://mail.python.org/pipermail/mailman-users/2019-May/084432.html.
All that notwithstanding, I think this is the best patch for avoiding/fixing the issue.
=== modified file 'Mailman/Archiver/pipermail.py' --- Mailman/Archiver/pipermail.py 2018-05-03 21:23:47 +0000 +++ Mailman/Archiver/pipermail.py 2020-04-25 02:13:46 +0000 @@ -60,7 +60,7 @@ else: # Mixed case; assume that small parts of the last name will be # in lowercase, and check them against the list.
while i>0 and (L[i-1] in lowercase or
author = SPACE.join(L[-1:] + L[i:-1]) + ', ' + SPACE.join(L[:i])
while i>0 and (L[i-1] in lowercase[:26] or L[i-1].lower() in smallNameParts): i = i - 1