
On 4/24/20 6:32 PM, Thomas Coradeschi via Mailman-Developers wrote:
> I have a test list (called 'test') and can see that
> /var/lib/mailman/archives/private/test/2020-April.txt and
> /var/lib/mailman/archives/private/test.mbox/test.mbox have both been
> created and have messages being written to them, but the html archive
> pages are not being built.
>
> Manually running '/var/lib/mailman/bin/arch test' gives me:
>
> Note, do not run /var/lib/mailman/bin/arch more than once on a list
> without the --wipe option.
>
> #00000 <1F9F4DC9-AF84-4D49-B1EE-ABDC0A612692@skylands.ibmwr.org>
> figuring article archives
> 2020-April
> Pickling archive state into /var/lib/mailman/archives/private/test/pipermail.pck
> Traceback (most recent call last):
>   File "bin/arch", line 201, in <module>
>     main()
>   File "bin/arch", line 189, in main
>     archiver.processUnixMailbox(fp, start, end)
>   File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 596, in processUnixMailbox
>     self.add_article(a)
>   File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 640, in add_article
>     author = fixAuthor(article.decoded['author'])
>   File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 63, in fixAuthor
>     while i>0 and (L[i-1][0] in lowercase or
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xaa in position 26: ordinal not in range(128)
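
The last line of the traceback is consistent with Python 2 implicitly decoding a byte string as ASCII when it is tested against a unicode string with `in`. The following is only a Python 3 sketch of that decode step, not Mailman code. It assumes, hypothetically, that `lowercase` holds the 26 ASCII letters followed by a 0xaa byte, which is what "byte 0xaa in position 26" suggests; the name `polluted_lowercase` and the helper `ascii_decode_error` are made up for illustration.

```python
import string

# Hypothetical stand-in for a locale-affected Python 2 string.lowercase:
# the 26 ASCII letters followed by one Latin-1 byte (0xaa).
polluted_lowercase = string.ascii_lowercase.encode("ascii") + b"\xaa"


def ascii_decode_error(data):
    """Return the UnicodeDecodeError raised by an ASCII decode, or None."""
    try:
        data.decode("ascii")
    except UnicodeDecodeError as exc:
        return exc
    return None


err = ascii_decode_error(polluted_lowercase)
print(err)        # the error names the first non-ASCII byte
print(err.start)  # index of that byte within the string

# Restricting the string to its first 26 bytes leaves only a-z,
# so the same decode succeeds.
print(ascii_decode_error(polluted_lowercase[:26]))
```

The first 26 bytes decode cleanly, so the failure is triggered only by whatever follows the letter z.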

This looks like a manifestation of an issue we've seen before. There are
multiple threads on this issue in the archive of the
mailman-users@python.org list. The bulk of it is at
<https://mail.python.org/pipermail/mailman-users/2019-March/thread.html>
in threads with Subject: [Mailman-Users] Uncaught runner exception. The
bottom line is in
<https://mail.python.org/pipermail/mailman-users/2019-March/084280.html>.

We could never figure out where it was coming from, but the import

    from string import lowercase

in /var/lib/mailman/Mailman/Archiver/pipermail.py was returning a string
that contained many accented characters in addition to the 26 letters
a-z, namely the iso-8859-1 encoding of

'abcdefghijklmnopqrstuvwxyzµßàáâãäåæçèéêëìíîïðñòóôõöøùúûüýþÿ'

This is some packaging quirk in (I think) Debian's Python.

If you follow all through the threads, you'll see several suggested
patches for diagnosis and avoidance, and there is also a thread at
<https://mail.python.org/pipermail/mailman-users/2019-May/084432.html>.

All that notwithstanding, I think this is the best patch for
avoiding/fixing the issue.

=== modified file 'Mailman/Archiver/pipermail.py'
--- Mailman/Archiver/pipermail.py	2018-05-03 21:23:47 +0000
+++ Mailman/Archiver/pipermail.py	2020-04-25 02:13:46 +0000
@@ -60,7 +60,7 @@
     else:
         # Mixed case; assume that small parts of the last name will be
         # in lowercase, and check them against the list.
-        while i>0 and (L[i-1][0] in lowercase or
+        while i>0 and (L[i-1][0] in lowercase[:26] or
                        L[i-1].lower() in smallNameParts):
             i = i - 1
     author = SPACE.join(L[-1:] + L[i:-1]) + ', ' + SPACE.join(L[:i])

-- 
Mark Sapiro <mark@msapiro.net>        The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan
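
For readers who want to see what the patched test does, here is a simplified Python 3 sketch of the name-reordering logic with the `lowercase[:26]` slice applied. It is not the Mailman source: the function name `fix_author`, the `lowercase` parameter, the contents of `SMALL_NAME_PARTS`, and the `POLLUTED` string are all assumptions for illustration, and only the `while` condition is taken from the patch above.

```python
import string

SPACE = " "
# Assumed list of lowercase surname particles (hypothetical contents).
SMALL_NAME_PARTS = ["van", "von", "der", "de"]

# Hypothetical locale-affected value: a-z followed by accented letters.
POLLUTED = string.ascii_lowercase + "µßàáâ"


def fix_author(author, lowercase=string.ascii_lowercase):
    """Reorder 'First Last' into 'Last, First' (simplified sketch)."""
    if "," in author:
        return author  # assume it is already 'Last, First'
    parts = author.split()
    i = len(parts) - 1
    if i == 0:
        return author  # single word, nothing to reorder
    if author.upper() == author or author.lower() == author:
        # Name is all one case: only the particle list can be used.
        while i > 0 and parts[i - 1].lower() in SMALL_NAME_PARTS:
            i -= 1
    else:
        # Patched test: only the first 26 characters (a-z) are consulted.
        while i > 0 and (parts[i - 1][0] in lowercase[:26] or
                         parts[i - 1].lower() in SMALL_NAME_PARTS):
            i -= 1
    return SPACE.join(parts[-1:] + parts[i:-1]) + ", " + SPACE.join(parts[:i])


print(fix_author("Mark Sapiro"))
print(fix_author("Mark Sapiro", lowercase=POLLUTED))
```

Because of the slice, the result is the same whether `lowercase` is the plain a-z string or the longer one, which is the point of the patch.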