On 4/24/20 6:32 PM, Thomas Coradeschi via Mailman-Developers wrote:
> 
> I have a test list (called ’test’) and can see that 
> /var/lib/mailman/archives/private/test/2020-April.txt and 
> /var/lib/mailman/archives/private/test.mbox/test.mbox have both been created 
> and have messages being written to them, but the html archive pages are not 
> being built.
> 
> manually running '/var/lib/mailman/bin/arch test’ gives me


Note, do not run /var/lib/mailman/bin/arch more than once on a list
without the --wipe option.


> #00000 <1f9f4dc9-af84-4d49-b1ee-abdc0a612...@skylands.ibmwr.org>
> figuring article archives
> 2020-April
> Pickling archive state into 
> /var/lib/mailman/archives/private/test/pipermail.pck
> Traceback (most recent call last):
>   File "bin/arch", line 201, in <module>
>     main()
>   File "bin/arch", line 189, in main
>     archiver.processUnixMailbox(fp, start, end)
>   File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 596, in 
> processUnixMailbox
>     self.add_article(a)
>   File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 640, in 
> add_article
>     author = fixAuthor(article.decoded['author'])
>   File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 63, in fixAuthor
>     while i>0 and (L[i-1][0] in lowercase or
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xaa in position 26: 
> ordinal not in range(128)


This looks like a manifestation of an issue we've seen before. There are
multiple threads on this issue in the archive of the
mailman-us...@python.org list The bulk of it is at
<https://mail.python.org/pipermail/mailman-users/2019-March/thread.html>
in threads with

Subject: [Mailman-Users] Uncaught runner exception

The bottom line is in
<https://mail.python.org/pipermail/mailman-users/2019-March/084280.html>. We
could never figure out where it was coming from, but the import

from string import lowercase

in /var/lib/mailman/Mailman/Archiver/pipermail.py was returning a string
that contained many accented characters in addition to the 26 letters
a-z, namely the iso-8859-1 encoding of

'abcdefghijklmnopqrstuvwxyzµßàáâãäåæçèéêëìíîïðñòóôõöøùúûüýþÿ'

This is some packaging quirk in (I think) Debian's Python.

If you follow all through the threads, you'll see several suggested
patches for diagnosis and avoidance, and there is also a thread at
<https://mail.python.org/pipermail/mailman-users/2019-May/084432.html>.

All that notwithstanding, I think this is the best patch for
avoiding/fixing the issue.

=== modified file 'Mailman/Archiver/pipermail.py'
--- Mailman/Archiver/pipermail.py       2018-05-03 21:23:47 +0000
+++ Mailman/Archiver/pipermail.py       2020-04-25 02:13:46 +0000
@@ -60,7 +60,7 @@
     else:
         # Mixed case; assume that small parts of the last name will be
         # in lowercase, and check them against the list.
-        while i>0 and (L[i-1][0] in lowercase or
+        while i>0 and (L[i-1][0] in lowercase[:26] or
                        L[i-1].lower() in smallNameParts):
             i = i - 1
     author = SPACE.join(L[-1:] + L[i:-1]) + ', ' + SPACE.join(L[:i])


-- 
Mark Sapiro <m...@msapiro.net>        The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan
_______________________________________________
Mailman-Developers mailing list -- mailman-developers@python.org
To unsubscribe send an email to mailman-developers-le...@python.org
https://mail.python.org/mailman3/lists/mailman-developers.python.org/
Mailman FAQ: https://wiki.list.org/x/AgA3

Security Policy: https://wiki.list.org/x/QIA9

Reply via email to