[Mailman-Users] UnicodeDecodeError with Mailman 2.1 and Python 2.6
David Magda
dmagda at ee.ryerson.ca
Wed Sep 2 03:38:34 CEST 2015
> On Sep 1, 2015, at 21:02, Mark Sapiro <mark at msapiro.net> wrote:
>
> In my testing with GNU grep on Ubuntu 15.04, 'grep "\xea"' interprets \x
> as a literal x and therefore looks for the string "xea", not for the
> character whose hex value is EA.
For the record/archives: GNU grep also as the “-P” option, which allows Perl regexes (PCRE), and \xhh searches for characters with hex code hh (per pcrepattern(3)):
http://stackoverflow.com/questions/3001177/ how-do-i-grep-for-all-non-ascii-characters-in-unix
>> Doing a "arch --wipe mylist" seems to have solved the issue, though now
>> I'm curious to know why \xea was a problem before but suddenly isn't after
>> the wipe.
>
>
> Here's what I suspect was going on.
>
> Your first run of bin/arch encountered some non-ascii in a header and
> threw the exception, but not before writing bad data to the pipermail
> database for that month.
>
> You then "fixed" the non-ascii in the input mbox, but subsequent runs of
> bin/arch still encountered the bad data in the database when they got to
> that month.
>
> Finally, you added the --wipe option and that removed everythin and
> rebuilt from scratch and as there was no non-ascii in the mbox headers,
> it worked.
>
> As to why this didn't happen before, see my next reply.
Sounds plausible.
More information about the Mailman-Users
mailing list