[ mailman-Bugs-926034 ] Crashes on encoding errors
SourceForge.net
noreply at sourceforge.net
Tue Mar 30 09:51:22 EST 2004
Bugs item #926034, was opened at 2004-03-30 16:06
Message generated for change (Comment added) made by heikkilevanto
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=100103&aid=926034&group_id=103
Category: Pipermail
Group: 2.1 (stable)
Status: Open
Resolution: None
Priority: 5
Submitted By: Heikki Levanto (heikkilevanto)
Assigned to: Nobody/Anonymous (nobody)
Summary: Crashes on encoding errors
Initial Comment:
I have seen crashes on some Danish-speaking lists, where
users put Danish characters in the headers. It seems to
depend on the encoding used. I do not have much statistical
material, as I can not produce the crash myself, but have to
ask some of the "troublemakers" to post a test message every
time I need to see the crash.
Here is a traceback:
Mar 30 15:17:03 2004 (339) Uncaught runner exception:
unknown encoding
Mar 30 15:17:03 2004 (339) Traceback (most recent call
last):
File "/home/mailman/Mailman/Queue/Runner.py", line 105,
in _oneloop
self._onefile(msg, msgdata)
File "/home/mailman/Mailman/Queue/Runner.py", line 155,
in _onefile
keepqueued = self._dispose(mlist, msg, msgdata)
File "/home/mailman/Mailman/Queue/IncomingRunner.py",
line 130, in _dispose
more = self._dopipeline(mlist, msg, msgdata, pipeline)
File "/home/mailman/Mailman/Queue/IncomingRunner.py",
line 153, in _dopipeline
sys.modules[modname].process(mlist, msg, msgdata)
File "/home/mailman/Mailman/Handlers/CookHeaders.py",
line 75, in process
prefix_subject(mlist, msg, msgdata)
File "/home/mailman/Mailman/Handlers/CookHeaders.py",
line 262, in prefix_subject
h.append(s, c)
File "/home/mailman/pythonlib/email/Header.py", line 285,
in append
s = s.encode(outcodec, errors)
LookupError: unknown encoding
The sad thing is that these mails just disappear, and my
users wonder why they can not post to the list. Even a bounce
would be to prefer, or at least something in the log that tells
whom I could send apologies to...
Here are the headers of one offending message, as sent
directly to my mailbox:
>From cfb at pikobryg.dk Tue Mar 30 15:47:23 2004
Return-path: <cfb at pikobryg.dk>
Envelope-to: heikki at indexdata.dk
Delivery-date: Tue, 30 Mar 2004 15:47:23 +0200
Received: from smtp020.tiscali.dk ([212.54.64.104])
by bagel.index with esmtp (Exim 3.35 #1 (Debian))
id 1B8JaM-0003l8-00
for <heikki at indexdata.dk>; Tue, 30 Mar 2004 15:47:22
+0200
Received: from pikobryg.dk
(213.237.80.191.adsl.vbr.worldonline.dk [213.237.80.191])
by smtp020.tiscali.dk (8.12.10/8.12.10) with ESMTP id
i2UDlKsm001688
for <heikki at indexdata.dk>; Tue, 30 Mar 2004 15:47:20
+0200 (MEST)
Message-ID: <40697A66.5030604 at pikobryg.dk>
Date: Tue, 30 Mar 2004 15:47:18 +0200
From: "Christian F. Behrens" <cfb at pikobryg.dk>
Reply-To: cfb at pikobryg.dk
Organization: Pikobryg
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1;
en-US; rv:1.6) Gecko/20040113
X-Accept-Language: da, en-us, en
MIME-Version: 1.0
To: heikki at indexdata.dk
Subject: Korrektion af
=?windows-1252?Q?sukkerm=E6ngde_til_den_sids?=
=?windows-1252?Q?te_nadver?=
Content-Type: text/plain; charset=windows-1252;
format=flowed
Content-Transfer-Encoding: quoted-printable
X-MIME-Autoconverted: from 8bit to quoted-printable by
smtp020.tiscali.dk id i2UDlKsm001688
Note that the subject line carries encoding "windows-1252",
which comes from Mime-Autoconversion at tiscali.dk.
The sender claims (and I believe him) that in his end the
subject line looks something like this:
Subjekt: Test mail encoded
=?ISO-8859-1?Q?=5B=E6=F8=E5_=C6=C5=D8=5D?=
I am not asking mailman to support mysterious windows code
pages, but if it could handle unknown encodings better (ie
without crashing) I would be much happier.
I am also sending a polite letter to tiscali.dk, asking them to
get their act together.
----------------------------------------------------------------------
>Comment By: Heikki Levanto (heikkilevanto)
Date: 2004-03-30 16:51
Message:
Logged In: YES
user_id=854974
Sorry, forgot to specify that I run on a Debian/Stable, with
hand-installed mailman 2.1.2 from June 2003.
It may also be that the original mail in my example was written in
MS-Word, and pasted from there to Mozilla. Maybe that has
something to do with the encoding, perhaps more than tiscali.dk's
autoconversion.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=100103&aid=926034&group_id=103
More information about the Mailman-coders
mailing list