[ mailman-Bugs-926034 ] Crashes on encoding errors

SourceForge.net noreply at sourceforge.net
Tue Mar 30 09:51:22 EST 2004


Bugs item #926034, was opened at 2004-03-30 16:06
Message generated for change (Comment added) made by heikkilevanto
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=100103&aid=926034&group_id=103

Category: Pipermail
Group: 2.1 (stable)
Status: Open
Resolution: None
Priority: 5
Submitted By: Heikki Levanto (heikkilevanto)
Assigned to: Nobody/Anonymous (nobody)
Summary: Crashes on encoding errors

Initial Comment:
I have seen crashes on some Danish-speaking lists where
users put Danish characters in the headers. The crash seems to
depend on the encoding used. I do not have much data, as I
cannot reproduce the crash myself and have to ask one of the
"troublemakers" to post a test message every time I need to
see it.
 
Here is a traceback: 
Mar 30 15:17:03 2004 (339) Uncaught runner exception: unknown encoding
Mar 30 15:17:03 2004 (339) Traceback (most recent call last):
  File "/home/mailman/Mailman/Queue/Runner.py", line 105, in _oneloop
    self._onefile(msg, msgdata)
  File "/home/mailman/Mailman/Queue/Runner.py", line 155, in _onefile
    keepqueued = self._dispose(mlist, msg, msgdata)
  File "/home/mailman/Mailman/Queue/IncomingRunner.py", line 130, in _dispose
    more = self._dopipeline(mlist, msg, msgdata, pipeline)
  File "/home/mailman/Mailman/Queue/IncomingRunner.py", line 153, in _dopipeline
    sys.modules[modname].process(mlist, msg, msgdata)
  File "/home/mailman/Mailman/Handlers/CookHeaders.py", line 75, in process
    prefix_subject(mlist, msg, msgdata)
  File "/home/mailman/Mailman/Handlers/CookHeaders.py", line 262, in prefix_subject
    h.append(s, c)
  File "/home/mailman/pythonlib/email/Header.py", line 285, in append
    s = s.encode(outcodec, errors)
LookupError: unknown encoding
 
The sad thing is that these mails just disappear, and my
users wonder why they cannot post to the list. Even a bounce
would be preferable, or at least a log entry that tells me
whom I should send apologies to...
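
As a rough illustration of the kind of logging I mean, here is a
minimal sketch in present-day Python 3. It is not Mailman code: the
wrapper name run_handler_logged and the logger name "handler-errors"
are hypothetical, and the handler is assumed to be any callable that
takes the message as its first argument.

```python
import logging
from email.message import Message

# Hypothetical logger name, chosen only for this sketch.
log = logging.getLogger("handler-errors")


def run_handler_logged(handler, msg: Message, *args):
    """Call handler(msg, *args); if it raises, log the sender and re-raise."""
    try:
        return handler(msg, *args)
    except Exception:
        # Record enough to identify the poster before the message is lost.
        log.exception(
            "handler %s failed; sender=%r message-id=%r",
            getattr(handler, "__name__", repr(handler)),
            msg.get("From"),
            msg.get("Message-ID"),
        )
        raise
```

The exception is re-raised, so the surrounding behaviour is unchanged;
the only difference is that the log now names the sender.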
 
Here are the headers of one offending message, as sent 
directly to my mailbox: 
 
>From cfb at pikobryg.dk Tue Mar 30 15:47:23 2004
Return-path: <cfb at pikobryg.dk>
Envelope-to: heikki at indexdata.dk
Delivery-date: Tue, 30 Mar 2004 15:47:23 +0200
Received: from smtp020.tiscali.dk ([212.54.64.104])
        by bagel.index with esmtp (Exim 3.35 #1 (Debian))
        id 1B8JaM-0003l8-00
        for <heikki at indexdata.dk>; Tue, 30 Mar 2004 15:47:22 +0200
Received: from pikobryg.dk (213.237.80.191.adsl.vbr.worldonline.dk [213.237.80.191])
        by smtp020.tiscali.dk (8.12.10/8.12.10) with ESMTP id i2UDlKsm001688
        for <heikki at indexdata.dk>; Tue, 30 Mar 2004 15:47:20 +0200 (MEST)
Message-ID: <40697A66.5030604 at pikobryg.dk>
Date: Tue, 30 Mar 2004 15:47:18 +0200
From: "Christian F. Behrens" <cfb at pikobryg.dk>
Reply-To: cfb at pikobryg.dk
Organization: Pikobryg
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.6) Gecko/20040113
X-Accept-Language: da, en-us, en
MIME-Version: 1.0
To: heikki at indexdata.dk
Subject: Korrektion af =?windows-1252?Q?sukkerm=E6ngde_til_den_sids?=
 =?windows-1252?Q?te_nadver?=
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: quoted-printable
X-MIME-Autoconverted: from 8bit to quoted-printable by smtp020.tiscali.dk id i2UDlKsm001688
 
Note that the subject line is labelled with the charset
"windows-1252", which seems to come from the MIME autoconversion
at tiscali.dk.
 
The sender claims (and I believe him) that on his end the
subject line looks something like this:
Subject: Test mail encoded =?ISO-8859-1?Q?=5B=E6=F8=E5_=C6=C5=D8=5D?=
 
 
I am not asking Mailman to support mysterious Windows code
pages, but if it could handle unknown encodings more gracefully
(i.e. without crashing), I would be much happier.
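
To show what more forgiving handling could look like, here is a
minimal sketch in present-day Python 3 (Mailman 2.1 itself runs on
Python 2, so this is an illustration and not a patch). The function
name decode_subject_leniently and the choice of latin-1 as the
fallback codec are my own assumptions; only decode_header and
codecs.lookup come from the standard library.

```python
import codecs
from email.header import decode_header


def decode_subject_leniently(raw_subject, fallback="latin-1"):
    """Decode RFC 2047 encoded words, never raising on an unknown charset."""
    parts = []
    for chunk, charset in decode_header(raw_subject):
        if isinstance(chunk, str):
            # Header contained no encoded words; nothing to decode.
            parts.append(chunk)
            continue
        codec = charset or "ascii"
        try:
            codecs.lookup(codec)
        except LookupError:
            # Unknown charset label: fall back instead of crashing.
            codec = fallback
        parts.append(chunk.decode(codec, errors="replace"))
    return "".join(parts)
```

Called on the Subject header quoted above, this returns readable text;
called on a header whose charset label Python does not know, it decodes
the bytes with the fallback codec instead of raising LookupError.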
 
I am also sending a polite letter to tiscali.dk, asking them to 
get their act together. 
 

----------------------------------------------------------------------

>Comment By: Heikki Levanto (heikkilevanto)
Date: 2004-03-30 16:51

Message:

Sorry, I forgot to mention that I run Debian stable with a
hand-installed Mailman 2.1.2 from June 2003.

It may also be that the original mail in my example was written in
MS Word and pasted from there into Mozilla. Maybe that has
something to do with the encoding, perhaps more than tiscali.dk's
autoconversion.
 
 

----------------------------------------------------------------------
