Mailman archiver (inadvisable) translates Latin 1 characters

Pearu Peterson pearu at cens.ioc.ee
Thu Aug 9 14:18:55 EDT 2001


This message was originally sent to mailman users group but due to the
delay there I'll try this newsgroup if someone could help me.

---------- Forwarded message ----------
Date: Thu, 9 Aug 2001 17:46:28 +0200 (EET)
From: Pearu Peterson <pearu at cens.ioc.ee>
To: mailman-users at python.org
Subject: Latin 1 characters


Hi!

I have noticed that latin 1 (or rather non US ascii) characters are mapped
to 3-strings starting with = character when messages are shown in the
Mailman archive. For example,

õ -> =F5
Õ -> =D5
ä -> =E4
Ä -> =C4
ö -> =F6
Ö -> =D6
ü -> =FC
Ü -> =DC
\t -> =09

Which part of Mailman does this translation? Can I affect this in any way?
Currently I am having a thought of scanning
	mailman/archives/private/<list>.mbox
for strings r'=[A-F0-9][A-F0-9]' and replacing them with the original
characters (and then rerunning arch on that list).
But is there a better way of getting the same result?

Any hints are appreciated.
Thanks,
	Pearu





More information about the Python-list mailing list