[spambayes-dev] Unicode bug samples
G. Armour Van Horn
vanhorn at whidbey.com
Wed Jul 16 11:22:24 EDT 2003
Okay, I've been getting about one of these per day and have been tucking them away
in a Netscape mailbox called "SB Failures". Zipped up, it's a tiny file, about 16K.
When zipping it up, I noticed something extremely odd. Normally, a Netscape
mailbox file is accompanied by an index file (*.snm) that is far smaller. I assume
this is like a Eudora .toc file, the part you actually see when you look at your
mailboxes. Anyway, in this case, the "SB Failures" file is 72K, while the "SB
Failures.snm" is over three times as large at 240K.
Rather than mail it to Tony, I have it parked at
http://verbose.twistedhistory.com/dropbox/unicode.zip
Hope that helps somebody.
Van
Richie Hindle wrote:
> Hi Van,
>
> > Is this the known Unicode bug, or something new?
> >
> > X-Spambayes-Exception:
> > exceptions.UnicodeError(ASCII decoding error: ordinal not in range(128)) in
> > append() at
> > C:\Python22\lib\email\Header.py line 230: ustr =unicode(s, incodec)
>
> That's a malformed email message that Python's email package can't
> understand (because it has unencoded high-bit-set characters in one of its
> headers). Anthony will probably hate me for saying this, but you could
> forward the whole message (including headers, and preferably zipped up so
> that it doesn't get modified by intervening email systems) to him:
> anthony at interlink.com.au. He's the guy developing a better email parser
> (where 'better' means 'more able to cope with broken emails').
>
> --
> Richie Hindle
> richie at entrian.com
>
> _______________________________________________
> spambayes-dev mailing list
> spambayes-dev at python.org
> http://mail.python.org/mailman/listinfo/spambayes-dev
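
For anyone following along, the failure Richie describes (a header containing
raw high-bit bytes that a strict ASCII decode rejects) can be reproduced in a few
lines. This is only a rough sketch in present-day Python 3, not the Python 2.2
code from the traceback. The helper name decode_header_bytes, the sample header,
and the Latin-1 fallback are all assumptions made for illustration; they are not
what SpamBayes or the email package actually do.

```python
def decode_header_bytes(raw: bytes) -> str:
    """Decode raw header bytes, tolerating unencoded 8-bit characters.

    Hypothetical helper for illustration only.
    """
    try:
        # A well-formed header is pure ASCII; non-ASCII text is supposed
        # to be wrapped in RFC 2047 encoded-words.
        return raw.decode("ascii")
    except UnicodeDecodeError:
        # Malformed header: fall back to Latin-1 (an assumed choice),
        # which maps every possible byte value to some character.
        return raw.decode("latin-1")


# Made-up sample header with one unencoded high-bit byte (0xE9).
broken = b"Subject: Caf\xe9 offer"

try:
    broken.decode("ascii")
except UnicodeDecodeError as exc:
    print("strict ASCII decode failed:", exc.reason)

print(decode_header_bytes(broken))
```

The fallback guesses at the sender's character set, so the recovered text may be
wrong for messages written in some other encoding, but the header no longer
stops processing.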