Multibyte Character Surport for Python

Stephen J. Turnbull stephen at xemacs.org
Mon May 13 01:01:57 EDT 2002


>>>>> "Martin" == Martin v Loewis <martin at v.loewis.de> writes:

    Martin> Why does it help to have "UTF-16" to be a synonym to
    Martin> either "UTF-16BE" or "UTF-16LE", but not telling anybody
    Martin> what it is a synonym to?

Ask whoever implemented a UTF-16 codec for python, not me.  Evidently
there's a good reason for it.

The fact is that the current implementation is just begging to produce
broken output that will be invisible to anyone who has a Unicode-
capable console.  And that the only way to avoid it (without rewriting
all the APIs to pass Unicode objects instead of pre-encoded strings)
is really ugly code like the code I presented earlier.

-- 
Institute of Policy and Planning Sciences     http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba                    Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
 My nostalgia for Icon makes me forget about any of the bad things.  I don't
have much nostalgia for Perl, so its faults I remember.  Scott Gilbert c.l.py



More information about the Python-list mailing list