Multibyte Character Surport for Python
Stephen J. Turnbull
stephen at xemacs.org
Mon May 13 01:01:57 EDT 2002
>>>>> "Martin" == Martin v Loewis <martin at v.loewis.de> writes:
Martin> Why does it help to have "UTF-16" to be a synonym to
Martin> either "UTF-16BE" or "UTF-16LE", but not telling anybody
Martin> what it is a synonym to?
Ask whoever implemented a UTF-16 codec for python, not me. Evidently
there's a good reason for it.
The fact is that the current implementation is just begging to produce
broken output that will be invisible to anyone who has a Unicode-
capable console. And that the only way to avoid it (without rewriting
all the APIs to pass Unicode objects instead of pre-encoded strings)
is really ugly code like the code I presented earlier.
--
Institute of Policy and Planning Sciences http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
My nostalgia for Icon makes me forget about any of the bad things. I don't
have much nostalgia for Perl, so its faults I remember. Scott Gilbert c.l.py
More information about the Python-list
mailing list