Multibyte Character Surport for Python

Martin v. Loewis martin at v.loewis.de
Thu May 9 06:32:58 EDT 2002


"Stephen J. Turnbull" <stephen at xemacs.org> writes:

> What Unicode objects?  They find ordinary strings that are mandated to
> be encoded in UTF-8.  

That could be done, but I would discourage it. The Unicode type in
Python is the type to represent Unicode, and there is only one way to
do it.

> We do the migration to Unicode objects later, at the same time that
> you would have done it anyway.  In the meantime, this fits right in
> with the kind of "backwards compatibility" that PEP 263 is all about.

You can't use UTF-8 to represent non-ASCII identifiers, and Unicode
objects later. Old byte code would not interoperate with new byte
code.

Regards,
Martin



More information about the Python-list mailing list