[Python-Dev] thoughts on the bytes/string discussion
Greg Ewing
greg.ewing at canterbury.ac.nz
Sun Jun 27 11:48:22 CEST 2010
Stefan Behnel wrote:
> Greg Ewing, 26.06.2010 09:58:
>
>> Would there be any sanity in having an option to compile
>> Python with UTF-8 as the internal string representation?
>
> It would break Py_UNICODE, because the internal size of a unicode
> character would no longer be fixed.
It's not fixed anyway with the 2-char build -- some
characters are represented using a pair of surrogates.
--
Greg
More information about the Python-Dev
mailing list