[Python-Dev] thoughts on the bytes/string discussion

Greg Ewing greg.ewing at canterbury.ac.nz
Sun Jun 27 11:48:22 CEST 2010


Stefan Behnel wrote:
> Greg Ewing, 26.06.2010 09:58:
> 
>> Would there be any sanity in having an option to compile
>> Python with UTF-8 as the internal string representation?
> 
> It would break Py_UNICODE, because the internal size of a unicode 
> character would no longer be fixed.

It's not fixed anyway with the 2-byte (narrow) build -- characters
outside the BMP are represented using a pair of surrogates.
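
A rough way to see this, as a sketch rather than anything from the
interpreter itself: encoding to UTF-16 mimics how a narrow build stores
code units. This assumes Python 3.3 or later, where the narrow/wide
build distinction no longer exists; the helper name utf16_code_units is
made up for the illustration.

```python
import struct

def utf16_code_units(ch):
    """Return the list of 16-bit UTF-16 code units for a string."""
    data = ch.encode("utf-16-le")          # little-endian, no BOM
    count = len(data) // 2                 # each code unit is 2 bytes
    return list(struct.unpack("<%dH" % count, data))

bmp = "\u00e9"          # U+00E9, inside the Basic Multilingual Plane
astral = "\U0001D11E"   # U+1D11E MUSICAL SYMBOL G CLEF, outside the BMP

print([hex(u) for u in utf16_code_units(bmp)])     # one code unit
print([hex(u) for u in utf16_code_units(astral)])  # a surrogate pair
```

The BMP character takes a single 16-bit unit, while the non-BMP one
takes two (0xD834 followed by 0xDD1E), so the per-character size is
already variable in that representation.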

-- 
Greg

