[Python-3000] string C API

Josiah Carlson jcarlson at uci.edu
Thu Sep 14 22:19:03 CEST 2006


"Bob Ippolito" <bob at redivi.com> wrote:
> The argument for UTF-8 is probably interop efficiency. Lots of C
> libraries, file formats, and wire protocols use UTF-8 for interchange.
> Verifying the validity of UTF-8 during string creation isn't that big
> of a deal.

Indeed, UTF-8 validation/creation isn't a big deal.  But that wasn't my
concern.  My concern was Python-only operation efficiency, for which a
fixed-length-per-character encoding generally wins (at least for
operations involving two strings with the same internal encoding).


 - Josiah



More information about the Python-3000 mailing list