[Python-3000] string C API
jcarlson at uci.edu
Thu Sep 14 22:19:03 CEST 2006
"Bob Ippolito" <bob at redivi.com> wrote:
> The argument for UTF-8 is probably interop efficiency. Lots of C
> libraries, file formats, and wire protocols use UTF-8 for interchange.
> Verifying the validity of UTF-8 during string creation isn't that big
> of a deal.
Indeed, UTF-8 validation/creation isn't a big deal. But that wasn't my
concern. My concern was Python-only operation efficiency, for which a
fixed-length-per-character encoding generally wins (at least for
operations involving two strings with the same internal encoding).
More information about the Python-3000