[Python-3000] string module trimming

Jeffrey Yasskin jyasskin at gmail.com
Sat Apr 28 01:59:26 CEST 2007


On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 4/18/07, Guido van Rossum <guido at python.org> wrote:
> > On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
>
> > > Today, string.letters works most easily with ASCII supersets, and is
> > > effectively limited to 8-bit encodings.  Once everything is unicode, I
> > > don't think that 8-bit restriction should apply any more.
>
> > But we already went over this. There are over 40K letters in Unicode.
> > It simply makes no sense to have a string.letters approaching that
> > size.
>
> Agreed.  But there aren't 40K (alphabetic) letters in any particular
> locale.  Most individual languages will have less than 100.

Here's a relevant bunch of data from the CLDR:
http://www.unicode.org/cldr/data/charts/by_type/misc.exemplarCharacters.html
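
For a rough sense of the scale being discussed, here is a minimal sketch (not from the thread itself) that counts the code points Python's unicodedata module classifies as letters and compares that with the ASCII letter set. It assumes Python 3, where string.ascii_letters is the surviving ASCII-only constant, and the exact Unicode total depends on the Unicode database version bundled with the interpreter.

```python
import string
import sys
import unicodedata


def count_unicode_letters():
    """Count code points whose general category is a letter (Lu, Ll, Lt, Lm, Lo)."""
    return sum(
        1
        for cp in range(sys.maxunicode + 1)
        if unicodedata.category(chr(cp)).startswith("L")
    )


# ASCII letters: a-z plus A-Z.
ascii_count = len(string.ascii_letters)

# Every letter known to this interpreter's Unicode database; the total
# varies by Python version, so no specific figure is claimed here.
unicode_count = count_unicode_letters()

print("ASCII letters:  ", ascii_count)
print("Unicode letters:", unicode_count)
```

A per-locale list, such as the CLDR exemplar sets linked above, sits between these two extremes: far smaller than the full Unicode letter range, but not limited to ASCII.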

-- 
Namasté,
Jeffrey Yasskin

