[Python-3000] string module trimming
Jeffrey Yasskin
jyasskin at gmail.com
Sat Apr 28 01:59:26 CEST 2007
On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 4/18/07, Guido van Rossum <guido at python.org> wrote:
> > On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
>
> > > Today, string.letters works most easily with ASCII supersets, and is
> > > effectively limited to 8-bit encodings. Once everything is unicode, I
> > > don't think that 8-bit restriction should apply any more.
>
> > But we already went over this. There are over 40K letters in Unicode.
> > It simply makes no sense to have a string.letters approaching that
> > size.
>
> Agreed. But there aren't 40K (alphabetic) letters in any particular
> locale. Most individual languages will have less than 100.
Here's a relevant bunch of data from the CLDR:
http://www.unicode.org/cldr/data/charts/by_type/misc.exemplarCharacters.html
--
Namasté,
Jeffrey Yasskin
More information about the Python-3000
mailing list