[Python-3000] string module trimming

Jeffrey Yasskin jyasskin at gmail.com
Thu Apr 19 08:19:26 CEST 2007


On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 4/18/07, Guido van Rossum <guido at python.org> wrote:
> > On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> > But we already went over this. There are over 40K letters in Unicode.
> > It simply makes no sense to have a string.letters approaching that
> > size.
>
> Agreed.  But there aren't 40K (alphabetic) letters in any particular
> locale.  Most individual languages will have less than 100.

I missed the beginning of this discussion, so sorry if you've already
covered this. Are you saying that in your app, just because I've set
the en_US locale, I won't be able to type "こにちは"? Or that those
characters won't be recognized as letters?

The Unicode character database (http://www.unicode.org/ucd/) seems
like the obvious way to handle character properties if you want to get
the right answers.

Namasté,
Jeffrey Yasskin


More information about the Python-3000 mailing list