[Python-3000] string module trimming
jyasskin at gmail.com
Thu Apr 19 08:19:26 CEST 2007
On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> On 4/18/07, Guido van Rossum <guido at python.org> wrote:
> > On 4/18/07, Jim Jewett <jimjjewett at gmail.com> wrote:
> > But we already went over this. There are over 40K letters in Unicode.
> > It simply makes no sense to have a string.letters approaching that
> > size.
> Agreed. But there aren't 40K (alphabetic) letters in any particular
> locale. Most individual languages will have less than 100.
I missed the beginning of this discussion, so sorry if you've already
covered this. Are you saying that in your app, just because I've set
the en_US locale, I won't be able to type "こんにちは"? Or that those
characters won't be recognized as letters?
The Unicode character database (http://www.unicode.org/ucd/) seems
like the obvious way to handle character properties if you want to get
the right answers.
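
As a rough sketch of what that could look like, assuming Python's
standard unicodedata module is an acceptable front end to the UCD
(the helper name is_letter is made up for illustration):

```python
import unicodedata

def is_letter(ch):
    """Return True if the UCD general category of ch is a letter class.

    Letter categories all start with 'L' (Lu, Ll, Lt, Lm, Lo).
    The answer does not depend on the process locale.
    """
    return unicodedata.category(ch).startswith("L")

# Sample text: hiragana typed by a user whose locale might be en_US.
sample = "こんにちは"
for ch in sample:
    # Print each character with its UCD category and the helper's verdict.
    print(ch, unicodedata.category(ch), is_letter(ch))
```

The lookup is per character, so nothing here needs a precomputed
string.letters-style constant of any size.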