sre is broken in SuSE 9.2

Serge Orlov Serge.Orlov at gmail.com
Fri Feb 11 09:48:59 EST 2005


Denis S. Otkidach wrote:
> On 10 Feb 2005 11:49:33 -0800
> "Serge Orlov" <Serge.Orlov at gmail.com> wrote:
>
> > This thread is about problems only with LANG=C or LANG=POSIX, it's
not
> > about other locales. Other locales are working as expected.
>
> You are not right.  I have LANG=de_DE.UTF-8, and the Python
test_re.py
> doesn't pass.

I meant "only with C or POSIX locales" when I wrote "only with LANG=C
or LANG=POSIX". My bad.

> $LANG doesn't matter if I don't call setlocale.

Sure.

> Fortunately setting any non-C locale solves the problem for all (I
> believe) unicode character:
>
> >>> re.compile(ur'\w+', re.U).findall(u'\xb5\xba\xe4\u0430')
> [u'\xb5\xba\xe4\u0430']

I can't find the strict definition of isalpha, but I believe average
C program shouldn't care about the current locale alphabet, so isalpha
is a union of all supported characters in all alphabets

  Serge.




More information about the Python-list mailing list