[Tutor] Problems processing accented characters in ISO-8859-1 encoded texts

Steven D'Aprano steve at pearwood.info
Fri Dec 24 14:13:58 CET 2010


Josep M. Fontana wrote:

> Just one more question. You say that \w means alphanumeric, not just
> alpha. Is there any expression that would mean "just alpha" and (given
> the appropriate LOCALE setting) would match 'a' and 'ö' but not '9'?

Unfortunately, I don't think there is a standard code for just alpha. 
Apart from listing all the characters individually, I haven't been able 
to find a way to get the result you want, but I'll admit I haven't tried 
that hard. Perhaps somebody with more regex skills can answer?




-- 
Steven



More information about the Tutor mailing list