problems with regex in Japanese?

Paul Prescod paulp at ActiveState.com
Sat Aug 11 13:54:37 EDT 2001


Just van Rossum wrote:
> 
>...
> 
> How would PCRE ever be able to match groups of characters above code
> point 127, that are represented as more than one byte in utf-8? Or is
> that a limitation you decided to live with?

You would just match on the byte-patterns you expect. I think that's
more or less how Perl Unicode works.
-- 
Take a recipe. Leave a recipe.  
Python Cookbook!  http://www.ActiveState.com/pythoncookbook




More information about the Python-list mailing list