[Python-Dev] I18N Toolkit

Andrew M. Kuchling akuchlin@mems-exchange.org
Tue, 9 Nov 1999 14:36:28 -0500 (EST)


Barry A. Warsaw writes:
(in relation to support for Unicode regexes)
>We need to be careful not to worsen performance for 8bit strings.  I
>think we're already on the edge of acceptability w.r.t. P*** and
>hopefully we can /improve/ performance here.

I don't think that will be a problem, given that the Unicode engine
would be a separate C implementation.  A bit of 'if type(strg) ==
UnicodeType' in re.py isn't going to cost very much speed.

(Speeding up PCRE -- that's another question.  I'm often tempted to
rewrite pcre_compile to generate an easier-to-analyse parse tree,
instead of its current complicated-but-memory-parsimonious compiler,
but I'm very reluctant to introduce a fork like that.)

-- 
A.M. Kuchling			http://starship.python.net/crew/amk/
The world does so well without me, that I am moved to wish that I could do
equally well without the world.
    -- Robertson Davies, _The Diary of Samuel Marchbanks_