much slower than grep on some regular expressions

Kris Kennaway kris at
Tue Jul 8 15:58:31 CEST 2008

samwyse wrote:
> On Jul 4, 6:43 am, Henning_Thornblad <Henning.Thornb... at>
> wrote:
>> What can be the cause of the large difference between and
>> grep?
>> While doing a simple grep:
>> grep '[^ "=]*/' input                  (input contains 156.000 a in
>> one row)
>> doesn't even take a second.
>> Is this a bug in python?
> You might want to look at Plex.
> "Another advantage of Plex is that it compiles all of the regular
> expressions into a single DFA. Once that's done, the input can be
> processed in a time proportional to the number of characters to be
> scanned, and independent of the number or complexity of the regular
> expressions. Python's existing regular expression matchers do not have
> this property. "

Very interesting!  Thanks very much for the pointer.
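
To make the quoted "time proportional to the number of characters" property concrete, here is a minimal sketch for the one pattern from the original question, [^ "=]*/. It is not Plex and not how grep works internally; it is a hand-written single-pass scanner, and the names linear_search, PATTERN and EXCLUDED are made up for this illustration. My understanding of the slowdown is that a backtracking matcher retries from every start position and rescans the rest of the row each time, which is roughly quadratic on a long row containing no '/'. The scanner below visits each character once and is meant to return the same span that re.search would.

```python
import re

# The pattern from the original question, kept only for comparison.
PATTERN = re.compile(r'[^ "=]*/')

# Characters that the class [^ "=] refuses to match.
EXCLUDED = ' "='


def linear_search(text):
    """Return the (start, end) span PATTERN.search(text) would find,
    or None, visiting each character of text exactly once."""
    run_start = 0    # start of the current run of non-excluded characters
    last_slash = -1  # index of the last '/' seen in the current run
    for i, ch in enumerate(text):
        if ch in EXCLUDED:
            if last_slash >= 0:
                # The run ended and contained a '/': leftmost match found,
                # and the greedy class stops at the last '/' of the run.
                return (run_start, last_slash + 1)
            run_start = i + 1
        elif ch == '/':
            last_slash = i
    if last_slash >= 0:
        return (run_start, last_slash + 1)
    return None


# One row of 156000 'a' characters, as in the original question.
print(linear_search('a' * 156000))  # None: there is no '/' in the row
```

On short strings the result can be checked against PATTERN.search(...).span(). This only covers this single pattern; a general tool would build the state machine from the expression instead of hard-coding it.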


