re.search much slower then grep on some regular expressions
kris at FreeBSD.org
Tue Jul 8 15:58:31 CEST 2008
> On Jul 4, 6:43 am, Henning_Thornblad <Henning.Thornb... at gmail.com>
>> What can be the cause of the large difference between re.search and
>> While doing a simple grep:
>> grep '[^ "=]*/' input (input contains 156.000 a in
>> one row)
>> doesn't even take a second.
>> Is this a bug in python?
> You might want to look at Plex.
> "Another advantage of Plex is that it compiles all of the regular
> expressions into a single DFA. Once that's done, the input can be
> processed in a time proportional to the number of characters to be
> scanned, and independent of the number or complexity of the regular
> expressions. Python's existing regular expression matchers do not have
> this property. "
Very interesting! Thanks very much for the pointer.
More information about the Python-list