How to make regexes faster? (Python v. OmniMark)

M.-A. Lemburg mal at lemburg.com
Sat Apr 20 05:10:48 EDT 2002


Tim Peters wrote:
> 
> [Donn Cave]
> > By the way, I'll second Johannes Stiehler's recommendation of
> > MxTextTools.  Definitely appropriate for SGML parsing, and much
> > better than regexps for extensive parsing in my opinion - not
> > just in terms of speed, but I suspect a more powerful way to
> > describe text patterns than regexps.
> 
> Yes, it is.  "More convenient" is arguable, though -- there's a steep
> learning curve, but then people often forget how hard it was to learn regexp
> syntax and pragmatics too.

BTW, in you care, the next version will have full Unicode support,
so parsing e.g. a Unicode XML file is possible too. I've also
added better support for writing tag tables: jump targets can
now be string labels, there's a tag table compiler and some
other interesting things in the new beta.

Also, all the other functions and tools in mxTextTools
are now Unicode aware.

If you're interested in the beta, please subscribe to the
egenix-users mailing list (see the web-site). The archives
have the latest announcements.

-- 
Marc-Andre Lemburg
CEO eGenix.com Software GmbH
______________________________________________________________________
Company & Consulting:                           http://www.egenix.com/
Python Software:                   http://www.egenix.com/files/python/





More information about the Python-list mailing list