Code that ought to run fast, but can't due to Python limitations.
nobody at nowhere.com
Sun Jul 5 03:25:38 CEST 2009
On Sat, 04 Jul 2009 16:35:08 -0700, John Nagle wrote:
> The temptation is to write tokenizers in C, but that's an admission
> of language design failure.
The only part that really needs to be written in C is the DFA loop. The
code to construct the state table from regexps could be written
entirely in Python, but I don't see any advantage to doing so.
More information about the Python-list