Code that ought to run fast, but can't due to Python limitations.

Nobody nobody at nowhere.com
Sun Jul 5 03:25:38 CEST 2009


On Sat, 04 Jul 2009 16:35:08 -0700, John Nagle wrote:

>     The temptation is to write tokenizers in C, but that's an admission
> of language design failure.

The only part that really needs to be written in C is the DFA loop. The
code to construct the state table from regexps could be written
entirely in Python, but I don't see any advantage to doing so.




More information about the Python-list mailing list