
Aug. 21, 2002
4:24 p.m.
On Wed, Aug 21, 2002, Skip Montanaro wrote:
aahz> I'm mostly going by comments Tim Peters has made in the past
aahz> suggesting that regex tools are poor for parsing.
parsing != tokenizing. ;-) Regular expressions are great for tokenizing (most of the time).
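To make the distinction concrete, here is a minimal Python sketch (not from the original thread). The toy arithmetic grammar and the names `TOKEN_RE`, `tokenize`, and `parse` are assumptions made up for this example. The tokenizer uses a single regular expression to chop the text into a flat list of tokens. The parser then uses recursion to recover nesting such as parentheses, which is the part a classical regular expression cannot express.

```python
import re

# Hypothetical token grammar for arithmetic such as "2 * (3 + 4)":
# optional whitespace, then either an integer or a one-character operator.
TOKEN_RE = re.compile(r"\s*(?:(?P<NUMBER>\d+)|(?P<OP>[-+*/()]))")


def tokenize(text):
    """Tokenizing: split text into a flat list of (kind, value) pairs."""
    tokens = []
    pos = 0
    text = text.rstrip()
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise SyntaxError(f"unexpected character at offset {pos}")
        tokens.append((match.lastgroup, match.group(match.lastgroup)))
        pos = match.end()
    return tokens


def parse(tokens):
    """Parsing: build a nested tuple tree from the flat token list."""
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else (None, None)

    def advance():
        nonlocal pos
        token = peek()
        pos += 1
        return token

    def expr():  # expr := term (('+' | '-') term)*
        node = term()
        while peek() in (("OP", "+"), ("OP", "-")):
            op = advance()[1]
            node = (op, node, term())
        return node

    def term():  # term := atom (('*' | '/') atom)*
        node = atom()
        while peek() in (("OP", "*"), ("OP", "/")):
            op = advance()[1]
            node = (op, node, atom())
        return node

    def atom():  # atom := NUMBER | '(' expr ')'
        kind, value = advance()
        if kind == "NUMBER":
            return int(value)
        if (kind, value) == ("OP", "("):
            node = expr()  # recursion handles arbitrary nesting
            if advance() != ("OP", ")"):
                raise SyntaxError("expected ')'")
            return node
        raise SyntaxError(f"unexpected token {value!r}")

    tree = expr()
    if pos != len(tokens):
        raise SyntaxError("trailing tokens")
    return tree


if __name__ == "__main__":
    tokens = tokenize("2 * (3 + 4)")
    print(tokens)
    print(parse(tokens))
```

The regular expression only ever looks at one token at a time; the structure (operator precedence, matching parentheses) comes entirely from the recursive functions in `parse`.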
Ah. Here we see one of the little drawbacks of not finishing my CS degree. ;-) Can someone suggest a good simple reference on the distinctions between parsing / lexing / tokenizing, particularly in the context of general string processing (e.g. XML) rather than the arcane art of compiler technology?
--
Aahz (aahz@pythoncraft.com) <*> http://www.pythoncraft.com/
Project Vote Smart: http://www.vote-smart.org/