word stemming / conflation

Andreas Jung andreas at andreas-jung.com
Sun Apr 21 22:00:36 EDT 2002


I have written  wrappers for the snowball stemmers of
Dr. Porter: http://pystemmer.sourceforge.net.

Andreas


----- Original Message ----- 
From: "Terry Hancock" <hancock at anansispaceworks.com>
To: <python-list at python.org>
Sent: Sunday, April 21, 2002 09:44
Subject: word stemming / conflation


> I was working on an algorithm to condense strings of
> text into mnemonic labels (hopefully this will produce
> more meaningful labels than simply assigning a number!).
> I got the idea from the way later versions of
> latex2html generate filenames.
> 
> Anyway, after tinkering with this for awhile I've
> discovered that it can be a bottomless pit! A little
> web research reveals various MS / PhD papers for doing
> this task (which apparently is called "conflation" as
> well as "word stemming") in English and lots of other
> languages. Looks like it's a lot harder than you
> might at first think. I didn't find much in Python,
> though.
> 
> So, I'm wondering if there are some Python resources
> (or better-yet already written modules) for doing
> this sort of thing.  I don't really need a completely
> robust system -- the occasional error will be quite
> tolerable. So I'm more interested in quick and simple
> solutions that work most of the time, rather than
> the ultra-robust library-science type.
> 
> Thanks for any information or leads you might be
> able to suggest,
> 
> Terry
> 
> -- 
> ------------------------------------------------------
> Terry Hancock
> hancock at anansispaceworks.com       
> ------------------------------------------------------
> 
> 
> -- 
> http://mail.python.org/mailman/listinfo/python-list
> 






More information about the Python-list mailing list