word stemming / conflation
Andreas Jung
andreas at andreas-jung.com
Sun Apr 21 22:00:36 EDT 2002
I have written Python wrappers for the Snowball stemmers of
Dr. Porter: http://pystemmer.sourceforge.net.
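
If a "works most of the time" approach is enough for
building labels, plain suffix stripping may already do.
What follows is only a minimal sketch and NOT the
Porter/Snowball algorithm, nor the API of the wrappers
above. The names crude_stem, make_label, SUFFIXES and
STOPWORDS are made up for illustration, and the suffix
and stop-word lists are arbitrary assumptions.

```python
import re

# Hypothetical, hand-picked lists; tune them for your own text.
SUFFIXES = ("ations", "ation", "ingly", "ings", "ing",
            "edly", "ed", "ies", "es", "s", "ly")
STOPWORDS = {"a", "an", "the", "of", "from", "to", "in", "and", "for"}


def crude_stem(word, min_stem=3):
    """Strip the longest matching suffix, keeping at least min_stem letters.

    This is naive suffix stripping, not a real stemming algorithm.
    """
    word = word.lower()
    # Try longer suffixes first so "ings" wins over "s".
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[:-len(suffix)]
    return word


def make_label(text, max_words=4):
    """Join the stems of the first few non-stop-words with underscores."""
    words = re.findall(r"[A-Za-z]+", text)
    stems = [crude_stem(w) for w in words if w.lower() not in STOPWORDS]
    return "_".join(stems[:max_words])


if __name__ == "__main__":
    print(make_label("Generating filenames from section titles"))
```

The stems it produces are often not real words ("filenam",
"titl"), which is fine for mnemonic labels but is exactly
where a proper stemmer does better.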
Andreas
----- Original Message -----
From: "Terry Hancock" <hancock at anansispaceworks.com>
To: <python-list at python.org>
Sent: Sunday, April 21, 2002 09:44
Subject: word stemming / conflation
> I was working on an algorithm to condense strings of
> text into mnemonic labels (hopefully this will produce
> more meaningful labels than simply assigning a number!).
> I got the idea from the way later versions of
> latex2html generate filenames.
>
> Anyway, after tinkering with this for a while I've
> discovered that it can be a bottomless pit! A little
> web research reveals various MS / PhD papers for doing
> this task (which apparently is called "conflation" as
> well as "word stemming") in English and lots of other
> languages. Looks like it's a lot harder than you
> might at first think. I didn't find much in Python,
> though.
>
> So, I'm wondering if there are some Python resources
> (or, better yet, already-written modules) for doing
> this sort of thing. I don't really need a completely
> robust system -- the occasional error will be quite
> tolerable. So I'm more interested in quick and simple
> solutions that work most of the time, rather than
> the ultra-robust library-science type.
>
> Thanks for any information or leads you might be
> able to suggest,
>
> Terry
>
> --
> ------------------------------------------------------
> Terry Hancock
> hancock at anansispaceworks.com
> ------------------------------------------------------