"Standard" Full Text Search Engine
Stephan Diehl
stephan.diehl at gmx.net
Fri Oct 26 09:09:48 EDT 2007
Martin Marcher wrote:
> Hello,
>
> is there something like a standard full text search engine?
>
> I'm thinking of the equivalent for python like lucene is for java or
> ferret for rails. Preferrably something that isn't exactly a clone of
> one of those but more that is python friendly in terms of the API it
> provides.
>
> Things I'd like to have:
>
> * different languages are supported (it seems most FTSs do only english)
> * I'd like to be able to provide an identifier (if I index files in
> the filesystem that would be the filename, or an ID if it lives in a
> database, or whatever applies)
> * I'd like to pass it just some (user defined) keywords with content,
> the actual content (as string, or list of strings or whatever) and to
> retrieve the results by search by keyword
> * something like a priority should be assignable to different fields
> (like field: title(priority=10, content="My Draft"),
> keywords(priority=50, list_of_keywords))
>
> Unnecessary:
>
> * built-in parsing of different files
>
> The "standard" I'm referring to would be something with a large and
> active user base. Like... WSGI is _the_ thing to refer to when doing
> webapps it should be something like $FTS-Engine is _the_ engine to
> refer to.
>
> any hints?
>
I'm using swish-e (swish-e.org) for all my indexing needs. I'm not sure if
there's a python binding available, I'm using swish-e as an external
executable and live quite happyly with that.
More information about the Python-list
mailing list