Storing pairs of (int, int) in a database : which db to choose ?

Stormbringer andreif at mail.dntis.ro
Wed Dec 24 04:48:46 EST 2003


Jp Calderone <exarkun at intarweb.us> wrote in message news:<mailman.87.1072220178.684.python-list at python.org>...
> On Tue, Dec 23, 2003 at 12:54:39PM -0800, Stormbringer wrote:
> > Jp Calderone <exarkun at intarweb.us> wrote in message news:<mailman.75.1072
>  189772.684.python-list at python.org>...
> > > On Tue, Dec 23, 2003 at 04:35:50AM -0800, Stormbringer wrote:
> > > > Hi,
> > > > 
> > > > I want to implement a fulltext search for messages in a forum. More
> > > > exactly for each message I store pairs (wordId, msgId) for each
> > > > identified word and when I search something I want to be able to
> > > > retrieve very quickly all msgId for a given wordId.
> > > > 
> > > 
> > >   A pure Python fulltext indexer - http://divmod.org/Lupy/index.html
> > 
> > Thanks ! This is exactly what I needed, and the size of the indexes is
> > around 30%, much much less than what I could have achieved with my
> > code. Not to mention the fact that I get phrase search and some other
> > goodies :)
> > 
> > The only thing that bothers me a little is the speed for building the
> > index, I tried with around 5000 messages and I am not quite thrilled,
> > it's not  extremly  slow but it has to be faster for what I need.
> > Perhaps I'll use the C++ version with some Python bindings.
> > 
> 
>   Yea, I hear that.  Work is being done on speeding it up (pretty much the
> only development on it now is optimization).  I don't know how it will end
> up, but things look promising so far.  On the other hand, if you don't want
> to wait for that to be finished...

Well - that depends. If there will be a faster version of lupy when
I'll really need it in 1-2 months then I will use that. Else if I can
find a faster equivalent I will use that. Just beeing practical.

Andrei




More information about the Python-list mailing list