Storing pairs of (int, int) in a database : which db to choose ?
Stormbringer
andreif at mail.dntis.ro
Wed Dec 24 04:48:46 EST 2003
Jp Calderone <exarkun at intarweb.us> wrote in message news:<mailman.87.1072220178.684.python-list at python.org>...
> On Tue, Dec 23, 2003 at 12:54:39PM -0800, Stormbringer wrote:
> > Jp Calderone <exarkun at intarweb.us> wrote in message news:<mailman.75.1072
> 189772.684.python-list at python.org>...
> > > On Tue, Dec 23, 2003 at 04:35:50AM -0800, Stormbringer wrote:
> > > > Hi,
> > > >
> > > > I want to implement a fulltext search for messages in a forum. More
> > > > exactly for each message I store pairs (wordId, msgId) for each
> > > > identified word and when I search something I want to be able to
> > > > retrieve very quickly all msgId for a given wordId.
> > > >
> > >
> > > A pure Python fulltext indexer - http://divmod.org/Lupy/index.html
> >
> > Thanks ! This is exactly what I needed, and the size of the indexes is
> > around 30%, much much less than what I could have achieved with my
> > code. Not to mention the fact that I get phrase search and some other
> > goodies :)
> >
> > The only thing that bothers me a little is the speed for building the
> > index, I tried with around 5000 messages and I am not quite thrilled,
> > it's not extremly slow but it has to be faster for what I need.
> > Perhaps I'll use the C++ version with some Python bindings.
> >
>
> Yea, I hear that. Work is being done on speeding it up (pretty much the
> only development on it now is optimization). I don't know how it will end
> up, but things look promising so far. On the other hand, if you don't want
> to wait for that to be finished...
Well - that depends. If there will be a faster version of lupy when
I'll really need it in 1-2 months then I will use that. Else if I can
find a faster equivalent I will use that. Just beeing practical.
Andrei
More information about the Python-list
mailing list