[Catalog-sig] Search engine relevance
donald at stufft.io
Fri Mar 8 16:22:39 CET 2013
On Mar 8, 2013, at 9:51 AM, Jacob Kaplan-Moss <jacob at jacobian.org> wrote:
> Hi Antoine -
> Yes, PyPI's search engine is rather simplistic, I think that's a
> pretty well-known problem.
> For the time being you might try Crate instead (crate.io); I've found
> its search engine to be much much better.
Crate's search uses ElasticSearch whereas I believe PyPI is just using SQL against the DB.
That being said Crate's search could be a lot better still :/ But I'm not an expert on how to get the best search results.
> On Fri, Mar 8, 2013 at 8:00 AM, Antoine Pitrou <solipsis at pitrou.net> wrote:
>> It seems the PyPI search engine is quite crude and doesn't try to make the
>> results relevant at all.
>> For example, if I'm trying to search "agi" in the hope of finding modules
>> relevant to the Asterisk Gateway Interface (nicknamed "AGI"), I get the
>> following results:
>> As you can see, a large number of results pop up simply because they contain
>> the word "magic", which apparently is considered to match the "agi" request.
>> Clearly either the selection or the weighting algorithm isn't very efficient
>> Catalog-SIG mailing list
>> Catalog-SIG at python.org
> Catalog-SIG mailing list
> Catalog-SIG at python.org
PGP: 0x6E3CBCE93372DCFA // 7C6B 7C5D 5E2B 6356 A926 F04F 6E3C BCE9 3372 DCFA
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 841 bytes
Desc: Message signed with OpenPGP using GPGMail
More information about the Catalog-SIG