[Doc-SIG] Building Python Document 30% faster.

Naoki INADA inada-n at klab.jp
Sat Apr 4 23:01:35 CEST 2009


>>> But the searchindex.js built with PyStemmer differs from the one built
>>> with PorterStemmer.
>>
>> This could be a problem.  The client-side search implemented in JavaScript
>> uses exactly the same stemmer (which is necessary to be able to find all
>> words).  In short, if you can find a C implementation of the Porter stemmer
>> we could include it in Sphinx as an optional extension.
>
> I see.
> The original Porter stemmer is here:
> http://tartarus.org/~martin/PorterStemmer/
>
> It is implemented in C. I'll try to make a Python wrapper with SWIG and
> compare the resulting searchindex.js. Please wait a while.

I made a Python wrapper!
http://bitbucket.org/methane/porterstemmer/

This is my first extension module, and it is still an alpha version.
But I can build the Python documentation with porterstemmer, and the
resulting searchindex.js is identical to the original.
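
A check of this kind can be sketched as below. This is only an illustration,
not the script actually used for the comparison: the helper name
same_search_index and the idea of keeping the two builds in separate output
directories are assumptions. It does a byte-for-byte comparison of the
searchindex.js files from two builds, using only the standard library.

```python
import filecmp
from pathlib import Path


def same_search_index(build_a, build_b, name="searchindex.js"):
    """Return True if both build directories hold byte-identical index files.

    build_a and build_b are hypothetical HTML output directories, for example
    one built with the pure-Python stemmer and one built with the C wrapper.
    """
    file_a = Path(build_a) / name
    file_b = Path(build_b) / name
    # shallow=False forces a comparison of file contents instead of
    # trusting size and modification time from os.stat().
    return filecmp.cmp(file_a, file_b, shallow=False)
```

Calling same_search_index("build-original/html", "build-porterstemmer/html")
(both directory names are placeholders) returns True only when the two index
files match exactly, which is the condition the JavaScript search needs in
order to keep finding the same words.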

-- 
Naoki INADA  <inada-n at klab.jp>
   KLab Inc.  <http://www.klab.jp>

