ANN: DocIndexer 0.9.1.0 released
srackham at methods.co.nz
Sun Nov 4 05:33:13 CET 2007
DocIndexer now handles unicode (the previous release was only really
comfortable with ascii). A full list of changes is in the CHANGELOG.
What is it?
DocIndexer is a document indexer toolkit that uses the PyLucene search
engine for indexing and searching document files. DocIndexer includes
command-line utilities, Python index and search classes plus a Win32
COM server that can be used to integrate indexing and searching into
application software. The current version has parser support for
Microsoft Word, HTML, PDF and plain text documents.
Win32: None (compiled binary distribution).
Linux: Python 2.5, PyLucene 2, antiword and poppler-utils.
More information about the Python-announce-list