Question regarding Indexting
I have some questions about MM3 Indexing. I am using Xapian as the search engine.
- I first tested it on a small archive of 3 messages:
/opt/mailman/mm/bin/django-admin update_index_one_list test@subdomain-lists.com
It said "Indexing 3 emails". I assume that is good and then it stopped. No success message at all. Is that normal? I reran the above line and it said "Indexing 3 emails again. Is that normal behavior as well?
- In the fulltext_index folder is a lot of .seg files and other file types as well such as .glass. What are these files/folders in the fulltext_index folder?
Sorry for these questions. I am trying to learn some of the additional features of Mailman and am looking in detail at the search indexing. I have a client who is bringing over a large mbox file of archives dating back to 2006.
Thanks, Brian
On Mon, Mar 16, 2020, at 2:28 PM, brian_carpenter@emwd.com wrote:
I have some questions about MM3 Indexing. I am using Xapian as the search engine.
- I first tested it on a small archive of 3 messages:
/opt/mailman/mm/bin/django-admin update_index_one_list test@subdomain-lists.com
It said "Indexing 3 emails". I assume that is good and then it stopped. No success message at all. Is that normal? I reran the above line and it said "Indexing 3 emails again. Is that normal behavior as well?
That is the normal behavior, no error implies success, but a success message might be worth adding I suppose.
There are two commands,
update_index
: This is meant for cron job and by default indexes only the new emails since the last indexing operation.update_index_one_list
: This is meant for manual run after you import archives a list to Hyperkitty and indexes all the emails.
The calls to update_index_one_list
are sort of idempotent and will do the same operation every time. Hence the output is same each time you run it.
The calls to update_index
depends on the number of new emails received since last run.
- In the fulltext_index folder is a lot of .seg files and other file types as well such as .glass. What are these files/folders in the fulltext_index folder?
Not sure, custom Xapian files I presume. The format of the fulltext index is more or less Opaque. We interact with it only through the Python APIs and the folder is mostly under the control of the Xapian library. Or whatever indexing library HK is configured to use.
Sorry for these questions. I am trying to learn some of the additional features of Mailman and am looking in detail at the search indexing. I have a client who is bringing over a large mbox file of archives dating back to 2006.
Thanks, Brian
Mailman-Developers mailing list -- mailman-developers@python.org To unsubscribe send an email to mailman-developers-leave@python.org https://mail.python.org/mailman3/lists/mailman-developers.python.org/ Mailman FAQ: https://wiki.list.org/x/AgA3
Security Policy: https://wiki.list.org/x/QIA9
-- thanks, Abhilash Raj (maxking)
On 3/16/20 7:00 PM, Abhilash Raj wrote:
Not sure, custom Xapian files I presume. The format of the fulltext index is more or less Opaque. We interact with it only through the Python APIs and the folder is mostly under the control of the Xapian library. Or whatever indexing library HK is configured to use.
Thank you Abhilash. You have been very helpful today.
-- Please let me know if you need further assistance.
Thank you for your business. We appreciate our clients. Brian Carpenter EMWD.com
-- EMWD's Knowledgebase: https://clientarea.emwd.com/index.php/knowledgebase
EMWD's Community Forums http://discourse.emwd.com/
participants (3)
-
Abhilash Raj
-
Brian Carpenter
-
brian_carpenter@emwd.com