On Fri, Mar 30, 2012 at 10:48 AM, Stephen J. Turnbull <email@example.com> wrote:
On Fri, Mar 30, 2012 at 12:24 PM, Shayan Md firstname.lastname@example.org wrote:
On Fri, Mar 30, 2012 at 5:05 AM, Stephen J. Turnbull <email@example.com> wrote:
And (2) search and retrieval may do a *lot* of message access, for example if you want to do data mining (see Ana from Spain's thread).
Isn't that the purpose of an index?
Yes, of course, but possibly it's not good enough. An index is enough when you know in advance which "features" (e.g., author) you want to index; then you can use an online algorithm, indexing messages as they come in. However, many data mining methods are adaptive, meaning that they discover features of the corpus over time (e.g., through "Bayesian" algorithms) and then need to go back and reindex or cross-reference previously examined messages based on more accurate feature specifications.
As I say, if that's not your purpose, then you don't have to worry about it. But in some cases it will matter (e.g., pretend you're Google, not just a GSoC student!)
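To make the distinction between online indexing and adaptive reindexing concrete, here is a minimal sketch in Python. It is not Mailman code: the class FeatureIndex, its methods, and the "is_reply" feature are all hypothetical names invented for illustration, and the only real library call is the standard library's email.message_from_string. With a fixed feature set, each message is touched once on arrival; adding a feature later forces a pass over every stored message.

```python
from collections import defaultdict
from email import message_from_string


class FeatureIndex:
    """Toy inverted index (hypothetical): feature -> value -> message ids."""

    def __init__(self):
        self.store = {}        # message id -> parsed message (the "archive")
        self.extractors = {}   # feature name -> function(message) -> value
        self.index = defaultdict(lambda: defaultdict(set))

    def add_message(self, msg_id, raw_text):
        """Online step: index one new message under every known feature."""
        msg = message_from_string(raw_text)
        self.store[msg_id] = msg
        for name, extract in self.extractors.items():
            self.index[name][extract(msg)].add(msg_id)

    def add_feature(self, name, extract):
        """Adaptive step: a newly discovered feature requires revisiting
        every stored message, which is where the heavy access comes from."""
        self.extractors[name] = extract
        self.index[name].clear()
        for msg_id, msg in self.store.items():
            self.index[name][extract(msg)].add(msg_id)

    def lookup(self, name, value):
        """Return the sorted ids of messages whose feature has this value."""
        return sorted(self.index[name][value])


idx = FeatureIndex()
# Feature known in advance: indexed online as messages arrive.
idx.add_feature("author", lambda m: m["From"])
idx.add_message(1, "From: alice@example.com\nSubject: search\n\nfirst body")
idx.add_message(2, "From: bob@example.com\nSubject: Re: search\n\nsecond body")
# Feature "discovered" later: triggers a full pass over the archive.
idx.add_feature(
    "is_reply", lambda m: (m["Subject"] or "").lower().startswith("re:")
)
print(idx.lookup("author", "alice@example.com"))
print(idx.lookup("is_reply", True))
```

The cost of add_message does not depend on the size of the archive, while add_feature is linear in it. An archiver that only supports the first pattern efficiently would likely make the second one painful.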
Okay then, can you please tell me how we can put this search code to best use in Mailman 3? I have a proposal to write, and I am getting more unsure of things day by day. Can you also tell me who the mentor of this project is?