Writing huge Sets() to disk

Istvan Albert ialbert at mailblocks.com
Mon Jan 10 15:44:49 EST 2005


Martin MOKREJŠ wrote:

> Istvan Albert wrote:

> So you say 1 million words is better to store in dictionary than
> in a set and use your own function to get out those unique or common
> words?

I have said nothing even remotely like that.

> Fine, that's what I wanted to hear. How do you improve the algorithm?
> Do you delay indexing to the very latest moment or do you let your
> computer index 999 999 times just for fun?

I think you first need to understand how dictionaries work.
The average time needed to insert a key is independent of
the number of keys already in the dictionary.
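
A rough way to check this for yourself is sketched below. It is only an
illustration, not a careful benchmark: the helper name time_inserts and the
batch parameter are made up for this example, and the timings will vary by
machine and Python version. It times how long it takes to add a fixed batch
of new keys to dictionaries of very different starting sizes.

```python
import timeit


def time_inserts(existing, batch=10_000):
    """Return the seconds taken to insert `batch` new keys into a
    dictionary that already holds `existing` keys.

    Hypothetical helper for illustration only.
    """
    # Pre-fill the dictionary with `existing` integer keys.
    d = dict.fromkeys(range(existing))
    # Keys that are guaranteed not to be in the dictionary yet.
    new_keys = range(existing, existing + batch)

    def insert():
        for key in new_keys:
            d[key] = None

    # Run the insertion loop exactly once and return the elapsed time.
    return timeit.timeit(insert, number=1)


if __name__ == "__main__":
    for size in (1_000, 100_000, 1_000_000):
        print(size, time_inserts(size))
```

If insertion cost grew with the size of the dictionary, the last figure
would be far larger than the first. An individual insert can be slower when
the dictionary has to resize its table, which is why the claim is about the
average cost.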

Istvan.


