[Tutor] Simple counter to determine frequencies of words in adocument
alan.gauld at btinternet.com
Sun Nov 21 01:49:36 CET 2010
"col speed" <ajarncolin at gmail.com> wrote
> Just my twopenneth, I'm a noob and I'm not going to try such a big
> file on
> my old machine, but:
> 1. Maybe create a *set* from the wordlist, loop through that, so you
> "count" on wordlist only once. OR
This would be an improvement but still involves traversing the entire
list N times where N is the number of unique words.
> 2. Use collections.defaultdict(int) and loop through wordlist and do
> dic[word] += 1
This is what his second version is supposed to do and is the
best solution since it only involves a single trasverse of the file.
More information about the Tutor