A little advice please? (Convert my boss to Python)

Paul Rubin phr-n2002a at nightsong.com
Tue Apr 16 03:54:58 EDT 2002


"Duncan Smith" <buzzard at urubu.freeserve.co.uk> writes:
> So what I'm looking for is speed, and some advice so that I don't end up
> trying too many alternatives. 

If you have to do something like that over and over for zillions of
huge files, you're best off writing in C and tuning carefully.

Regarding duplicates, maybe you can just sort the file with an
external sort utility, so the duplicates will all be next to each
other.  Then you only have to compare each line with the one before
it, and you don't have to mess with dicts.  I didn't examine your
code closely enough to figure out if that makes sense, so maybe it doesn't.
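
A rough sketch of the sorted-file idea, in case it helps.  It assumes
one record per line and that the file has already been sorted
externally (for example with the Unix sort utility).  The function
names are made up for illustration, and this is not tuned for speed.

```python
from itertools import groupby


def count_runs(sorted_lines):
    """Yield (line, count) for each run of identical adjacent lines.

    Only correct if the input is sorted (or at least grouped), because
    groupby() merges neighbouring equal items, not all equal items.
    """
    for line, run in groupby(sorted_lines):
        yield line, sum(1 for _ in run)


def drop_duplicates(sorted_lines):
    """Yield each distinct line once, keeping the sorted order."""
    for line, _ in groupby(sorted_lines):
        yield line


# Small in-memory demonstration; a real run would iterate over an
# open file object instead of a list.
records = ["a", "a", "b", "c", "c", "c"]
duplicated = [line for line, n in count_runs(records) if n > 1]
```

Since only the current run is held in memory, this should work on
files much larger than RAM, which a dict of every line would not.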


