[perl-python] a program to delete duplicate files

David Eppstein eppstein at ics.uci.edu
Thu Mar 10 23:32:56 EST 2005


In article <1110372973.657649.212920 at l41g2000cwc.googlegroups.com>,
 "Xah Lee" <xah at xahlee.org> wrote:

> a absolute requirement in this problem is to minimize the number of
> comparison made between files. This is a part of the spec.

You need do no comparisons between files.  Just use a sufficiently 
strong hash algorithm (SHA-256 maybe?) and compare the hashes.

-- 
David Eppstein
Computer Science Dept., Univ. of California, Irvine
http://www.ics.uci.edu/~eppstein/



More information about the Python-list mailing list