Howto find same files?

Favre Gregoire greg at ulima.unil.ch
Sun Oct 29 09:52:07 EST 2000


Erik Max Francis wrote:
> 
> gregoire.favre at ima.unil.ch wrote:
> 
> > Would it be a good idea to create a file which contains the
> > path, filename, size and md5sum, and then work from that?
> 
> My utility which eliminates duplicate files (written in Python) simply
> keeps 1. the filename it first saw the file as (for reference), 2. a tag
> which can be used to indicate in what group it was seen (say, a date),
> 3. the file size, and 4. a 32-bit CRC.  Since you're evidently already
> using a UNIX-like system, you can get the CRC for free:  `cksum'.  (This
> also gives you the file size without any extra work, which is helpful
> not only for uniquely identifying the file, but also for being able
> to determine the size of your processed collection from the database
> alone.)
> 
> It's really quite straightforward to write one; what's giving you
> trouble?

>  Erik Max Francis / max at alcyone.com / http://www.alcyone.com/max/
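
For anyone following along, the record Erik describes (first-seen filename,
group tag, file size, 32-bit CRC) could be sketched in Python roughly as
below. This is not his utility, only a guess at the idea: the names Record,
crc32_of and find_duplicates are made up for illustration, and it uses
zlib.crc32 from the standard library, which does not produce the same
number that `cksum' prints.

```python
import os
import zlib
from collections import namedtuple

# Hypothetical record layout mirroring the four fields described above.
Record = namedtuple("Record", "filename tag size crc")


def crc32_of(path, chunk_size=65536):
    """Return the zlib CRC-32 of a file, read in chunks.

    Note: this is zlib's CRC-32, not the value POSIX cksum reports.
    """
    crc = 0
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            crc = zlib.crc32(chunk, crc)
    return crc & 0xFFFFFFFF


def find_duplicates(root, tag):
    """Walk root and group files by (size, crc).

    Returns (seen, duplicates): seen maps (size, crc) to the Record of the
    first file found with that key; duplicates is a list of
    (later_path, first_path) pairs.
    """
    seen = {}
    duplicates = []
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # visit directories in a predictable order
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            if not os.path.isfile(path):
                continue  # skip broken symlinks and special files
            key = (os.path.getsize(path), crc32_of(path))
            if key in seen:
                duplicates.append((path, seen[key].filename))
            else:
                seen[key] = Record(path, tag, key[0], key[1])
    return seen, duplicates
```

Two files with the same size and CRC are very likely, but not certainly,
identical; comparing the bytes (for example with filecmp.cmp(a, b,
shallow=False)) before deleting anything would be the cautious choice.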

That depends on your knowledge of programming... I have looked for your utility
on your home page; is it really there?

Thank you very much,

	Greg
________________________________________________________________
http://ulima.unil.ch/greg ICQ:16624071 mailto:greg at ulima.unil.ch


