md5 and large files

Roger Binns rogerb at rogerbinns.com
Sun Oct 17 22:42:27 EDT 2004


Andrew Dalke wrote:
> I think most of the time is spent doing I/O, not computing
> the checksum.  That's probably even true if written in C.

The OP said they were using 4GB files, so unless the machine
has more than that amount of memory, either algorithm on
the entire file will be almost entirely disk bound.

Roger 





More information about the Python-list mailing list