Shorter checksum than MD5

Dan Bishop danb_83 at
Fri Sep 10 08:47:20 CEST 2004

Paul Rubin <> wrote in message news:<7xllfjfe9m.fsf at>...
> Where are the updates coming from?  Note that if you use a 32-bit
> checksum, with 100000 records you will probably have some records with
> the same checksum by accident.

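For any one given record, only if you use a checksum algorithm with
really bad clustering.

If all 2**32 checksums are equally likely, the probability that a given
record's checksum matches any of 100000 others is only about
0.0000232828. The probability that some pair among the 100000 records
collides is a different quantity (the birthday problem), and it is much
larger: about 0.69.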
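
To see which quantity the 0.0000232828 figure measures, here is a minimal sketch that computes both the single-record probability and the birthday (any-pair) probability. It assumes checksums are uniform and independent over 2**32 values; the function names are made up for illustration and are not from any library.

```python
import math

def p_given_record_collides(n_others, bits=32):
    """Chance that one given checksum equals at least one of n_others."""
    # 1 - (1 - 2**-bits)**n_others, using log1p/expm1 to keep precision
    return -math.expm1(n_others * math.log1p(-(2.0 ** -bits)))

def p_any_pair_collides(n, bits=32):
    """Birthday probability: at least two of n uniform checksums are equal."""
    space = 2.0 ** bits
    # log of the no-collision probability, prod_{i=0}^{n-1} (1 - i/space)
    log_no_collision = sum(math.log1p(-i / space) for i in range(n))
    return -math.expm1(log_no_collision)

print(p_given_record_collides(100000))  # roughly 2.33e-05
print(p_any_pair_collides(100000))      # roughly 0.69
```

The first number is the one that matters if you only ask whether one particular record is shadowed by another; the second is the one that matters if no two records in the whole set may share a checksum.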

More information about the Python-list mailing list