Shorter checksum than MD5

Dan Bishop danb_83 at yahoo.com
Fri Sep 10 08:47:20 CEST 2004


Paul Rubin <http://phr.cx@NOSPAM.invalid> wrote in message news:<7xllfjfe9m.fsf at ruckus.brouhaha.com>...
...
> Where are the updates coming from?  Note that if you use a 32-bit
> checksum, with 100000 records you will probably have some records with
> the same checksum by accident.

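That depends on which collision you mean.

If all 2**32 checksums are equally likely, the probability that one
given record's checksum matches any of the 100000 others is only about
0.0000232828.  The probability that some pair among the 100000 records
shares a checksum is much higher, about 0.69 (the birthday problem),
even for a checksum algorithm with no clustering problems.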
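
For anyone who wants to check the arithmetic, here is a minimal sketch,
assuming uniformly distributed 32-bit checksums.  The function names
p_match_given and p_any_pair are made up for illustration and are not
from any library.  The first computes the chance that one given
checksum matches at least one of n others; the second computes the
chance that at least two of n checksums are equal.

```python
import math

def p_match_given(n_others, bits=32):
    """Chance that one given checksum equals at least one of
    n_others independent, uniformly distributed checksums."""
    # 1 - (1 - 2**-bits)**n_others, evaluated in log space for accuracy.
    return -math.expm1(n_others * math.log1p(-2.0 ** -bits))

def p_any_pair(n, bits=32):
    """Chance that at least two of n independent, uniformly
    distributed checksums are equal (the birthday problem)."""
    space = 2.0 ** bits
    # log of the probability that all n checksums are distinct:
    # product over i in 0..n-1 of (1 - i/space).
    log_all_distinct = sum(math.log1p(-i / space) for i in range(n))
    return -math.expm1(log_all_distinct)

print(p_match_given(100000))
print(p_any_pair(100000))
```

With 100000 records the first value comes out near 0.0000233 and the
second near 0.69, so the two questions have very different answers.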


