Fuzzy Lookups

Gregory Piñero gregpinero at gmail.com
Tue Jan 31 10:51:44 EST 2006


> Thanks for that, I'll have a look. (So many packages, so little
> time...)

Yes, there's a standard library for everything it seems!  Except for a
MySQL api :-(

> > I wrote a script to delete duplicate mp3's by filename a few years
> > back with this.  If anyone's interested in seeing it, I'll post a blog
> > entry on it.  I'm betting it uses a similiar algorithm your functions.
>
> I would be very interested it seeing that.

Done, see:
http://www.blendedtechnologies.com/removing-duplicate-mp3s-with-python-a-naive-yet-fuzzy-approach/60

If anyone would be kind enough to improve it I'd love to have these
features but I'm swamped this week!

- MD5 checking for find exact matches regardless of name
- Put each set of duplicates in its own subfolder.



>
>


--
Gregory Piñero
Chief Innovation Officer
Blended Technologies
(www.blendedtechnologies.com)



More information about the Python-list mailing list