[Tutor] First program after PyCamp
Chris Calloway
cbc at unc.edu
Wed Jun 12 20:39:02 CEST 2013
On 6/12/2013 11:18 AM, bjames at Jamesgang.dyndns.org wrote:
> I've updated this code and to make it more easily readible put it in a
> github repo https://github.com/CyberCowboy/FindDuplicates
>
> Everything is working, however the code is hard to read and I'll be
> working on cleaning that up, as well as splitting the program into 3
> different functions (one that gets hashes, one that finds and identifies
> the duplicates, and one that outputs the results)
>
> However I'm having a problem in that if during the hashing faze a filename
> with non-ascii characters is encountered the file errors out. Since this
> is going to be used at work and we have a large number of Chinese and
> Arabic filenames I need to have the search allow a unicode character set.
> How would I go about doing this? Python 2.7 btw.
Feed os.walk a unicode path and you'll get unicode filenames back.
--
Sincerely,
Chris Calloway http://nccoos.org/Members/cbc
office: 3313 Venable Hall phone: (919) 599-3530
mail: Campus Box #3300, UNC-CH, Chapel Hill, NC 27599
More information about the Tutor
mailing list