counting character occurrences

Sat Jan 23 13:44:00 EST 2010

Op vrijdag 22 januari 2010 schreef Arnaud:

> Why not just start with (untested):
> 
> import codecs
> from collections import defaultdict
> 
> tcounters = defaultdict(int)
> f = codecs.open('/home/gavron/git/screen/src/screen.c', 'r', "utf-8")
> 
> for c in f.read():
>     tcounters[c] += 1
> 
> for c, n in tcounters.iteritems():
>     print "%r\t%i" % (c, n)

Or using Counter from Python3.1 collections:

import codecs
from collections import Counter

filename = '/home/gavron/git/screen/src/screen.c'
with codecs.open(filename, 'r', 'utf-8') as f:
	counted = Counter(f.read())

for c, n in counted:
	print(c, n, sep='\t')

with best regards,
Wilbert Berendsen

-- 
http://www.wilbertberendsen.nl/