[Tutor] calculate the sum of a variable - python
Wayne Werner
waynejwerner at gmail.com
Mon Mar 7 04:44:00 CET 2011
On Sun, Mar 6, 2011 at 9:31 PM, nookasree ponamala <nookasree at yahoo.com>wrote:
> Hi :
>
> I'm a Senior SAS Analyst. I'm trying to learn Python. I would appreciate if
> anybody could help me with this. It works fine if I give input instead of
> reading a text file. I don't understand where I'm going wrong.
>
> I'm trying to read a text file and find out the following:
> 1. Sum of amt for each id
> 2. Count of id
> 3. minimum of date1
> 4. maximum of date1
>
> Here is the sample text file:
>
> test.txt file:
>
> bin1 cd1 date1 amt cd id cd2
> 452 2 2010-02-20 $23.26 0 8100059542 06107
> 452 2 2010-02-20 $20.78 0 8100059542 06107
> 452 2 2010-02-24 $5.99 2 8100839745 20151
> 452 2 2010-02-12 $114.25 7 8100839745 98101
> 452 2 2010-02-06 $28.00 0 8101142362 06032
> 452 2 2010-02-09 $15.01 0 8100274453 06040
> 452 18 2010-02-13 $113.24 0 8100274453 06040
> 452 2 2010-02-13 $31.80 0 8100274453 06040
>
>
> Here is the code I've tried out to calculate sum of amt by id:
>
> import sys
> from itertools import groupby
> from operator import itemgetter
> t = ()
> tot = []
> for line in open ('test.txt','r'):
> aline = line.rstrip().split()
> a = aline[5]
> b = (aline[3].strip('$'))
> t = (a,b)
> t1 = str(t)
> tot.append(t1)
> print tot
> def summary(data, key=itemgetter(0), value=itemgetter(1)):
> for k, group in groupby(data, key):
> yield (k, sum(value(row) for row in group))
>
> if __name__ == "__main__":
> for id, tot_spend in summary(tot, key=itemgetter(0),
> value=itemgetter(1)):
> print id, tot_spend
>
>
> Error:
> Traceback (most recent call last):
> File "<stdin>", line 2, in <module>
> File "<stdin>", line 3, in summary
> TypeError: unsupported operand type(s) for +: 'int' and 'str'
>
Of course I first have to commend you for including the full traceback with
the code because it makes this entirely easy to answer.
In general, the traceback tells you the most important stuff last, so I'll
start with this line:
> TypeError: unsupported operand type(s) for +: 'int' and 'str'
That tells us that the problem is you are trying to use + (addition) on an
integer and a string - which you can't do because of the type mismatch
(TypeError).
The next line
> File "<stdin>", line 3, in summary
tells us that the error occurred on line3 in summary:
1 | def summary(data, key=itemgetter(0), value=itemgetter(1)):
2 | for k, group in groupby(data, key):
3 | yield (k, sum(value(row) for row in group))
Well, there's no '+', but you do have 'sum', which uses addition under the
hood. So how do you go about fixing it? Well, you change the value getting
passed to sum to an integer (or other number):
sum(int(value(row)) for row in group)
Should either fix your problem, or throw a differen error if you try to
convert a string like 'Hello' to an integer. (Alternatively, use float if
you're interested in decimals)
HTH,
Wayne
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/tutor/attachments/20110306/e17b9314/attachment.html>
More information about the Tutor
mailing list