How to read such file and sumarize the data?
huisky
huisky at gmail.com
Thu Nov 18 04:13:47 EST 2010
Thank you very much for your reply.
I think you just count the total number of NCPU used for each user.
And it does NOT show how much time used for each user.
Huisky
On Nov 18, 12:10 am, Steve Holden <st... at holdenweb.com> wrote:
> On 11/17/2010 4:45 PM, huisky wrote:> Say I have following log file, which records the code usage.
> > I want to read this file and do the summarize how much total CPU time
> > consumed for each user.
> > Is Python able to do so or say easy to achieve this?, anybody can give
> > me some hints, appricate very much!
>
> > Example log file.
> > **************************************************************************************
>
> I'm assuming the following (unquoted) data is in file "data.txt":
>
>
>
> > LSTC license server version 224 started at Sun Dec 6 18:56:48 2009
> > using configuration file /usr/local/lstc/server_data
> > xyz 15... at trofast3.marin.ntnu.no LS-DYNA_971 NCPU=1 started Sun Dec 6
> > 18:57:40
> > 15... at trofast3.marin.ntnu.no completed Sun Dec 6 19:42:55
> > xyz 15... at trofast3.marin.ntnu.no LS-DYNA_971 NCPU=2 started Sun Dec 6
> > 20:17:02
> > 15... at trofast3.marin.ntnu.no completed Sun Dec 6 20:26:03
> > xyz 18... at trofast2.marin.ntnu.no LS-DYNA_971 NCPU=1 started Sun Dec 6
> > 21:01:17
> > 18... at trofast2.marin.ntnu.no completed Sun Dec 6 21:01:28
> > tanhoi 5... at iimt-tanhoi-w.ivt.ntnu.no LS-DYNA_971 NCPU=1 started Mon
> > Dec 7 09:31:00
> > 5... at iimt-tanhoi-w.ivt.ntnu.no presumed dead Mon Dec 7 10:36:48
> > sabril 18... at trofast2.marin.ntnu.no LS-DYNA_971 NCPU=2 started Mon
> > Dec 7 13:14:47
> > 18... at trofast2.marin.ntnu.no completed Mon Dec 7 13:24:07
> > sabril 18... at trofast2.marin.ntnu.no LS-DYNA_971 NCPU=2 started Mon
> > Dec 7 14:21:34
> > sabril 18... at trofast2.marin.ntnu.no LS-DYNA_971 NCPU=2 started Mon
> > Dec 7 14:28:42
> > 18... at trofast2.marin.ntnu.no killed Mon Dec 7 14:31:48
> > 18... at trofast2.marin.ntnu.no killed Mon Dec 7 14:32:06
>
> The line wrapping being wrong shouldn't affect the logic.
>
> $ cat data.py
> lines = open("data.txt").readlines()
> from collections import defaultdict
> c = defaultdict(int)
> for line in lines:
> ls = line.split()
> if len(ls) > 3 and ls[3].startswith("NCPU="):
> amt = int(ls[3][5:])
> c[ls[0]] += amt
> for key, value in c.items():
> print key, ":", value
>
> $ python data.py
> xyz : 4
> tanhoi : 1
> sabril : 6
>
> regards
> Steve
> --
> Steve Holden +1 571 484 6266 +1 800 494 3119
> PyCon 2011 Atlanta March 9-17 http://us.pycon.org/
> See Python Video! http://python.mirocommunity.org/
> Holden Web LLC http://www.holdenweb.com/
More information about the Python-list
mailing list