[Tutor] extract information from txtfile
bob gailer
bgailer at gmail.com
Wed Oct 5 15:42:56 CEST 2011
On 10/4/2011 4:00 PM, Anna Olofsson wrote:
> Hi,
> I'm a beginner at Python and I'm trying to extract some information
> from a textfile where I have a lot of data.
>
> The table looks something like this, but with many more columns and rows:
>
> # chr Pos ID REF ALT . . . . . . . . . . .
> 1 13645 - C T
> . . . . .
> . . . . .
>
> I want to extract certain data from this table above and make it to a
> sentence looking something like this: chr1:13645 C/T
>
> I have created a pseudo code, just to get an overview of what I want
> to do, looking like this:
>
> import OS
> open vcf_file.vcf
> for each line
>     words = split(line)
>     # words[0] = 1 , words[1]=Pos , words[3]=REF , words[4]=ALT
>     if(words[0] not "#"):
>         # Sometimes the word starts with a hash and I don't want that to be included.
>         str="chr" + words[0] + ":" + words[1] + "\t" + words[3] + "/" + words[4]
>         score = OS.system(str)
> vcf_file.close()
>
> print result
>
> return score
>
>
> I'm not exactly sure how to proceed from here.
I suggest you convert the pseudo-code into a Python program, then run it.
If that gives you grief then tackle it one task at a time. For example,
open, read, print, close a file. Then add another level of complication,
for example open, read, split, print, close a file.
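As a rough sketch of those two stages (not the only way to do it), something like the following could serve as a starting point. The function names print_lines and split_lines are made up for illustration, and the file is assumed to be whitespace-separated plain text, as in the sample table.

```python
def print_lines(path):
    """Stage 1: open, read, print, close."""
    # The with statement closes the file automatically at the end of the block.
    with open(path) as handle:
        for line in handle:
            print(line.rstrip("\n"))


def split_lines(path):
    """Stage 2: open, read, split, print, close."""
    rows = []
    with open(path) as handle:
        for line in handle:
            words = line.split()  # split on any run of whitespace
            print(words)
            rows.append(words)
    return rows
```

Calling print_lines("vcf_file.vcf") first, and only then moving on to split_lines("vcf_file.vcf"), lets you check each step before adding the next one.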
The part that makes no sense to me is score = OS.system(str). What will
str be and what do you want system to return?
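For context, os.system runs its argument as a shell command and returns that command's exit status, so passing it a string like "chr1:13645" would ask the shell to execute that text. If the goal is only to build the sentences, a minimal sketch along these lines may be closer to what is wanted. The names format_variant and extract_variants are hypothetical, the column positions are taken from the comment in the pseudo-code, and I am assuming every data line has at least five whitespace-separated fields.

```python
def format_variant(words):
    # Column positions follow the pseudo-code:
    # words[0] = chromosome, words[1] = Pos, words[3] = REF, words[4] = ALT
    return "chr" + words[0] + ":" + words[1] + "\t" + words[3] + "/" + words[4]


def extract_variants(path):
    results = []
    with open(path) as handle:
        for line in handle:
            # Skip header lines that start with a hash, and blank lines.
            if line.startswith("#") or not line.strip():
                continue
            results.append(format_variant(line.split()))
    return results
```

The results are collected in a list and returned, so the caller can decide whether to print them or write them to another file.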
--
Bob Gailer
919-636-4239
Chapel Hill NC