[Tutor] getting text within an element [attribute text] using elementtree
ps python
ps_python3 at yahoo.co.in
Fri Jan 20 01:10:59 CET 2006
Dear group,
In my XML file, some elements carry text in an attribute.
I want to get that text. How can I do that?
I looked at the ElementTree "bits and pieces" page, and
there is a small function:
>>> def gettext(elem):
...     text = elem.text or ""
...     for e in elem:
...         text += gettext(e)
...         if e.tail:
...             text += e.tail
...     return text
>>> for m in tree.findall('//{org:hprd:dtd:hprdr2}proteinInteractor'):
...     k = gettext(m)
...     print k
...
>>>
I got nothing out of it.
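One possible reason for the empty output, judging from the XML excerpt further down: the elements there sit in the default namespace net:sf:psidev:mi (declared on entrySet), while the loop searches for {org:hprd:dtd:hprdr2}proteinInteractor, so findall may simply match nothing. A minimal sketch of that effect, assuming Python 3 and the standard-library xml.etree.ElementTree rather than the standalone elementtree package; SAMPLE is a hypothetical, trimmed-down document modelled on the excerpt:

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed-down sample modelled on the posted excerpt.
SAMPLE = """<interactions>
<entrySet xmlns="net:sf:psidev:mi" level="1" version="1">
<entry>
<interactorList>
<proteinInteractor id="HISTONEH3">
<names><shortLabel>Histone H3</shortLabel></names>
</proteinInteractor>
</interactorList>
</entry>
</entrySet>
</interactions>"""


def gettext(elem):
    """Collect the text of elem and of all its descendants."""
    text = elem.text or ""
    for e in elem:
        text += gettext(e)
        if e.tail:
            text += e.tail
    return text


root = ET.fromstring(SAMPLE)

# Namespace used in the post: no element in SAMPLE lives in it.
wrong = root.findall(".//{org:hprd:dtd:hprdr2}proteinInteractor")

# Namespace actually declared on <entrySet> in SAMPLE.
right = root.findall(".//{net:sf:psidev:mi}proteinInteractor")

print(len(wrong), len(right))
for m in right:
    print(gettext(m).strip())
```

Note that gettext only gathers element text; attribute values such as id are not part of it and have to be read separately.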
I tried a simpler approach:
>>> for m in tree.findall('//{org:hprd:dtd:hprdr2}proteinIn
...     k = m.findtext('{org:hprd:dtd:hprdr2}secondaryRef'
...     print k
...
>>> for m in tree.findall('//{org:hprd:dtd:hprdr2}proteinIn
...     print m
...
>>>
The relevant part of the XML file is listed below.
Could anyone help me, please?
I want to get the id value from elements such as:
<proteinInteractor id="HISTONEH3">
Thanks
w
<interactions>
<entrySet xmlns="net:sf:psidev:mi"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="net:sf:psidev:mi
http://psidev.sourceforge.net/mi/xml/src/MIF.xsd"
level="1" version="1">
<entry>
<source>
<names>
<shortLabel>HPRD</shortLabel>
<fullName>Human Protein Reference Database</fullName>
</names>
<bibref>
<xref>
<primaryRef db="PubMed" id="14525934"/>
<secondaryRef db="PubMed" id="14681466"/>
</xref>
</bibref>
</source>
<interactorList>
<proteinInteractor id="HISTONEH3">
<names>
<shortLabel>Histone H3</shortLabel>
</names>
<organism ncbiTaxId="9606">
<names>
<shortLabel>Human</shortLabel>
<fullName>Homo sapiens</fullName>
</names>
</organism>
</proteinInteractor>
<proteinInteractor id="ID_HPRD_00303">
<names>
<shortLabel>Minichromosome maintenance protein 2</shortLabel>
</names>
<organism ncbiTaxId="10090">
<names>
<shortLabel>Mouse</shortLabel>
<fullName>Mus musculus</fullName>
</names>
</organism>
</proteinInteractor>
<proteinInteractor id="ID_HPRD_04072">
<names>
<shortLabel>MCM3</shortLabel>
</names>
<organism ncbiTaxId="10090">
<names>
<shortLabel>Mouse</shortLabel>
<fullName>Mus musculus</fullName>
</names>
</organism>
</proteinInteractor>
<proteinInteractor id="ID_HPRD_09094">
<names>
<shortLabel>MCM4</shortLabel>
</names>
<organism ncbiTaxId="10090">
<names>
<shortLabel>Mouse</shortLabel>
<fullName>Mus musculus</fullName>
</names>
</organism>
</proteinInteractor>
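
For the id values asked about above, a minimal sketch: attribute values are read with Element.get() (or through the attrib mapping) rather than from elem.text. This assumes Python 3 and the standard-library xml.etree.ElementTree, plus a hypothetical, closed-off sample document modelled on the excerpt; SAMPLE, NS, and interactor_ids are illustrative names only.

```python
import xml.etree.ElementTree as ET

# Default namespace declared on <entrySet>, in ElementTree's {uri} notation.
NS = "{net:sf:psidev:mi}"

# Hypothetical sample: a shortened, properly closed version of the excerpt.
SAMPLE = """<interactions>
<entrySet xmlns="net:sf:psidev:mi" level="1" version="1">
<entry>
<interactorList>
<proteinInteractor id="HISTONEH3">
<names><shortLabel>Histone H3</shortLabel></names>
<organism ncbiTaxId="9606"/>
</proteinInteractor>
<proteinInteractor id="ID_HPRD_00303">
<names><shortLabel>Minichromosome maintenance protein 2</shortLabel></names>
<organism ncbiTaxId="10090"/>
</proteinInteractor>
</interactorList>
</entry>
</entrySet>
</interactions>"""


def interactor_ids(root):
    """Return the id attribute of every proteinInteractor under root."""
    return [el.get("id") for el in root.iter(NS + "proteinInteractor")]


root = ET.fromstring(SAMPLE)
ids = interactor_ids(root)
print(ids)

# The same elements also give access to child text and other attributes.
for el in root.iter(NS + "proteinInteractor"):
    label = el.findtext(NS + "names/" + NS + "shortLabel")
    taxid = el.find(NS + "organism").get("ncbiTaxId")
    print(el.get("id"), label, taxid)
```

For a file on disk, ET.parse(filename).getroot() would take the place of ET.fromstring(SAMPLE).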