Hi Martin,

On Sun, 25 Nov 2012 23:01:03 +0000 Martin Mueller <martinmueller@northwestern.edu> wrote:
I have used lxml to extract attribute values specifying lemma and pos tag from some 2000 TEI-encoded texts. The program has to chew its way through ~160 million instances of the following type of w element:
<w lem="be" pos="vbz" reg="is" spe="is" xml:id="K000039_000-000220">is</w>
[...]
I am using lxml 2.3.4, which is part of the Enthought Python 2.7 distribution. [...]
Have you tried it with the latest version (3.0.1)? Which versions of libxml2 and libxslt are you using? Both libraries have received some important updates recently, so they might be part of your issue.

Maybe you will find something useful in the changelogs:

http://lxml.de/3.0/changes-3.0.1.html
http://xmlsoft.org/news.html

Hope this helps.

--
Gruß/Regards,
Thomas Schraitle
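
P.S. In case it helps with the version question, here is a minimal sketch for printing the versions in use. It assumes the version constants that lxml.etree exposes (LXML_VERSION, LIBXML_VERSION, LIBXSLT_VERSION and their *_COMPILED_* counterparts). The helper names format_version and collect_versions are made up for illustration, and the function returns an empty dict if lxml cannot be imported.

```python
def format_version(version_tuple):
    """Turn a version tuple such as (3, 0, 1, 0) into the string '3.0.1.0'."""
    return ".".join(str(part) for part in version_tuple)


def collect_versions():
    """Return the lxml, libxml2 and libxslt versions, or {} if lxml is missing."""
    try:
        from lxml import etree
    except ImportError:
        return {}
    return {
        "lxml": format_version(etree.LXML_VERSION),
        # "runtime" is the library actually loaded, "compiled" is the one
        # lxml was built against; the two can differ.
        "libxml2 (runtime)": format_version(etree.LIBXML_VERSION),
        "libxml2 (compiled)": format_version(etree.LIBXML_COMPILED_VERSION),
        "libxslt (runtime)": format_version(etree.LIBXSLT_VERSION),
        "libxslt (compiled)": format_version(etree.LIBXSLT_COMPILED_VERSION),
    }


if __name__ == "__main__":
    for name, version in sorted(collect_versions().items()):
        print("%s: %s" % (name, version))
```

If the runtime and compiled versions differ, that would be worth mentioning in your reply as well.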