Why is .text_content() only in HTML?
data:image/s3,"s3://crabby-images/4cf20/4cf20edf9c3655e7f5c4e7d874c5fdf3b39d715f" alt=""
Frederik Elwert, 24.06.2014 11:20:
The usual way to do this is with tostring(), e.g. if you want a Unicode string for further processing: etree.tostring(element, method='text', encoding='unicode') Wrap it in a nicely named function and you're done. Alternatively, you can use an XPath expression, e.g. "string()" or "normalize-text()" and compile it into an XPath() callable to get the same thing. I generally consider a function better than a method here. Stefan
data:image/s3,"s3://crabby-images/f7f24/f7f247680a4034f86b661ae2d68487e21fa6c395" alt=""
Am 24.06.2014 18:45, schrieb Stefan Behnel:
Thanks, I was unaware of the "text" method of tostring().
Yes, I also thought about that.
I generally consider a function better than a method here.
I guess I don’t see much of a difference. The method in lxml.html just seemed so nice and intuitive that I looked for something similar. I am just teaching XML parsing with lxml, and ''.join(element.itertext()) wasn’t really easy to explain to beginners. Thanks again, Frederik
data:image/s3,"s3://crabby-images/4cf20/4cf20edf9c3655e7f5c4e7d874c5fdf3b39d715f" alt=""
Frederik Elwert, 24.06.2014 11:20:
The usual way to do this is with tostring(), e.g. if you want a Unicode string for further processing: etree.tostring(element, method='text', encoding='unicode') Wrap it in a nicely named function and you're done. Alternatively, you can use an XPath expression, e.g. "string()" or "normalize-text()" and compile it into an XPath() callable to get the same thing. I generally consider a function better than a method here. Stefan
data:image/s3,"s3://crabby-images/f7f24/f7f247680a4034f86b661ae2d68487e21fa6c395" alt=""
Am 24.06.2014 18:45, schrieb Stefan Behnel:
Thanks, I was unaware of the "text" method of tostring().
Yes, I also thought about that.
I generally consider a function better than a method here.
I guess I don’t see much of a difference. The method in lxml.html just seemed so nice and intuitive that I looked for something similar. I am just teaching XML parsing with lxml, and ''.join(element.itertext()) wasn’t really easy to explain to beginners. Thanks again, Frederik
participants (2)
-
Frederik Elwert
-
Stefan Behnel