lxml.objectify drops leading whitespace
data:image/s3,"s3://crabby-images/ed8d7/ed8d7e68609eb981b65ba380b85b19ae47c36c93" alt=""
Hello, lxml.objectify drops leading whitespace before a tag:
lxml.etree.tostring(lxml.objectify.fromstring('<p> <a>foo</a></p>')) '<p><a>foo</a></p>'
while lxml.etree handles the same situation just fine:
lxml.etree.tostring(lxml.etree.fromstring('<p> <a>foo</a></p>')) '<p> <a>foo</a></p>'
What's the reason for the different behaviour? Is this intentional? Most importantly, how can I make lxml.objectify preserve whitespace correctly? Thanks for your help, Wolfgang
data:image/s3,"s3://crabby-images/8bbe6/8bbe681f08550d13b35a459376ee85cf203c1262" alt=""
Hi,
It is intentional. For data binding the text content of elements with children is (usually) irrelevant. IIRC this only drops whitespace for elements with children, not for elements with text-only content. You should be able to change objectify's behaviour by changing it's default parser using makeparser() and set_default_parser: def set_default_parser(new_parser = None): u"""set_default_parser(new_parser = None) Replace the default parser used by objectify's Element() and fromstring() functions. The new parser must be an etree.XMLParser. Call without arguments to reset to the original parser. """ def makeparser(**kw): u"""makeparser(remove_blank_text=True, **kw) Create a new XML parser for objectify trees. You can pass all keyword arguments that are supported by ``etree.XMLParser()``. Note that this parser defaults to removing blank text. You can disable this by passing the ``remove_blank_text`` boolean keyword option yourself. """ Holger Landesbank Baden-Wuerttemberg Anstalt des oeffentlichen Rechts Hauptsitze: Stuttgart, Karlsruhe, Mannheim, Mainz HRA 12704 Amtsgericht Stuttgart
data:image/s3,"s3://crabby-images/8bbe6/8bbe681f08550d13b35a459376ee85cf203c1262" alt=""
Hi,
It is intentional. For data binding the text content of elements with children is (usually) irrelevant. IIRC this only drops whitespace for elements with children, not for elements with text-only content. You should be able to change objectify's behaviour by changing it's default parser using makeparser() and set_default_parser: def set_default_parser(new_parser = None): u"""set_default_parser(new_parser = None) Replace the default parser used by objectify's Element() and fromstring() functions. The new parser must be an etree.XMLParser. Call without arguments to reset to the original parser. """ def makeparser(**kw): u"""makeparser(remove_blank_text=True, **kw) Create a new XML parser for objectify trees. You can pass all keyword arguments that are supported by ``etree.XMLParser()``. Note that this parser defaults to removing blank text. You can disable this by passing the ``remove_blank_text`` boolean keyword option yourself. """ Holger Landesbank Baden-Wuerttemberg Anstalt des oeffentlichen Rechts Hauptsitze: Stuttgart, Karlsruhe, Mannheim, Mainz HRA 12704 Amtsgericht Stuttgart
participants (2)
-
Holger Joukl
-
Wolfgang Schnerring