[Doc-SIG] Ambiguity in default output for publish_string
g.brandl at gmx.net
Wed Mar 4 19:06:43 CET 2009
Jeffrey C. Jacobs schrieb:
> However, when publish_string is called to output the tree for both of
> these expressions, they both return:
> <document source="<string>">
> This paragraph has a very funny
> after that word, right?
> which is not different. As far as I can tell, the internal node structure
> is correct, it's just when the node structure is displayed in string form,
> the default function of publish_string. Since this output is a
> serialization of the node structure, it seems that the output to
> publish_string should not be ambiguous in terms of what it truly
> represents. Or, is there a better way to represent the internal doc tree
> unambiguously as a string?
What you see there is the "pseudo-XML" output format, which is nice for a
quick view but not unambiguous.
Try publish_string(..., writer_name='xml') for real XML output which is
unambiguous in all cases.
More information about the Doc-SIG