[issue38011] xml.dom.pulldom splits text data at buffer size when parsing from file

Mon Sep 2 13:39:20 EDT 2019

Stefan Behnel <stefan_ml at behnel.de> added the comment:

I don't see anything inherently wrong with having multiple text nodes.

In fact, input with very large text content can be considered a security threat (c.f. compression bombs), so a tool like pulldom (which is designed for incremental processing) should not start collecting more content than the user asked for. Getting multiple text nodes in some cases seems an ok-ish price to pay.

A documentation PR is welcome.

----------
assignee:  -> docs at python
components: +Documentation
nosy: +docs at python
versions: +Python 3.8, Python 3.9

_______________________________________
Python tracker <report at bugs.python.org>
<https://bugs.python.org/issue38011>
_______________________________________