[pypy-issue] Issue #2205: Segfault on pyexpat.XMLParserType (pypy/pypy)

Silvio Ricardo Cordeiro issues-reply at bitbucket.org
Fri Dec 11 08:39:50 EST 2015


New issue 2205: Segfault on pyexpat.XMLParserType
https://bitbucket.org/pypy/pypy/issues/2205/segfault-on-pyexpatxmlparsertype

Silvio Ricardo Cordeiro:

The attached code segfaults on PyPy 4.1.0 for any sufficiently large input XML I have tried.  The problem seems to be the following expression:

self._parser.CurrentColumnNumber

The exact iteration where the segfault happens is unpredictable (but it always happens after processing around 5MB of XML).  I have 30GB free memory, so it should not be an OOM problem.

I understand the access through _start_list and _parse is not documented, but the crash seems to happen only when reading CurrentColumnNumber, which is a supported attribute for xmlparser objects.  The segfault also happens with CurrentLineNumber.

Any ideas on what's going on?  Should I provide a sample XML displaying the bug?




More information about the pypy-issue mailing list