[New-bugs-announce] [issue10149] Data truncation in expat parser
Maciek J
report at bugs.python.org
Wed Oct 20 03:43:21 CEST 2010
New submission from Maciek J <egil at wp.pl>:
Not sure if this is a Python problem or an expat problem, but I get truncated data while parsing XML documents.
This particular project is for parsing an XML file of Wikipedia dump.
The attached files are:
* xml-parse-revisions.py - parser script
* revision-test.xml - input XML
* revision-test.xml.sql - output XML
* revision_create.sql - not really needed for this test case, but attached for completeness
You can notice that the output file sometimes contains too short values for the "timestamp". Also note that if you add whitespace to the input XML, then different timestamps will be truncated.
My Python is 2.6.6.
----------
components: XML
files: pyxml_error.zip
messages: 119184
nosy: Maciek.J
priority: normal
severity: normal
status: open
title: Data truncation in expat parser
versions: Python 2.6
Added file: http://bugs.python.org/file19292/pyxml_error.zip
_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue10149>
_______________________________________
More information about the New-bugs-announce
mailing list