[Python-bugs-list] [ python-Bugs-745145 ] xml.sax Contenthandler passes truncated data

SourceForge.net noreply@sourceforge.net
Sat, 14 Jun 2003 00:56:43 -0700


Bugs item #745145, was opened at 2003-05-28 22:11
Message generated for change (Comment added) made by loewis
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=745145&group_id=5470

Category: XML
Group: None
>Status: Closed
>Resolution: Invalid
Priority: 5
Submitted By: Guido Treutwein (gtreutwein)
Assigned to: Nobody/Anonymous (nobody)
Summary: xml.sax Contenthandler passes truncated data

Initial Comment:
Error-Message in Python xml.sax module

writeXml_1 generates a xml file ('testOut.xml' )containing 
random strings and checksums
'readXml_2 testOut.xml' reads the file and verifies the 
checksums

Observation: readXml_2 reports checksum errors. The 
error occurs on all tried combinations of (Win2000, Win98 
and OpenBSD 3.2) with (Python 2.2.2 and 2.3a1).
The corrupted strings reported are located shortly before 
64kByte boundaries in the file.

----------------------------------------------------------------------

>Comment By: Martin v. Löwis (loewis)
Date: 2003-06-14 09:56

Message:
Logged In: YES 
user_id=21627

This is a bug in your code. There is no guarantee that, for
a chunk of PCDATA text, there will only be one .characters
callback; the parser may split this up into multiple calls. 

With the attached modified readXml_2.py, the data validate
just fine.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=745145&group_id=5470