[PyPy-issue] [issue578] expat parser incorrectly decodes data from expat C library

Simon Cross pypy-dev-issue at codespeak.net
Mon Nov 29 13:44:24 CET 2010


New submission from Simon Cross <hodgestar at gmail.com>:

The expat parser incorrectly decodes character data passed to event handlers
using self.encoding when this data is always encoded as UTF-8.

See http://www.xml.com/pub/a/1999/09/expat/index.html: "... although expat may
accept input in various encodings, the strings that it passes to the handlers
are always encoded in UTF-8."

----------
effort: easy
files: expat-test-and-fix-r79617.diff
messages: 1886
nosy: hodgestar, pypy-issue
priority: bug
release: 1.4
status: unread
title: expat parser incorrectly decodes data from expat C library

_______________________________________________________
PyPy development tracker <pypy-dev-issue at codespeak.net>
<https://codespeak.net/issue/pypy-dev/issue578>
_______________________________________________________
-------------- next part --------------
A non-text attachment was scrubbed...
Name: expat-test-and-fix-r79617.diff
Type: text/x-diff
Size: 1513 bytes
Desc: not available
URL: <http://mail.python.org/pipermail/pypy-issue/attachments/20101129/b1709abc/attachment.diff>


More information about the Pypy-issue mailing list