elementtree and gbk encoding
Diez B. Roggisch
deets at nospam.web.de
Tue Mar 14 17:45:19 EST 2006
> Here's what I get with the prepending hack:
>
> >>> et.fromstring('<?xml version="1.0" encoding="gbk"?>\n' +
> open(filename).read())
> Traceback (most recent call last):
> File "<interactive input>", line 1, in ?
> File "C:\Program
> Files\Python\lib\site-packages\elementtree\ElementTree.py", line 960, in
> XML
> parser.feed(text)
> File "C:\Program
> Files\Python\lib\site-packages\elementtree\ElementTree.py", line 1242,
> in feed
> self._parser.Parse(data, 0)
> ExpatError: unknown encoding: line 1, column 30
>
>
> Are the XML encoding names different from the Python ones? The "gbk"
> encoding seems to work okay from Python:
I had similar trouble with cElementTree and cp1252 encodings. But
upgrading to a more recent version helped. Did you try parsing with e.g.
sax?
Diez
More information about the Python-list
mailing list