Processing XML files in CJK encodings

gs gshibaya at
Thu Oct 21 23:39:27 CEST 2004

Python gurus,

I need to parse XML files in CJK encodings, such as GB2312 and Japanese
in UTF-8. I first tried xml.dom.minidom. It works with the Japanese UTF-8
files, but it doesn't work with GB2312. An article says,

Then I tried xml.parsers.xmlproc. It works fine with GB2312, but it
doesn't work with Japanese in UTF-8. Another article says,

Is there any way to parse both of them correctly?
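
One workaround that might cover both cases is to take the encoding decision
away from the parser: read the file as bytes, decode it with the encoding named
in the XML declaration, and hand the parser plain UTF-8. The sketch below
assumes present-day Python, where the standard codecs include gb2312, and an
ASCII-compatible encoding so that the declaration itself can be read as bytes.
The helper name parse_xml_bytes is hypothetical, not part of any library.

```python
import re
from xml.dom import minidom

# Matches an XML declaration at the very start of the document.
_XML_DECL = re.compile(rb'^<\?xml\s[^>]*\?>')
# Pulls the encoding name out of that declaration.
_ENCODING = re.compile(rb'encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']')


def parse_xml_bytes(raw, default_encoding="utf-8"):
    """Hypothetical helper: transcode raw XML bytes to UTF-8, then parse.

    Assumes the declared encoding is ASCII-compatible and known to Python.
    """
    encoding = default_encoding
    body = raw
    decl = _XML_DECL.match(raw)
    if decl:
        found = _ENCODING.search(decl.group(0))
        if found:
            encoding = found.group(1).decode("ascii")
        # Drop the declaration; without one, the parser assumes UTF-8.
        body = raw[decl.end():]
    text = body.decode(encoding)
    return minidom.parseString(text.encode("utf-8"))
```

With this, a GB2312 file and a Japanese UTF-8 file go through the same code
path, for example parse_xml_bytes(open(path, "rb").read()), where path is
whatever file is being read. A file whose encoding is declared incorrectly
will still fail at the decode step.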

