Processing XML files in CJK encodings
gshibaya at gmail.com
Thu Oct 21 23:39:27 CEST 2004
I need to parse XML files in CJK encodings like GB2312 and Ja in UTF-8.
I was using xml.dom.minidom first. It works with Ja in UTF-8, but doesn't
work with GB2312. An article says,
Then I tried xml.parsers.xmlproc. It works fine with GB2312, but now it
doesn't work with Ja in UTF-8. Another article says,
Is there any way to parse both of them correctly?
More information about the Python-list