Python / Chinese Encodings

Martin v. Löwis martin at v.loewis.de
Sun Sep 14 18:02:57 EDT 2003


"Achim Domma" <domma at procoders.net> writes:

> I  need to convert Big5 or GB encoded chinese strings to unicode. It would
> be also nice to be able to detect the encoding of the original string.
> Search with groups.google.com I found some links to different projects but
> they all look not very active. Can somebody give me a short overview of the
> status of processing chinese texts with python?

The very short summary: Use the CJK codecs package; it supports all
encodings you might encounter, and it is actively maintained.

As for detecting the encoding of the original string: Forget it. Tell
your communication partners to always properly declare the encoding.

Regards,
Martin





More information about the Python-list mailing list