
Oct. 5, 2005
6:40 p.m.
Walter Dörwald wrote:
OK, here's a patch that implements this enhancement to PyUnicode_DecodeCharmap(): http://www.python.org/sf/1313939
Looks nice!
Creating the decoding_map as a string should probably be done by gencodec.py directly. This way the first import of the codec would be faster too.
Hmm. How would you represent the string in source code? As a Unicode literal? With \u escapes, or in a UTF-8 source file? Or as a UTF-8 string, with an explicit decode call? I like the current dictionary style for being readable, as it also adds the Unicode character names into comments. Regards, Martin